Create a gist now

Instantly share code, notes, and snippets.

What would you like to do?
Asynchronous requests in Flask with gevent
"""Asynchronous requests in Flask with gevent"""
from time import time
from flask import Flask, Response
from gevent.pywsgi import WSGIServer
from gevent import monkey
import requests
# need to patch sockets to make requests async
CHUNK_SIZE = 1024*1024 # bytes
app = Flask(__name__) # pylint: disable=invalid-name
app.debug = True
def seattle(requests_counter=[0]): # pylint: disable=dangerous-default-value
"""Asynchronous non-blocking streaming of relatively large (14.5MB) JPG
of Seattle from wikimedia commons.
requests_counter[0] += 1
request_num = requests_counter[0]
url = ''
app.logger.debug('started %d', request_num)
rsp = requests.get(url, stream=True)
def generator():
"streaming generator logging the end of request processing"
yield '' # to make greenlet switch
for data in rsp.iter_content(CHUNK_SIZE):
yield data
app.logger.debug('finished %d', request_num)
return Response(generator(), mimetype='image/jpeg')
def main():
"Start gevent WSGI server"
# use gevent WSGI server instead of the Flask
http = WSGIServer(('', 5000), app.wsgi_app)
# TODO gracefully handle shutdown
if __name__ == '__main__':

Thanks for your demo. BTW, can I achieve the same non-blocking effect in a long time-consuming work? For example, my api /longtime will do 10 same backgroud job in multi-thread mode, and each job is time-cunsuming but returns nothing.

Thank for your sharing , but i found that not worked as async. One request start to be served after previous one done.

I found this only work for stream reading. If you want async, queue and multiprocessing would be a good start.


sergray commented Feb 7, 2018

Yes, there is actually a problem with requests library working in a blocking mode, so while flask accept several HTTP requests, responses are streamed with concurrency 1.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment