Add streaming backpressure #152

Open
pgjones opened this issue Jul 7, 2022 · 4 comments · May be fixed by #427

Comments

@pgjones
Member

pgjones commented Jul 7, 2022

Some more context on when this is relevant.

flowchart LR
    httpx-->|slow network| server
      subgraph one[Host 1]
        client-->|localhost\nconnection| quart
        subgraph quart[Quart app]
          httpx[httpx async\nclient]
        end
      end
      subgraph two[Host 2]
        server
      end

In this configuration, the Quart app acts as a proxy. As the httpx client is slowly consuming data because of the network speed, Quart buffers incoming data which is being uploaded at a very high speed because of a localhost connection. For huge payloads, this results in either memory allocation errors, or the out-of-memory killer killing the app.

^ From @andrewsh
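For illustration, a minimal sketch of the proxy shape described above (the route name and upstream URL are hypothetical, not the original reporter's code), assuming Quart and httpx:

from quart import Quart, request
import httpx

app = Quart(__name__)

@app.route("/upload", methods=["POST"])
async def proxy_upload():
    async def body():
        # Chunks arrive as fast as the localhost client can send them; if the
        # upstream write is slow, Quart currently buffers them all in memory.
        async for chunk in request.body:
            yield chunk

    async with httpx.AsyncClient() as client:
        upstream = await client.post("http://host-2.example/upload", content=body())
    return upstream.text, upstream.status_code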

@laggardkernel
Contributor

laggardkernel commented Aug 3, 2022

Doesn't look like a problem for Quart, the ASGI app.

ASGI separates the ASGI server from the ASGI app. All the ASGI app gets is scope, receive(), and send(). An app reads request data and returns a response. The ASGI server handles the connection and the reading and sending of data.

In my understanding, "adding streaming backpressure" means calling transp.pause_reading() in the ASGI app. This seems to contradict the ASGI abstraction above.

From what I've read:

In asyncio, the transport defines pause_reading() and resume_reading(), which are called by the protocol.
The protocol defines pause_writing() and resume_writing(), which are called by the transport.
This is because, from the view of an ASGI app, the app reads request data from the transport, and response data is sent from the app.

ASGI servers like uvicorn and hypercorn add more layers on top of the protocol,
e.g. RequestResponseCycle (wrapping await app()) in uvicorn, and HTTPStream and Context.spawn_app() in hypercorn.
Access to transp.pause_reading() and resume_reading() should be extended into these new layers in ASGI servers. Likewise, pause_writing() and resume_writing() should be decided by the new innermost layer rather than the protocol.

uvicorn introduces a FlowControl layer to share read and write availability between the protocol and RequestResponseCycle. The protocol pauses reading if the read buffer RequestResponseCycle.body exceeds HIGH_WATER_LIMIT; once receive() is called by the ASGI app, it resumes reading.
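Roughly that pattern, as a simplified sketch (not uvicorn's actual code; the names and threshold are illustrative):

import asyncio

HIGH_WATER_LIMIT = 64 * 1024  # illustrative threshold

class FlowControl:
    # Shares read availability between the protocol and the request cycle.
    def __init__(self, transport: asyncio.Transport) -> None:
        self._transport = transport
        self.read_paused = False

    def pause_reading(self) -> None:
        if not self.read_paused:
            self.read_paused = True
            self._transport.pause_reading()

    def resume_reading(self) -> None:
        if self.read_paused:
            self.read_paused = False
            self._transport.resume_reading()

# In the protocol: after buffering incoming data, pause once the buffer grows:
#     if len(cycle.body) > HIGH_WATER_LIMIT:
#         flow.pause_reading()
# In the cycle's receive(): once the app has taken the buffered body:
#     flow.resume_reading()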

The backpressure thing sounds like an ASGI server thing.

@pgjones
Member Author

pgjones commented Aug 3, 2022

It is both. At the moment there is no way for the app to place backpressure on the server. In the ASGI setup this could be done, for example, by not awaiting receive whilst the app catches up.
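For illustration, a rough sketch of a pull-based receive on the server side (hypothetical helper; a real server also has to handle HTTP framing):

import asyncio

def make_receive(reader: asyncio.StreamReader, chunk_size: int = 64 * 1024):
    # Hypothetical: the server reads from the socket only when the app awaits
    # receive(), so an app that delays awaiting receive() naturally stops the
    # server from pulling more data off the connection.
    async def receive() -> dict:
        data = await reader.read(chunk_size)
        return {"type": "http.request", "body": data, "more_body": bool(data)}

    return receive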

@lordmauve

Is this about streaming uploads?

I'm seeing 413 errors due to hitting MAX_CONTENT_LENGTH, even though I'm streaming the body to disk. The docs say

This allows larger bodies to be received and consumed if desired. The key being that the data is consumed and not otherwise stored in memory. An example is,

async def route():
    async for data in request.body:
        # Do something with the data
        ...

it is advisable to add a timeout within each chunk if streaming the request.

But this is not working: it's clear that Body.append() is sync, simply appends to the buffer, and then fails with HTTP 413 once MAX_CONTENT_LENGTH is exceeded. Because it is sync it cannot block when the buffer is "full", so it doesn't transmit backpressure from the application code.

The backpressure would need to be applied here at the ASGI layer such that if the request body buffer is not draining then we don't accept a new ASGI message on this connection.
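One way to picture that (a hypothetical bounded buffer, not Quart's actual Body class):

import asyncio

class BoundedBody:
    # Hypothetical: append() is a coroutine that blocks once the queue is full,
    # so the ASGI layer stops awaiting the next receive() until the handler has
    # consumed some chunks.
    def __init__(self, max_chunks: int = 16) -> None:
        self._chunks: asyncio.Queue = asyncio.Queue(maxsize=max_chunks)

    async def append(self, data: bytes) -> None:
        await self._chunks.put(data)  # blocks when max_chunks chunks are buffered

    async def __aiter__(self):
        while True:
            chunk = await self._chunks.get()
            if not chunk:  # empty bytes used as an end-of-body marker
                return
            yield chunk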

@Wyzard256

I've encountered this problem too. The root of it is that receiver_task runs concurrently with handler_task, consuming input from the client and appending it to a bytearray regardless of whether the app's handler is ready to use it. In an application that handles large file uploads, I've had my async for chunk in request.body loop receive chunks more than 5 gigabytes in size, just because that's how much the receiver_task accumulated into the bytearray while the handler_task was working on the previous (also very large) chunk.

Ideally, receiver_task and the handle_messages function shouldn't exist at all. Instead, the ASGIReceiveCallable and the functionality of handle_messages should be embedded into the request wrapper, so that in an async for chunk in request.body loop, the request body performs one await receive() on the ASGI callable per iteration and yields the bytes to the loop instead of appending them to a bytearray.

(This would involve the ASGIHTTPConnection._create_request_from_scope method taking a receive parameter in addition to the send parameter. It would also simplify the implementation of ASGIHTTPConnection.__call__, since there'd be only one task to await: no need for asyncio.wait(...) or cancel_task, and probably no need for asyncio.ensure_future either.)
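A sketch of that shape (hypothetical class; it assumes the ASGI receive callable is handed to the request wrapper):

class StreamingBody:
    # Hypothetical: each iteration performs exactly one await receive() and
    # yields the chunk directly, so no background task accumulates data the
    # handler has not asked for yet.
    def __init__(self, receive) -> None:
        self._receive = receive

    async def __aiter__(self):
        more_body = True
        while more_body:
            message = await self._receive()
            more_body = message.get("more_body", False)
            body = message.get("body", b"")
            if body:
                yield body

Usage inside a handler would then look the same as it does today: async for chunk in request.body, with each chunk coming straight from a single receive() call.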
