Overhaul `solve_dependencies()` for performance #5554

kristjanvalur · 2022-10-28T16:28:56Z

This pr provides an overhaul of solve_dependencies(). It is aimed at fixing performance and increasing readability.

A hidden performance bug is removed, when an extra level of dependencies is traversed, even if the cache is hit.
A hidden performance bug is removed when a new Depends is created for every dependency if there are any dependency overrides.
The evaluation of builtin-dependencies is moved into specialized callables/closures to avoid a lengthy series of if statements for every Dependency evaluation
the call-context of dependency solving is moved into its own DependencySolverContext dataclass avoiding a lot of boilerplate code in the main solve_dependencies() function

A new test is added to test deep Depends hierarchies and caching.

A specialized unittests is also added which uses the timeit module to measure performance of Depends processing. We run an AsyncClient to reduce overhead, for complete requests. Also direct evaluation of the endpoint dependencies, both for Depends and also Request . (The test does nothing by default using if False:. Could also add a pytest switch to enable it.)

On a local PC, I measure, with the pytest_cahce_perf unittest enabled and pytest --quiet -k test_deep_cache_perf unittest:

before:

tests/test_dependency_perf.py deep cache client requests
did 1000 calls in 0.3157117000000653 seconds
time per call: 0.32ms, rate: 3167.45/s
deep cache direct solve
did 2000 calls in 0.2325039000002107 seconds
time per call: 0.12ms, rate: 8602.01/s
request direct solve
did 50000 calls in 0.3208132000004298 seconds
time per call: 0.01ms, rate: 155853.94/s

after:

tests/test_dependency_perf.py deep cache client requests
did 1000 calls in 0.2511885999997503 seconds
time per call: 0.25ms, rate: 3981.07/s
deep cache direct solve
did 5000 calls in 0.2859181999997418 seconds
time per call: 0.06ms, rate: 17487.52/s
request direct solve
did 100000 calls in 0.3904711000000134 seconds
time per call: 0.00ms, rate: 256100.90/s

Performance of deep Depends hierarchies is doubled, mostly due to the fixing of the cache mechanic. But simple dependencies, like the Request, are also faster, since the whole depdendency evaluation loop is simpler, and we use dispatch functions to fill the parameters rather than switch statements.

Running the entire testsuite, with:
pytest --quiet tests
reports 28.4s before and 28.1 after the change. The whole testsuite however typically creates an app and runs the endpoint once. Also, there is tremendous overhead in using TestClient rather than AsyncClient for each endpoint call. The optimizations are expected to trade app init time for endpoint execution time.

github-actions · 2022-10-28T16:33:26Z

📝 Docs preview for commit da025ba at: https://635c04265b1bfe0e67529590--fastapi.netlify.app

github-actions · 2022-10-28T16:44:57Z

📝 Docs preview for commit 82a7ed5 at: https://635c06dc1f721607753f1568--fastapi.netlify.app

github-actions · 2022-10-28T17:22:47Z

📝 Docs preview for commit 76338e2 at: https://635c0fbb1f24421dadf274eb--fastapi.netlify.app

codecov · 2022-10-28T17:31:18Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (cf73051) 100.00% compared to head (059a07f) 100.00%.
Report is 1074 commits behind head on master.

❗ Current head 059a07f differs from pull request most recent head 4336a43. Consider uploading reports for the commit 4336a43 to get more accurate results

Additional details and impacted files

@@            Coverage Diff            @@
##            master     #5554   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files          540       541    +1     
  Lines        13969     14064   +95     
=========================================
+ Hits         13969     14064   +95

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

github-actions · 2022-10-28T17:37:24Z

📝 Docs preview for commit 80e12c2 at: https://635c131d2c15d81992024860--fastapi.netlify.app

github-actions · 2022-10-28T17:42:47Z

📝 Docs preview for commit 040cd2a at: https://635c146a802d901f8f3e73c7--fastapi.netlify.app

github-actions · 2022-10-28T17:46:47Z

📝 Docs preview for commit 4506386 at: https://635c1552802d90211c3e7302--fastapi.netlify.app

github-actions · 2022-10-31T12:01:43Z

📝 Docs preview for commit 53fcbdb at: https://635fb8f3c22353274c583c28--fastapi.netlify.app

github-actions · 2022-10-31T14:48:51Z

📝 Docs preview for commit 395c3d4 at: https://635fe01e1f72164ed13f158a--fastapi.netlify.app

github-actions · 2022-10-31T16:06:45Z

📝 Docs preview for commit 22585d7 at: https://635ff257043bc468f7e37821--fastapi.netlify.app

github-actions · 2022-10-31T16:36:18Z

📝 Docs preview for commit 8ad762f at: https://635ff94f43976500ba2067d3--fastapi.netlify.app

github-actions · 2022-10-31T21:34:30Z

📝 Docs preview for commit 0e6d410 at: https://63603f299481443aa9c033c7--fastapi.netlify.app

github-actions · 2022-10-31T21:58:12Z

📝 Docs preview for commit 1a2d498 at: https://636044c4e0908c0077cd5f54--fastapi.netlify.app

github-actions · 2022-10-31T22:07:19Z

📝 Docs preview for commit 059a07f at: https://636046dc56950d004e41a6d5--fastapi.netlify.app

This avoids unnecessary recursion which is subsequently discarded.

…eded.

The documentation does not list `Response` as as valid dependency for WebSocket handlers.

…rameter to endpoints.

github-actions · 2022-11-04T14:09:18Z

📝 Docs preview for commit 7c30b54 at: https://63651ce7c401f802a191f702--fastapi.netlify.app

odiseo0 · 2022-11-04T14:21:11Z

fastapi/dependencies/utils.py

@@ -454,77 +455,76 @@ async def solve_generator(
    return await stack.enter_async_context(cm)


+@dataclasses.dataclass
+class DependencySolverContext:


Why are you using dataclases.dataclass here instead of Pydantic?

I just want to know if there's a reason in particular

Dataclass is basically a quick way to declare a class with initializers. I'm not looking for pydantic's validation. This is supposed to be low overhead.
Could have used a plain class instead and written my own __init__().
Could also have made the solve_dependencies() an instance method, but I don't see much of that kind of programming style in FastAPI so I left it functional.

Indeed, very good PR. I really liked it.

github-actions · 2022-12-01T14:11:37Z

📝 Docs preview for commit 63dfd34 at: https://6388b5ef29572b004e385306--fastapi.netlify.app

github-actions · 2022-12-16T21:42:44Z

📝 Docs preview for commit 1800be7 at: https://639ce634abc4ab0873a92fe0--fastapi.netlify.app

github-actions · 2022-12-18T10:43:54Z

📝 Docs preview for commit 1b332f2 at: https://639eeec6e97dd36863989e88--fastapi.netlify.app

sk- · 2022-11-06T22:37:28Z

fastapi/dependencies/utils.py

+        context.values = values
+        context.errors = errors
+        for dependency_getter in dependant.dependency_getters:
+            if inspect.iscoroutinefunction(dependency_getter):


Would it make sense to have two list of getters in order to avoid the iscoroutine check?

Also if it's done in that way, one could easily use gather instead of waiting each coroutine individually.

Sure, we can do that, but see my other comment about automatically gathering. We'd have to to some performance tests to make sure it doesn't slow down simple getters.

Ah, I'm looking at the code, and these are the "built-in" dependencies (Headers, params, etc)
There is only one type of async built-in dependency, and that tis the get_body_params().
This is used for Body() type parameters. And the only kind of Body() parameter where anything async happens is the File() one. FastAPI already schedules FILE sequence reads internally... and there is only one getter created, for all the Body type params. but I'll split it into a separate method and get rid of the iscoroutinefunction test.

There are so few comments in the code, it takes a lot of effort to reverse-engineer this :)

sk- · 2022-11-06T22:43:14Z

fastapi/dependencies/utils.py


-        solved_result = await solve_dependencies(
-            request=request,
+        sub_values, sub_errors = await solve_dependencies(


Should we try to run all these subtasks in parallel? Same for the code below

if is_gen_callable(call) or is_async_gen_callable(call): stack = context.request.scope.get("fastapi_astack") assert isinstance(stack, AsyncExitStack) solved = await solve_generator( call=call, stack=stack, sub_values=sub_values ) elif is_coroutine_callable(call): solved = await call(**sub_values) else: solved = await run_in_threadpool(call, **sub_values) if sub_dependant.name is not None: values[sub_dependant.name] = solved

See comment #5472 (comment)

It's tricky. The thing is often dependencies don't do any IO at all, but they are marked async to be not marked sync. All sync dependencies get run from a threadpool, because they potentially block. This is a huge performance waste for a dependency which does something relatively simple without any IO.

Similarly, calling asyncio.gather on a bunch of async dependencies which don't do any IO is potentially quite wasteful. It means that each will have a Task created and that we have to go and have a whole loop through the event loop while these Tasks are scheduled and run to completion. This will cause some unnecessary latency.

So, I'm not super convinced that it should be automatically done. I think we should leave that up to the application to decide if multiple dependencies can be wrapped in a single common dependency and then explicitly call asyncio.gather

At least, we'd need to do some performance testing before deciding to go down that route. And I'd like for it to be a separate PR, because that would be a significant change in logic.

sk- · 2022-12-18T14:34:53Z

fastapi/dependencies/utils.py

+    for field in dependant.path_params:
+
+        def get_path_param_getter(field: ModelField) -> DependencyGetter:
+            def get_param(context: DependencySolverContext) -> None:


Why are all these getters defined inline? As they don't seem to be using any local state, could you move them out and declare them at the top level?

Note that this is specially necessary for getters defined in a loop as in each iteration it will create a new function, generating more memory garbage.

It's a style decision. My original implementation attempted to not use the outer function at all (e.g. get_path_param_getter(), but crate the closure directly (e.g. get_param(), but that doesn't work in a loop neccessitating the outer helper function. I kept it at the same place as the closure to keep all the logic in the same place.

You need not worry about "memory garbage". The outer function is assigned to a local variable and gets destroyed when the "get_dependent_dependency_getters()" exits. Each iteration just redefines the function. But I can remove it out of the loop if you want. It will just reduce the locality of the logic a bit. Python is nice like that, local functions definitions are just fine.

This also happens during the setup of the endpoints so it is not critical performance-wise.

But I'm happy to move them out to the top level if you think that style is better.

I've moved the definitions out of the loop scope, though.

github-actions · 2022-12-20T12:05:07Z

📝 Docs preview for commit caa8a57 at: https://63a1a4c5e073010797cce777--fastapi.netlify.app

github-actions · 2022-12-20T12:43:11Z

📝 Docs preview for commit cec8eef at: https://63a1adbbe073010ef9cce6fb--fastapi.netlify.app

github-actions · 2022-12-20T13:21:13Z

📝 Docs preview for commit 7e9e36f at: https://63a1b69312177a198aee6326--fastapi.netlify.app

github-actions · 2023-01-12T10:10:05Z

📝 Docs preview for commit 4336a43 at: https://63bfdc4d50d4fd112dacd587--fastapi.netlify.app

kristjanvalur · 2023-02-23T20:36:23Z

Is there any interest in reducing call overhead for FastAPI endpoint dependencies?

nonnibunk · 2023-09-01T09:56:01Z

Could we get this PR merged, I'd love to get this into my project.

Tishka17 · 2024-07-12T08:32:03Z

If you are interested in faster dependency solving, I can suggest my project which can be used with fastapi together (or without it, if you need)

https://github.com/reagento/dishka/tree/develop

kristjanvalur force-pushed the kristjan/depends branch from 76338e2 to 6435409 Compare October 28, 2022 17:26

kristjanvalur force-pushed the kristjan/depends branch from 80e12c2 to 040cd2a Compare October 28, 2022 17:37

kristjanvalur changed the title ~~Overhaul solve_dependencies()~~ Overhaul solve_dependencies() for performance Oct 28, 2022

kristjanvalur marked this pull request as ready for review October 28, 2022 17:45

kristjanvalur force-pushed the kristjan/depends branch from 4506386 to 53fcbdb Compare October 31, 2022 11:57

This was referenced Oct 31, 2022

Performance problem with dependency evaluation #5562

Closed

Make Websocket context_getter dependency async strawberry-graphql/strawberry#2278

Merged

tiangolo added hacktoberfest-accepted and removed hacktoberfest-accepted labels Oct 31, 2022

kristjanvalur force-pushed the kristjan/depends branch from 8ad762f to 0e6d410 Compare October 31, 2022 21:29

kristjanvalur force-pushed the kristjan/depends branch from 0e6d410 to 1a2d498 Compare October 31, 2022 21:53

kristjanvalur force-pushed the kristjan/depends branch from 1a2d498 to 059a07f Compare October 31, 2022 22:02

samuelcolvin and others added 4 commits November 4, 2022 14:05

bump

d4804b1

Adding test file for deep dependency cache hierarchies and performance

956ef11

Perform cache lookup before evaluating sub-dependencies.

4cfd4ad

This avoids unnecessary recursion which is subsequently discarded.

Optimize dependency overrides. Don't create a new dependent unless ne…

646e6b3

…eded.

kristjanvalur added 2 commits November 4, 2022 14:05

Create temporary response object only on demand and not for WebSockets.

2a379a1

The documentation does not list `Response` as as valid dependency for WebSocket handlers.

replace "temporal" with "temporary" when explaining the "Response" pa…

7c30b54

…rameter to endpoints.

kristjanvalur force-pushed the kristjan/depends branch from 059a07f to 7c30b54 Compare November 4, 2022 14:05

odiseo0 reviewed Nov 4, 2022

View reviewed changes

tiangolo added the investigate label Nov 4, 2022

kristjanvalur added 2 commits December 1, 2022 14:02

Merge remote-tracking branch 'upstream/master' into kristjan/depends

62f9700

lint

63dfd34

kristjanvalur mentioned this pull request Dec 8, 2022

get_context() is only resolved once per WS connection for FastAPI strawberry-graphql/strawberry#1754

Open

Merge branch 'master' into kristjan/depends

1800be7

Merge branch 'master' into kristjan/depends

1b332f2

sk- reviewed Dec 18, 2022

View reviewed changes

Store separate list for async built-in dependencies.

cec8eef

kristjanvalur force-pushed the kristjan/depends branch from caa8a57 to cec8eef Compare December 20, 2022 12:39

Move local function definitions out of loop scope

7e9e36f

Merge branch 'master' into kristjan/depends

4336a43

alejsdev added feature New feature or request p3 and removed investigate labels Jan 15, 2024

Overhaul solve_dependencies() for performance #5554

Are you sure you want to change the base?

Overhaul solve_dependencies() for performance #5554

Conversation

kristjanvalur commented Oct 28, 2022 • edited Loading

github-actions bot commented Oct 28, 2022

github-actions bot commented Oct 28, 2022

github-actions bot commented Oct 28, 2022

codecov bot commented Oct 28, 2022 • edited Loading

Codecov Report

github-actions bot commented Oct 28, 2022

github-actions bot commented Oct 28, 2022

github-actions bot commented Oct 28, 2022

github-actions bot commented Oct 31, 2022

github-actions bot commented Oct 31, 2022

github-actions bot commented Oct 31, 2022

github-actions bot commented Oct 31, 2022

github-actions bot commented Oct 31, 2022

github-actions bot commented Oct 31, 2022

github-actions bot commented Oct 31, 2022

github-actions bot commented Nov 4, 2022

odiseo0 Nov 4, 2022

Choose a reason for hiding this comment

kristjanvalur Nov 4, 2022 • edited Loading

Choose a reason for hiding this comment

odiseo0 Nov 4, 2022

Choose a reason for hiding this comment

github-actions bot commented Dec 1, 2022

github-actions bot commented Dec 16, 2022

github-actions bot commented Dec 18, 2022

sk- Nov 6, 2022

Choose a reason for hiding this comment

kristjanvalur Dec 19, 2022 • edited Loading

Choose a reason for hiding this comment

kristjanvalur Dec 20, 2022 • edited Loading

Choose a reason for hiding this comment

kristjanvalur Dec 20, 2022

Choose a reason for hiding this comment

sk- Nov 6, 2022

Choose a reason for hiding this comment

kristjanvalur Dec 19, 2022 • edited Loading

Choose a reason for hiding this comment

sk- Dec 18, 2022

Choose a reason for hiding this comment

kristjanvalur Dec 19, 2022

Choose a reason for hiding this comment

kristjanvalur Dec 20, 2022 • edited Loading

Choose a reason for hiding this comment

github-actions bot commented Dec 20, 2022

github-actions bot commented Dec 20, 2022

github-actions bot commented Dec 20, 2022

github-actions bot commented Jan 12, 2023

kristjanvalur commented Feb 23, 2023 • edited Loading

nonnibunk commented Sep 1, 2023

Tishka17 commented Jul 12, 2024

Overhaul `solve_dependencies()` for performance #5554

Overhaul `solve_dependencies()` for performance #5554

kristjanvalur commented Oct 28, 2022 •

edited

Loading

codecov bot commented Oct 28, 2022 •

edited

Loading

kristjanvalur Nov 4, 2022 •

edited

Loading

kristjanvalur Dec 19, 2022 •

edited

Loading

kristjanvalur Dec 20, 2022 •

edited

Loading

kristjanvalur Dec 19, 2022 •

edited

Loading

kristjanvalur Dec 20, 2022 •

edited

Loading

kristjanvalur commented Feb 23, 2023 •

edited

Loading