Tidy app entrypoint #7668

RyanJDick · 2025-02-21T21:25:04Z

Summary

Prior to this PR, most of the app setup was being done in api_app.py at import time. This PR cleans this up, by:

Splitting app setup into more modular functions
Narrower responsibility for the api_app.py file - it just initializes the FastAPI app

The main motivation for this changes is to make it easier to support an upcoming torch configuration feature that requires more careful ordering of app initialization steps.

Related Issues / Discussions

N/A

QA Instructions

Launch the app via invokeai-web.py and smoke test it.
Launch the app via the installer and smoke test it.
Test that generate_openapi_schema.py produces the same result before and after the change.
No regression in unit tests that directly interact with the app. (test_images.py)

Merge Plan

Check to see if there are any commercial implications to modifying the app entrypoint.

Checklist

The PR has a short but descriptive title, suitable for a changelog
Tests added / updated (if applicable)
Documentation added / updated (if applicable)
Updated What's New copy (if doing a release after this PR)

invokeai/app/util/startup_utils.py

psychedelicious

I've improved custom node loading in #7698, moving that code to a function instead of running it implicitly as python loads modules.

Maybe should be re-organised per your other changes.

…iable.

…esponsible for just initializing the FastAPI app. This also gives clearer control over the order of the initialization steps, which will be important as we add planned torch configurations that must be applied before torch is imported.

…to review comment.

…7673) ## Summary This PR adds a `pytorch_cuda_alloc_conf` config flag to control the torch memory allocator behavior. - `pytorch_cuda_alloc_conf` defaults to `None`, preserving the current behavior. - The configuration options are explained here: https://pytorch.org/docs/stable/notes/cuda.html#optimizing-memory-usage-with-pytorch-cuda-alloc-conf. Tuning this configuration can reduce peak reserved VRAM and improve performance. - Setting `pytorch_cuda_alloc_conf: "backend:cudaMallocAsync"` in `invokeai.yaml` is expected to work well on many systems. This is a good first step for those looking to tune this config. (We may make this the default in the future.) - The optimal configuration seems to be dependent on a number of factors such as device version, VRAM, CUDA kernel version, etc. For now, users will have to experiment with this config to see if it hurts or helps on their systems. In most cases, I expect it to help. ### Memory Tests ``` VAE decode memory usage comparison: - SDXL, fp16, 1024x1024: - `cudaMallocAsync`: allocated=2593 MB, reserved=3200 MB - `native`: allocated=2595 MB, reserved=4418 MB - SDXL, fp32, 1024x1024: - `cudaMallocAsync`: allocated=3982 MB, reserved=5536 MB - `native`: allocated=3982 MB, reserved=7276 MB - SDXL, fp32, 1536x1536: - `cudaMallocAsync`: allocated=8643 MB, reserved=12032 MB - `native`: allocated=8643 MB, reserved=15900 MB ``` ## Related Issues / Discussions N/A ## QA Instructions - [x] Performance tests with `pytorch_cuda_alloc_conf` unset. - [x] Performance tests with `pytorch_cuda_alloc_conf: "backend:cudaMallocAsync"`. ## Merge Plan - [x] Merge #7668 first and change target branch to `main` ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [x] _Tests added / updated (if applicable)_ - [x] _Documentation added / updated (if applicable)_ - [ ] _Updated `What's New` copy (if doing a release after this PR)_

github-actions bot added the python PRs that change python files label Feb 21, 2025

keturn reviewed Feb 21, 2025

View reviewed changes

invokeai/app/util/startup_utils.py Outdated Show resolved Hide resolved

RyanJDick force-pushed the ryan/tidy-entry branch from 79f7b8e to 9fa365f Compare February 24, 2025 15:32

RyanJDick mentioned this pull request Feb 24, 2025

Add pytorch_cuda_alloc_conf config to tune VRAM memory allocation #7673

Merged

7 tasks

RyanJDick marked this pull request as ready for review February 24, 2025 19:51

RyanJDick requested review from blessedcoolant, psychedelicious, brandonrising and hipsterusername as code owners February 24, 2025 19:51

psychedelicious approved these changes Feb 24, 2025

View reviewed changes

psychedelicious self-requested a review February 27, 2025 01:17

psychedelicious reviewed Feb 27, 2025

View reviewed changes

RyanJDick added 8 commits February 28, 2025 20:08

Move find_port() util to its own file.

6f1dcf3

Move check_cudnn() and jurigged setup to startup_utils.py.

35910d3

Simplify port selection logic to avoid the need for a global port var…

ca23b53

…iable.

Create an apply_monkeypatches() start util.

f345c0f

Add register_mime_types() startup util.

38991ff

Make InvokeAILogger an inline import in startup_utils.py in response …

da2b681

…to review comment.

Move load_custom_nodes() to run_app() entrypoint.

1e2c7c5

RyanJDick force-pushed the ryan/tidy-entry branch from 9ba2713 to 1e2c7c5 Compare February 28, 2025 20:54

RyanJDick merged commit 26730ca into main Feb 28, 2025
15 checks passed

RyanJDick deleted the ryan/tidy-entry branch February 28, 2025 21:07

psychedelicious mentioned this pull request Apr 7, 2025

feat(app): restore "Using torch device" message on startup #7888

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tidy app entrypoint #7668

Tidy app entrypoint #7668

RyanJDick commented Feb 21, 2025 •

edited

Loading

psychedelicious left a comment

Tidy app entrypoint #7668

Tidy app entrypoint #7668

Conversation

RyanJDick commented Feb 21, 2025 • edited Loading

Summary

Related Issues / Discussions

QA Instructions

Merge Plan

Checklist

psychedelicious left a comment

Choose a reason for hiding this comment

RyanJDick commented Feb 21, 2025 •

edited

Loading