SD.Next Release 2025-05-12 #3930
vladmandic
announced in
Announcements
SD.Next Release 2025-05-12
First of all, the NNCF quantization engine has gone through some major enhancements and it's now much faster, both in quantization and in actual inference!
It's also the only truly cross-platform solution for quantization, as all other methods are platform-specific.
Note: if you're a ZLUDA user, see the notes on GPU driver compatibility, as recent Adrenalin drivers do cause problems!
And if you're a ROCm user, this release brings much faster compile times on Linux as well as first (experimental) builds for Windows!
ReadMe | ChangeLog | Docs | WiKi | Discord
Details for 2025-05-12
torch.triton
up to 3.5x faster with INT4 and 2x faster with INT8
default is a balance between performance (higher size) and quality (lower size)
0 selects the default value of 64, -1 disables grouping
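To illustrate the trade-off described above, here is a minimal pure-Python sketch of group-wise symmetric quantization. It is a generic illustration, not NNCF's or SD.Next's actual implementation: the helper names (`resolve_group_size`, `quantize_dequantize`, `mean_abs_error`) and the sample weights are hypothetical. Only the mapping of `0` to 64 and `-1` to "no grouping" comes from the notes above.

```python
from typing import List, Optional

DEFAULT_GROUP_SIZE = 64  # from the notes: a setting of 0 means "use the default of 64"


def resolve_group_size(setting: int) -> Optional[int]:
    """Map the raw setting to an effective group size (None means no grouping)."""
    if setting == 0:
        return DEFAULT_GROUP_SIZE
    if setting == -1:
        return None
    if setting < -1:
        raise ValueError("group size must be -1, 0, or a positive integer")
    return setting


def quantize_dequantize(weights: List[float], group_size: Optional[int], bits: int = 8) -> List[float]:
    """Round-trip weights through symmetric integer quantization, one scale per group."""
    if not weights:
        return []
    qmax = 2 ** (bits - 1) - 1  # 127 for INT8, 7 for INT4
    size = group_size or len(weights)  # no grouping: a single scale for everything
    out: List[float] = []
    for start in range(0, len(weights), size):
        group = weights[start:start + size]
        scale = max(abs(w) for w in group) / qmax or 1.0  # avoid a zero scale
        for w in group:
            q = max(-qmax, min(qmax, round(w / scale)))  # clamp to the integer range
            out.append(q * scale)
    return out


def mean_abs_error(a: List[float], b: List[float]) -> float:
    """Average absolute difference between two equally long lists."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


# Hypothetical weights: many small values plus one large outlier.
weights = [0.001 * (i % 50) for i in range(255)] + [8.0]

err_grouped = mean_abs_error(weights, quantize_dequantize(weights, resolve_group_size(0), bits=4))
err_ungrouped = mean_abs_error(weights, quantize_dequantize(weights, resolve_group_size(-1), bits=4))
```

With these sample weights, the single outlier sets the scale for every value when grouping is disabled, while with groups of 64 it only affects its own group. Smaller groups therefore tend to preserve quality, at the cost of storing and applying more scales.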
see issue for details
highly experimental
reach out on Discord if you want to test it
in img2img, the prompt enhancer first analyzes the input image and then incorporates the user prompt to create an enhanced prompt
/sdapi/v1/framepack
endpoint with full support for FramePack, including all optional settings; see example:
sd-extension-framepack/create-video.py
/sdapi/v1/checkpoint
endpoint to get info on the currently loaded model/checkpoint; see example:
cli/api-checkpoint.py
/sdapi/v1/prompt-enhance
endpoint to enhance prompt using LLM; see example:
cli/api-enhance.py
supports text, image and video prompts with or without input image
note: if input image is provided, model should be left at default
gemma-3-4b-it
as most other LLMs do not support hybrid workflows
torch==2.7
av
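As a rough illustration of how the new API endpoints listed above might be called, the sketch below queries `/sdapi/v1/checkpoint` using only the Python standard library. Several details are assumptions rather than facts from these notes: the base URL `http://127.0.0.1:7860`, the use of an HTTP GET, and a JSON response body. The helper names `endpoint_url` and `get_checkpoint_info` are hypothetical. The bundled `cli/api-checkpoint.py` remains the authoritative example.

```python
import json
import urllib.request
from urllib.parse import urljoin

BASE_URL = "http://127.0.0.1:7860"  # assumption: a locally running server on this address


def endpoint_url(path: str, base_url: str = BASE_URL) -> str:
    """Join the server base URL and an API path, tolerating stray slashes."""
    return urljoin(base_url.rstrip("/") + "/", path.lstrip("/"))


def get_checkpoint_info(base_url: str = BASE_URL, timeout: float = 30.0) -> dict:
    """Fetch info on the currently loaded checkpoint.

    Assumes the endpoint answers a GET request with a JSON object;
    the response fields are not documented here, so none are accessed.
    """
    url = endpoint_url("/sdapi/v1/checkpoint", base_url)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `get_checkpoint_info()` requires a running server with the API enabled. The same `endpoint_url` helper can build URLs for `/sdapi/v1/framepack` and `/sdapi/v1/prompt-enhance`, whose request payloads are shown in their respective example scripts.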