SD.Next Release 2025-05-12 #3930

vladmandic · 2025-05-12T15:19:53Z

vladmandic
May 12, 2025
Maintainer

SD.Next Release 2025-05-12

First of all NNCF quantization engine has gone through some major enhancements and its now much faster, both in quantization as well as actual inference!
And its a only truly cross-platform solution for quantization as all other methods are platform specific.

Note if you're a ZLUDA user, see notes on GPU driver compatibility as recent Andrenaline drivers do cause problems!
And if you're a ROCm user, this release brings much faster compile times on Linux as well as first (experimental) builds for Windows!

ReadMe | ChangeLog | Docs | WiKi | Discord

Details for 2025-05-12

Compute
- NNCF
  - Faster quantization
  - Faster inference with support for torch.triton
    up to 3.5x faster with INT4 and 2x faster with INT8
  - New settings: NNCF -> Group size
    default is a balance between performance (higher size) and quality (lower size)
    0 is default at 64, -1 disables grouping
- ZLUDA:
  - warning: AMD Adrenaline 25.5.1 drivers are NOT COMPATIBLE with ZLUDA
    see issue for details
- ROCm
  - first working builds of Torch with ROCm on Windows
    highly experimental
    reach out on Discord if you want to test it
Features
- Prompt Enhancer: support for img2img workflows
  in img2img prompt enhancer will first analyze input image and then incorporate user prompt to create enhanced prompt
- FramePack
  - improve LoRA compatibility
  - add metadata to video
- UI
  - ModernUI: support for History tab
  - ModernUI: support for FramePack tab
- API
  - add /sdapi/v1/framepack endpoint with full support for FramePack including all optional settings
    see example: sd-extension-framepack/create-video.py
  - add /sdapi/v1/checkpoint endpoint to get info on currently loaded model/checkpoint
    see example: cli/api-checkpoint.py
  - add /sdapi/v1/prompt-enhance endpoint to enhance prompt using LLM
    see example: cli/api-enhance.py
    supports text, image and video prompts with or without input image
    note: if input image is provided, model should be left at default gemma-3-4b-it as most other LLMs do not support hybrid workflows
Fixes
- Latent Diffusion Upscale
- Model load: support SDXL safetensors packaged without VAE
- ROCm: disable cuDNN benchmark, fixes slow MIOpen tuning with torch==2.7
- Extensions: use in-process installer for extensions-builtin, improves startup performance
- FramePack: monkey-patch for dynamically installed av
- Logging: reduce spam while progress is active
- LoRA: legacy handler enable/disable
- LoRA: force clear-cache on model unload
- ADetailer: fix enable/disable
- ZLUDA: improve compatibility with older GPUs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

SD.Next Release 2025-05-12 #3930

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Uh oh!

SD.Next Release 2025-05-12 #3930

Uh oh!

vladmandic May 12, 2025 Maintainer