[Feature] Dynamic Lora Support in SGLang #2686

grahama1970 · 2024-12-31T12:20:29Z

Checklist

1. If the issue you raised is not a feature but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussions/new/choose Otherwise, it will be closed.
2. Please use English, otherwise it will be closed.

Motivation

In SGLang, I would to dynamically apply domain-specific Lora adapters to smaller/local models. Normally, I use SGLang for inference. Recently, I've switched to Vllm which already has the ability to unload/load adaptors: https://docs.vllm.ai/en/latest/usage/lora.html
If this feature is already exists in SGlang, can you add an example in the documentation?

Related resources

https://docs.vllm.ai/en/latest/usage/lora.html

zhaochenyang20 · 2024-12-31T22:49:14Z

@Ying1123 and @Wenyueh is on this. Thanks!

yileld · 2025-01-07T07:53:24Z

it easy to implement dynamic lora, just copy lora_manager.init_loras() as lora_manager.update_loras(), and add a judge
`

    if not hasattr(self, 'lora_modules'):
        self.lora_modules = []
    for module_name, module in self.get_target_modules():
        lora_module = self.set_lora_module(module_name, module)
        if lora_module:
            self.lora_modules.append(
                (module_name, lora_module)
            )

`
then use lora_manager.update_loras() before model_runner.lora_manager.prepare_lora_batch(ret).

Fridge003 · 2025-01-07T18:56:29Z

Can this issue be solved with the answer by @yileld above? Is it necessary to open a PR to fix up this issue？@Ying1123

grahama1970 · 2025-01-09T16:20:59Z

If it's more complex than the above solution, I'd love a coded explanation on how Loras are applied to the model, an, if multiple adaptors can be applied simultaneously, from different knowledge bases like finance and math adaptors.

zhaochenyang20 · 2025-01-09T21:25:57Z

If it's more complex than the above solution, I'd love a coded explanation on how Loras are applied to the model, an, if multiple adaptors can be applied simultaneously, from different knowledge bases like finance and math adaptors.

@Fridge003

Fridge003 · 2025-01-10T07:10:15Z

If it's more complex than the above solution, I'd love a coded explanation on how Loras are applied to the model, an, if multiple adaptors can be applied simultaneously, from different knowledge bases like finance and math adaptors.

After reading relevant codes, I feel the above solution is not enough to support the feature well because LoraManager instance has many attributes to maintain and update. I will try to come up with a better solution.

zhaochenyang20 · 2025-01-10T08:28:32Z

@Fridge003 Thanks for pointing this out, and I hope to see your solutions!

grahama1970 · 2025-01-21T22:55:53Z

Hi. Has there been any progress on Dynamically switching LorRas in SGLang? Thanks in advance

zhaochenyang20 · 2025-01-21T23:02:50Z

@Fridge003

Hi. Has there been any progress on Dynamically switching LorRas in SGLang? Thanks in advance

Fridge003 · 2025-01-22T00:43:21Z

Hi. Has there been any progress on Dynamically switching LorRas in SGLang? Thanks in advance

Hi, this feature is still under development and I have started a relevant PR draft #2891. The feature design is also described in this PR.

zhaochenyang20 · 2025-01-22T04:20:15Z

Cool! @Fridge003

github-actions · 2025-03-24T00:18:32Z

This issue has been automatically closed due to inactivity. Please feel free to reopen it if needed.

sunzx8 · 2025-04-18T05:21:34Z

Hi. Has there been any progress on Dynamically switching LorRas in SGLang? Thanks in advance

Hi, this feature is still under development and I have started a relevant PR draft #2891. The feature design is also described in this PR.

Hi, has any updates now? thanks for the hard work in advance

zhaochenyang20 added the lora label Dec 31, 2024

Fridge003 mentioned this issue Jan 14, 2025

[Feature] Support dynamic loading and unloading of Lora adapters #2891

Closed

3 tasks

github-actions bot closed this as completed Mar 24, 2025

github-actions bot added the inactive label Mar 24, 2025

Fridge003 reopened this Mar 24, 2025

zhaochenyang20 changed the title ~~[Feature] Dynamic Lora Support in SGLang (like VLLM)~~ [Feature] Dynamic Lora Support in SGLang Apr 18, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Feature] Dynamic Lora Support in SGLang #2686

[Feature] Dynamic Lora Support in SGLang #2686

grahama1970 commented Dec 31, 2024

zhaochenyang20 commented Dec 31, 2024

Uh oh!

yileld commented Jan 7, 2025 •

edited

Loading

Uh oh!

Fridge003 commented Jan 7, 2025

Uh oh!

grahama1970 commented Jan 9, 2025

Uh oh!

zhaochenyang20 commented Jan 9, 2025

Uh oh!

Fridge003 commented Jan 10, 2025

Uh oh!

zhaochenyang20 commented Jan 10, 2025

Uh oh!

grahama1970 commented Jan 21, 2025

Uh oh!

zhaochenyang20 commented Jan 21, 2025

Uh oh!

Fridge003 commented Jan 22, 2025

Uh oh!

zhaochenyang20 commented Jan 22, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Mar 24, 2025

Uh oh!

sunzx8 commented Apr 18, 2025

Uh oh!

[Feature] Dynamic Lora Support in SGLang #2686

[Feature] Dynamic Lora Support in SGLang #2686

Comments

grahama1970 commented Dec 31, 2024

Checklist

Motivation

Related resources

zhaochenyang20 commented Dec 31, 2024

Uh oh!

yileld commented Jan 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Fridge003 commented Jan 7, 2025

Uh oh!

grahama1970 commented Jan 9, 2025

Uh oh!

zhaochenyang20 commented Jan 9, 2025

Uh oh!

Fridge003 commented Jan 10, 2025

Uh oh!

zhaochenyang20 commented Jan 10, 2025

Uh oh!

grahama1970 commented Jan 21, 2025

Uh oh!

zhaochenyang20 commented Jan 21, 2025

Uh oh!

Fridge003 commented Jan 22, 2025

Uh oh!

zhaochenyang20 commented Jan 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Mar 24, 2025

Uh oh!

sunzx8 commented Apr 18, 2025

Uh oh!

yileld commented Jan 7, 2025 •

edited

Loading

zhaochenyang20 commented Jan 22, 2025 •

edited

Loading