Lora for remaining transformer models to inference_exp #1462

hansent · 2025-07-29T13:49:46Z

Description

This adds inference_exp support for the remaining transformer models that can be fine-tuned with LoRA on Roboflow.

qwen model implementation added to inference_exp
paligemma: added lora/peft support and fixed preprocessing
smolvlm: added lora/peft support and fixed preprocessing, changed to pass tensors instead of converting to PIL images

for all three models:

each is registered in the staging and prod model registries as a base model plus one lora test model; the packages are:
registered base models: paligemma2-3b-pt-224, qwen25vl-7b, smolvlm-256m, smolvlm (regular 1.7B size)
registered lora test models: paligemma-lora-test, qwen-lora-test, smolvlm-lora-test
e2e tests that load the base and lora versions of each model from the model registry and run predictions
model prediction tests
preprocessing tests
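
For readers less familiar with what a LoRA model package adds on top of a base model, here is a minimal sketch of the standard LoRA weight update, W' = W + (alpha / r) * B A. It is not the code in this PR (the description refers to lora/peft support for loading adapters); the helper names `matmul` and `merge_lora` are made up for illustration, and plain Python lists stand in for tensors.

```python
from typing import List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Multiply an (n x k) matrix by a (k x m) matrix."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def merge_lora(
    base_weight: Matrix, lora_a: Matrix, lora_b: Matrix, alpha: float, rank: int
) -> Matrix:
    """Return base_weight + (alpha / rank) * (lora_b @ lora_a).

    lora_a has shape (rank x in_features), lora_b has shape (out_features x rank),
    so their product has the same shape as base_weight.
    """
    scale = alpha / rank
    delta = matmul(lora_b, lora_a)
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(base_weight, delta)
    ]


# Toy example: a 2x2 base weight with a rank-1 adapter (illustrative values only).
base = [[1.0, 0.0], [0.0, 1.0]]
adapter_a = [[0.5, 0.0]]      # shape (1 x 2)
adapter_b = [[1.0], [2.0]]    # shape (2 x 1)
merged = merge_lora(base, adapter_a, adapter_b, alpha=2.0, rank=1)
```

The base weights stay untouched and only the small `adapter_a` / `adapter_b` matrices are trained, which is why a base model and its LoRA test model can be registered as separate packages.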

Type of change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
This change requires a documentation update

How has this change been tested, please provide a testcase or example of how you tested the change?

integration tests locally

Any specific deployment considerations

n/a

Docs

n/a


PawelPeczek-Roboflow · 2025-07-30T18:00:46Z

inference_experimental/inference_exp/models/auto_loaders/models_registry.py

@@ -121,6 +121,14 @@
        module_name="inference_exp.models.paligemma.paligemma_hf",
        class_name="PaliGemmaHF",
    ),
+    ("smolvlm", VLM_TASK, BackendType.HF): LazyClass(


should be smolvlm-v2 probably
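
To illustrate why the architecture string in the key matters, here is a rough sketch of a lazily resolved registry keyed by (architecture, task, backend). Only the names `LazyClass`, `module_name`, `class_name`, `VLM_TASK` and `BackendType.HF` come from the diff above; their definitions here, the `resolve` method, `REGISTRY`, `resolve_model_class`, and the stdlib class used as a stand-in target are assumptions, and the real inference_exp code may differ.

```python
import importlib
from dataclasses import dataclass
from enum import Enum
from typing import Dict, Tuple


class BackendType(Enum):
    HF = "hf"  # hypothetical value


VLM_TASK = "vlm"  # hypothetical value


@dataclass(frozen=True)
class LazyClass:
    module_name: str
    class_name: str

    def resolve(self) -> type:
        # Import the module only when the class is actually requested.
        module = importlib.import_module(self.module_name)
        return getattr(module, self.class_name)


# A stdlib class stands in for the real model class so the sketch runs anywhere.
REGISTRY: Dict[Tuple[str, str, BackendType], LazyClass] = {
    ("smolvlm-v2", VLM_TASK, BackendType.HF): LazyClass(
        module_name="collections",
        class_name="OrderedDict",
    ),
}


def resolve_model_class(architecture: str, task: str, backend: BackendType) -> type:
    key = (architecture, task, backend)
    if key not in REGISTRY:
        raise KeyError(f"No model registered for {key}")
    return REGISTRY[key].resolve()


model_class = resolve_model_class("smolvlm-v2", VLM_TASK, BackendType.HF)
```

With an exact-match lookup like this, a package whose architecture is reported as `smolvlm-v2` would not resolve against a `smolvlm` key, which is the kind of mismatch the comment above points at.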

hansent added 5 commits July 28, 2025 15:35

add lora/perf weights loading for smolvlm2 and paligemma

03f1026

remove print statements

7875f93

Merge remote-tracking branch 'origin/main' into lora-for-remaining-tr…

bc2c720

…ansformer-models

register smolvlm and add e2e tests for paligemma and smolvlm

4079647

add qwen model implementation

f5e233d

PawelPeczek-Roboflow reviewed Jul 30, 2025

hansent added 6 commits July 30, 2025 15:54

add qwen tests

4085592

paligemma model and preprocessing tests

e76bb80

smolvlm preprocessing and prediction tests

c26f438

fix color conversion and e2e tests

3633dcf

fix paligemma model tests now that we fixed the color conversion

1ee8070

fix qwen model tests

8f6f7e2

hansent changed the title ~~Lora for remaining transformer models~~ Lora for remaining transformer models to inference_exp Jul 30, 2025

hansent marked this pull request as ready for review July 30, 2025 21:41

hansent requested review from grzegorz-roboflow, yeldarby and probicheaux as code owners July 30, 2025 21:41

hansent added 3 commits July 31, 2025 08:54

Merge branch 'main' into lora-for-remaining-transformer-models

1edc92b

Merge branch 'main' into lora-for-remaining-transformer-models

9b69bb1

update model architechture string for smolvlm to smolvlm-v2

4ac5772
