Skip to content

Lora for remaining transformer models to inference_exp #1462

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 14 commits into
base: main
Choose a base branch
from

Conversation

hansent
Copy link
Contributor

@hansent hansent commented Jul 29, 2025

Description

This add inference_exp support for remaining transformer models that we can fine tune with LoRA on roboflow.

  • qwen model implementation added to inference_exp
  • paligemma: added lora/peft support and fixed preprocessing
  • smolvlm: added lora/peft support and fixed preprocessing, changed to pass tensors instead of converting to PIL images

for all three models:

  • each is registered on staging and prod model registry as base model and one lora test model packages are:
  • registered base models: paligemma2-3b-pt-224, qwen25vl-7b, smolvlm-256m, smolvlm (regular 1.7B size)
  • registered lora test models: paligemma-lora-test, qwen-lora-test, smolvlm-lora-test
  • e2e tests that load based and lora version model from model registry and run predictions
  • model prediction tests
  • preprocessing tests

Type of change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • This change requires a documentation update

How has this change been tested, please provide a testcase or example of how you tested the change?

integration tests locally

Any specific deployment considerations

n/a

Docs

n/a

@@ -121,6 +121,14 @@
module_name="inference_exp.models.paligemma.paligemma_hf",
class_name="PaliGemmaHF",
),
("smolvlm", VLM_TASK, BackendType.HF): LazyClass(
Copy link
Collaborator

@PawelPeczek-Roboflow PawelPeczek-Roboflow Jul 30, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should be smolvlm-v2 probably

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

changed

@hansent hansent changed the title Lora for remaining transformer models Lora for remaining transformer models to inference_exp Jul 30, 2025
@hansent hansent marked this pull request as ready for review July 30, 2025 21:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants