Skip to content

Additional Trainer Argument for features of different modalities #3225

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 14 commits into
base: master
Choose a base branch
from

Conversation

mengerj
Copy link

@mengerj mengerj commented Feb 11, 2025

Hi,

I have been working on multi-modal embedding models based on the sentence-transformers framework and found that I need to hard code keys into the "collect_features" method within the trainer. I simply added an argument to allow passing a list of additional keys that might be present in your features (similar to "pixel_values" for CLIP). I would appreciate a merge so I don't need to rely on my fork for the multi-modal embedding model.

Best,
Jonatan

@tomaarsen
Copy link
Collaborator

Hello!

I think this is indeed quite valuable to support, so I'd like to help you out here. However, I think we might be better off going in a different direction that should work out of the box without the user having to ever specify anything: #3276

Feel free to give me some feedback there!

Also, apologies for the delays. I've been focusing on the huge #3222, which is now nearing completion.

  • Tom Aarsen

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants