[`fix`] Use return_dict=True in Transformer; improve how all_layer_embeddings are determined #3320

tomaarsen · 2025-04-11T13:19:18Z

Hello!

Pull Request overview

Improve how all_layer_embeddings are determined

Details

In short, instead of return_dict=False where we just get a tuple without any keys, we now use return_dict=True, allowing us to see the output names. transformers seems to have a pretty strong convention to use hidden_states for the output_hidden_states, so this should be fairly safe. Beyond that, I'm still using [0] indexing for the token embeddings (also known as last_hidden_state).

My intention is not to introduce any backwards breaking here, but there's always a risk. Let me know if your code breaks because of this!

Tom Aarsen

… are determined

Copilot

Copilot reviewed 1 out of 1 changed files in this pull request and generated no comments.

Comments suppressed due to low confidence (1)

sentence_transformers/models/Transformer.py:443

[nitpick] Using indexing with [0] to extract token embeddings may be unclear now that a dictionary is returned; consider using outputs['last_hidden_state'] to improve code clarity and maintain consistency with typical transformers output.

token_embeddings = outputs[0]

Use return_dict=True in Transformer; improve how all_layer_embeddings…

b4fa371

… are determined

tomaarsen requested a review from Copilot April 14, 2025 11:28

Copilot AI reviewed Apr 14, 2025

View reviewed changes

tomaarsen merged commit 03dff58 into UKPLab:master Apr 14, 2025
1 of 9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[`fix`] Use return_dict=True in Transformer; improve how all_layer_embeddings are determined #3320

[`fix`] Use return_dict=True in Transformer; improve how all_layer_embeddings are determined #3320

Uh oh!

tomaarsen commented Apr 11, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

[fix] Use return_dict=True in Transformer; improve how all_layer_embeddings are determined #3320

[fix] Use return_dict=True in Transformer; improve how all_layer_embeddings are determined #3320

Uh oh!

Conversation

tomaarsen commented Apr 11, 2025

Pull Request overview

Details

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

[`fix`] Use return_dict=True in Transformer; improve how all_layer_embeddings are determined #3320

[`fix`] Use return_dict=True in Transformer; improve how all_layer_embeddings are determined #3320