Update longformer.md #37622
Conversation
Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. The CI will be paused while the PR is in draft mode. When it is ready for review, please click the Ready for review button.
Nice, thanks for adding!
For more information, please also refer to the [`~LongformerModel.forward`] method.
Quantization reduces the memory burden of large models by representing the weights in a lower precision. Refer to the [Quantization](../quantization/overview) overview for more available quantization backends.
I don't think we need a Quantization example here since the model isn't that big
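For context, a rough sketch of what the quantization snippet referenced above might look like; the bitsandbytes 4-bit configuration and the checkpoint name are my assumptions, not code from the PR.

```py
# hedged sketch: loading Longformer with 4-bit bitsandbytes quantization
# (the review above suggests dropping this from the docs since the model is small)
import torch
from transformers import AutoModelForMaskedLM, BitsAndBytesConfig

quant_config = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_compute_dtype=torch.float16)
model = AutoModelForMaskedLM.from_pretrained(
    "allenai/longformer-base-4096",
    quantization_config=quant_config,
)
```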
- [Question answering task guide](../tasks/question_answering)
- [Masked language modeling task guide](../tasks/masked_language_modeling)
- [Multiple choice task guide](../tasks/multiple_choice)
- If you're using Transformers < 4.37.0.dev, set `trust_remote_code=True` in [`~AutoModel.from_pretrained`]. Otherwise, make sure you update Transformers to the latest stable version.
Not necessary to include this note. Instead, add the below
- Longformer is based on RoBERTa and doesn't have `token_type_ids`. You don't need to indicate which token belongs to which segment. You only need to separate the segments with the separation token `</s>` or `tokenizer.sep_token`.
- You can set which tokens can attend locally and which tokens attend globally with the `global_attention_mask` at inference (see this example for more details, and the sketch after this list). A value of `0` means a token attends locally and a value of `1` means a token attends globally.
- [`LongformerForMaskedLM`] is trained like [`RobertaForMaskedLM`] and should be used as shown below.

```py
input_ids = tokenizer.encode("This is a sentence from [MASK] training data", return_tensors="pt")
mlm_labels = tokenizer.encode("This is a sentence from the training data", return_tensors="pt")

# the labels are the unmasked token ids; the loss is computed against them
loss = model(input_ids, labels=mlm_labels)[0]
```
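To make the `global_attention_mask` bullet concrete, here is a minimal sketch of giving one token global attention at inference. The checkpoint name and the choice to make only the first token global are my assumptions, not part of the suggested text.

```py
import torch
from transformers import AutoModel, AutoTokenizer

# assumed checkpoint: the public allenai/longformer-base-4096 weights
tokenizer = AutoTokenizer.from_pretrained("allenai/longformer-base-4096")
model = AutoModel.from_pretrained("allenai/longformer-base-4096")

inputs = tokenizer("A long document ...", return_tensors="pt")

# 0 = local attention, 1 = global attention; here only the first (<s>) token attends globally
global_attention_mask = torch.zeros_like(inputs["input_ids"])
global_attention_mask[:, 0] = 1

outputs = model(**inputs, global_attention_mask=global_attention_mask)
```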
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Thanks again! 🤗
* Update longformer.md
* Update longformer.md
* Update docs/source/en/model_doc/longformer.md

Co-authored-by: Steven Liu <[email protected]>

* Update docs/source/en/model_doc/longformer.md

Co-authored-by: Steven Liu <[email protected]>

* Update longformer.md

---------

Co-authored-by: Steven Liu <[email protected]>
- Refactored Longformer docs
- Added examples for Pipeline, AutoModel, and the CLI (a pipeline sketch follows below)
- Added quantization
- Did not add an attention visualizer; from what I researched, Longformer doesn't support it. If that's not the case, I'm happy to add it!
- Added a note concerning versions < 4.37.0.dev
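For reference, a minimal sketch of the kind of pipeline example this description refers to; the fill-mask task and the allenai/longformer-base-4096 checkpoint are assumptions on my part, not taken from the PR diff.

```py
# hedged sketch: a fill-mask pipeline with a Longformer checkpoint
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="allenai/longformer-base-4096")
print(fill_mask("The Eiffel Tower is located in <mask>."))
```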