-
Notifications
You must be signed in to change notification settings - Fork 2.3k
Issues: huggingface/open-r1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
if use lighteval ,is it necessary for one to connect to the huggingface ?
#651
opened May 21, 2025 by
SiqingHe
🚀 Introducing simpleR1: A streamlined framework for training R1-like models based on TRL grpo_trainer
#650
opened May 20, 2025 by
yflyzhang
Does the default training script for step 1 sft phase, calculate the loss for the prompt part?
#648
opened May 20, 2025 by
chunhuizhang
how can I get the prediction using the provided evaluation script?
#625
opened Apr 25, 2025 by
CurryxIaoHu
Does the Qwen-2.5-VL model in the GRPO project currently support multi-image input?
#601
opened Apr 14, 2025 by
zby1218
model.generate
produces right-padded completions, causing incompatibility with Flash Attention 2
#599
opened Apr 14, 2025 by
PolarisHsu
GRPO config for finetuning Qwen-7B-Math-Instruct on OpenR1-Math-220k
#589
opened Apr 9, 2025 by
toslali-ibm
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.