Pull requests: EleutherAI/lm-evaluation-harness
Removed repeated "Let's think step by step." text from bbh cot prompts
#3140
opened Jul 12, 2025 by
philipdoldo
Added mixed_precision_dtype argument to HFLM to enable autocasting
#3138
opened Jul 11, 2025 by
Avelina9X
Fix mmlu_continuation subgroup names to fit Readme and other variants
#3137
opened Jul 11, 2025 by
lamalunderscore
when using vllm with lora, it will have some mistakes, now i fix it.
#3132
opened Jul 11, 2025 by
Jacky-MYQ
FixBug: Fix the wrong configs for gpqa_cot_n_shot
#3131
opened Jul 11, 2025 by
Summer-Summer
Fix: extended to max_gen_toks 8192 for HRM8K math benchmarks
#3124
opened Jul 10, 2025 by
shing100
Add support for OpenVINO text2text generation models
#3101
opened Jul 3, 2025 by
nikita-savelyevv
Draft
feat(api_models): add enable_thinking param in chat_template_kwargs
#3088
opened Jun 27, 2025 by
johnsonafool
Refactor ConfigurableTask.process_results into modular helpers
#3085
opened Jun 25, 2025 by
mfisher35
[Proposal] Change hyphens in n-shot and n-samples to underscores
#3084
opened Jun 24, 2025 by
kiersten-stokes
Gracefully skip BigBench tasks with no data & guard final aggregation
#3066
opened Jun 17, 2025 by
NourFahmy