Skip to content

Pull requests: open-compass/opencompass

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Update matbench testing
#2116 opened May 23, 2025 by smgjch Loading…
6 tasks done
More stable MBPP evaluation
#2111 opened May 21, 2025 by f14-bertolotti Loading…
[RULER] Extend 256k and 512k data generators
#2109 opened May 21, 2025 by changlan Loading…
6 tasks done
SRbench
#2105 opened May 20, 2025 by soki123 Loading…
update earth silver benchmark
#2104 opened May 18, 2025 by Zhouzone Loading…
4 of 6 tasks
healthbench
#2099 opened May 15, 2025 by bio-mlhui Loading…
1 task
[Dataset] Add R-Bench (ICML 2025)
#2091 opened May 11, 2025 by uyzhang Loading…
3 of 6 tasks
[Update] Enhancements and Fixes in NeedlebenchV2
#2090 opened May 9, 2025 by Mor-Li Loading…
BaseInferencer batch_size and max_seq_len cast to int
#2074 opened May 5, 2025 by f14-bertolotti Loading…
6 tasks
PromptCBLUE:Life Science dataset
#2073 opened May 4, 2025 by tchenglv520 Loading…
6 tasks done
Phybench
#2069 opened Apr 30, 2025 by epsilondylan Loading…
2 tasks
fix llm judge evaluator import and docs
#2057 opened Apr 27, 2025 by smgjch Loading…
[Dataset]Add GAIA Datasets
#2051 opened Apr 26, 2025 by domonic18 Loading…
1 of 6 tasks
[Feature] Support AntFinix LLM
#2043 opened Apr 24, 2025 by xsq2060 Loading…
replace the model name for new version of bailing
#2034 opened Apr 23, 2025 by cuauty Loading…
6 tasks done
[Dataset] Add SeedBench Dataset
#2020 opened Apr 14, 2025 by ChenZiHong-Gavin Loading…
5 tasks done
[Update] Code related benchmarks update
#2005 opened Apr 6, 2025 by Zhudongsheng75 Loading…
[Fix] Fix default torch dtype loading
#1969 opened Mar 25, 2025 by liushz Loading…
6 tasks
[Model] Add new model: Ola
#1912 opened Mar 4, 2025 by bobo0810 Loading…
4 of 6 tasks
add additional evaluation configs
#1882 opened Feb 19, 2025 by leao1995 Loading…
6 tasks done
Add Benchmax part 1
#1863 opened Feb 11, 2025 by hanxuhu Loading…
Add HuProverbRea (02031239)
#1853 opened Feb 3, 2025 by little-bird-vodka Loading…
6 tasks
ProTip! Follow long discussions with comments:>50.