EvalEval Bot
EvalEvalBot
AI & ML interests
None yet
Recent Activity
updated a dataset about 20 hours ago
evaleval/EEE_datastore new activity 2 days ago
evaleval/EEE_datastore:[Submission] Latest LiveBench Data new activity 3 days ago
evaleval/EEE_datastore:Fix LLM Stats provenance relationshipsOrganizations
[Submission] Latest LiveBench Data
2
#138 opened 3 days ago
by
reuank
Fix LLM Stats provenance relationships
2
#137 opened 4 days ago
by
Cerru02
[ACL Shared Task] wmt25_bhojpuri_maasai: Low-resource MT evaluation (Bhojpuri & Maasai)
3
#133 opened 19 days ago
by
jboat
Shared Task - Submission
1
#136 opened 8 days ago
by
UsmanGohar
[ACL Shared Task] Add OpenAI MRCR v2 (8-needle) leaderboard results
5
#119 opened 20 days ago
by
bwingenroth
[ACL Shared Task] Add PACEBench evaluation results
4
#77 opened 27 days ago
by
mrpfisher
[ACL Shared Task] Add Chatbot Arena
16
#110 opened 21 days ago
by
muhammadravi251001
[ACL Shared Task] Add AlpacaEval
6
#129 opened 19 days ago
by
muhammadravi251001
[Submission] Journalistic-Bias Revised
1
#135 opened 16 days ago
by
WanderingIsle
Parquet for dataset viewer
#134 opened 18 days ago
by
EvalEvalBot
Generating Parquets
6
#58 opened about 1 month ago
by
EvalEvalBot
[ACL Shared Task] Add LingOly benchmark results
5
#78 opened 27 days ago
by
ambean
[ACL Shared Task] Contribute MT-Bench results
4
#124 opened 19 days ago
by
ameek
[ACL Shared Task] Contribute Humanity's Last Exam results
7
#125 opened 19 days ago
by
ameek
Add ResearchGym rg-agent GPT-5 results
4
#130 opened 19 days ago
by
anikethh
[ACL SHARED TASK] Add OUP L2-Bench
1
#131 opened 19 days ago
by
jimmyedgell
[ACL Shared Task] Contribute LiveBench Results
2
#128 opened 19 days ago
by
saki-imai
Add GSM-MC and MATH-MC Results
5
#117 opened 20 days ago
by
sanderland
Add RewardBench 2 Results
4
#118 opened 20 days ago
by
sanderland
[ACL Shared Task] Contribute MMLU Pro results
5
#126 opened 19 days ago
by
ameek