Pull requests: evaleval/every_eval_ever

Pull requests list

Add BenchPress score-matrix adapter (utils/benchpress)
#197 opened Jun 30, 2026 by borgr (Collaborator)
[Submission] Add tau-bench leaderboard adapter
#192 opened Jun 22, 2026 by benshi34
[Adapter] Add AlpacaEval 1.0 and 2.0 leaderboard adapter
#190 opened Jun 20, 2026 by karthikchundi-commits (Contributor)
Add BountyBench converter to utils
#188 opened Jun 15, 2026 by borgr (Collaborator)
Add LEXam public leaderboard converter
#160 opened Jun 8, 2026 by JoelNiklaus
[Feature] Add text-to-image modality support
#137 opened May 18, 2026 by felifri (label: stale; 5 tasks done)
Fix LLM Stats evaluator provenance schema
#136 opened May 16, 2026 by tommasocerruti (Member)
[DRAFT] Transparent Compression
#133 opened May 8, 2026 by Erotemic (Collaborator)
Canonical identity and schema upgrade tooling
#116 opened Apr 24, 2026 by yananlong (Contributor)