-
Notifications
You must be signed in to change notification settings - Fork 42
Pull requests: evaleval/every_eval_ever
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add BenchPress score-matrix adapter (utils/benchpress)
#197
opened Jun 30, 2026 by
borgr
Collaborator
Loading…
Move validator business logic into every_eval_ever and update to de-duplication logic
#194
opened Jun 29, 2026 by
nelaturuharsha
Collaborator
Loading…
[Adapter] Add AlpacaEval 1.0 and 2.0 leaderboard adapter
#190
opened Jun 20, 2026 by
karthikchundi-commits
Contributor
Loading…
arc_agi adapter for flat storage with manifest and instance_level indexes for hf
#184
opened Jun 11, 2026 by
DeepLumiere
Loading…
Add Vectara Hallucination Leaderboard adapter
#157
opened Jun 3, 2026 by
mohammadrezakarami
Loading…
[Feature] Add text-to-image modality support
stale
#137
opened May 18, 2026 by
felifri
Loading…
5 tasks done
Fix LLM Stats evaluator provenance
schema
#136
opened May 16, 2026 by
tommasocerruti
Member
Loading…
Canonical identity and schema upgrade tooling
#116
opened Apr 24, 2026 by
yananlong
Contributor
Loading…
Draft Proposal: Agent Session Result Layer
stale
#70
opened Mar 17, 2026 by
elronbandel
Contributor
•
Draft
ProTip!
Follow long discussions with comments:>50.