Skip to content

Pull requests: OpenHands/benchmarks

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add OpenAgentSafety to eval CI
#221 opened Dec 29, 2025 by simonrosenberg Loading…
Add Multi-SWE-bench image build support
#219 opened Dec 29, 2025 by simonrosenberg Loading…
Rename swebench/swtbench workflow files
#218 opened Dec 29, 2025 by simonrosenberg Loading…
Add total_duration to cost report summary
#203 opened Dec 24, 2025 by simonrosenberg Loading…
build(deps): bump the version-all group across 1 directory with 14 updates dependencies Pull requests that update a dependency file python:uv Pull requests that update python:uv code
#186 opened Dec 22, 2025 by dependabot bot Loading…
Laminar evaluations
#175 opened Dec 18, 2025 by Rainhunter13 Loading…
Agentic code search
#141 opened Dec 8, 2025 by adityasoni9998 Loading…
API-based Critic implementation build-swebench-200 Build 200 SWE-Bench Verified Image based on SDK version on this PR.
#117 opened Nov 26, 2025 by xingyaoww Draft
ProTip! Add no:assignee to see everything that’s not assigned.