-
Notifications
You must be signed in to change notification settings - Fork 43
Pull requests: OpenHands/benchmarks
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: auto-detect pre-built Docker images across all benchmarks
#456
opened Feb 26, 2026 by
simonrosenberg
Loading…
5 tasks done
refactor: Replace ProcessPoolExecutor with asyncio for evaluation
#446
opened Feb 25, 2026 by
simonrosenberg
•
Draft
Recycle worker processes to prevent OOM from heap fragmentation
#442
opened Feb 24, 2026 by
simonrosenberg
Loading…
3 tasks
Add ACPAgent support for SWE-bench evaluation
#440
opened Feb 23, 2026 by
simonrosenberg
•
Draft
3 tasks
fix(swtbench): align docker workspace image building with SWE-bench
#437
opened Feb 23, 2026 by
simonrosenberg
Loading…
build(deps): bump the version-all group across 1 directory with 4 updates
dependencies
Pull requests that update a dependency file
github_actions
Pull requests that update GitHub Actions code
#435
opened Feb 23, 2026 by
dependabot
bot
Loading…
Enable configurable context condensation in all benchmarks
#429
opened Feb 18, 2026 by
juanmichelini
Loading…
Add git reset validation script and fix missing resets
#425
opened Feb 17, 2026 by
juanmichelini
•
Draft
Fix: laminar trace timeline to account for idle wait time
#415
opened Feb 13, 2026 by
Rainhunter13
Loading…
build(deps): bump the version-all group across 1 directory with 17 updates
dependencies
Pull requests that update a dependency file
python:uv
Pull requests that update python:uv code
#404
opened Feb 9, 2026 by
dependabot
bot
Loading…
fix(swtbench): prevent build workflow from hanging indefinitely
#403
opened Feb 6, 2026 by
juanmichelini
Loading…
BREAKING: Rename --max-attempts to --n-critic-runs
#325
opened Jan 16, 2026 by
juanmichelini
•
Draft
Fix dataset loading schema validation issue in CI
#304
opened Jan 13, 2026 by
juanmichelini
Loading…
Add add_resolve_rate_to_predictions function to output_utils
#199
opened Dec 23, 2025 by
juanmichelini
•
Draft
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.