Improve terminal-bench eval execution defaults #1031

Draft
wgqqqqq wants to merge 16 commits into GCWing:evals-on-release from wgqqqqq:evals-on-release

Commits

Commits on May 29, 2026

Commits on May 30, 2026

Commits on Jun 1, 2026

Commits on Jun 2, 2026