Skip to content

feat: Terminal-Bench 2.0 harness, Pi agent wrapper, and vox bench subcommand#2

Draft
Copilot wants to merge 3 commits into
mainfrom
copilot/add-terminal-bench-2-0-benchmark
Draft

feat: Terminal-Bench 2.0 harness, Pi agent wrapper, and vox bench subcommand#2
Copilot wants to merge 3 commits into
mainfrom
copilot/add-terminal-bench-2-0-benchmark

fix: address code review — move contextlib import to module level, fi…

f8e8ef6
Select commit
Loading
Failed to load commit list.

There are no checks for this commit