Skip to content
THE AI RACE
Leaderboard
Compare
Benchmarks
Methodology
Changelog
Movers
Time Machine
⌘K
☰
THE AI RACE
Leaderboard
Compare
Benchmarks
Methodology
Changelog
Movers
Time Machine
⌘K
☰
THE AI RACE
Leaderboard
Compare
Benchmarks
Methodology
Changelog
Movers
Time Machine
⌘K
☰
← Back to Benchmarks
Terminal-Bench
AI agent CLI task completion in sandboxed Docker environments
Agents & Tools
Unit: %
Max: 100
Source →
Rankings (0 organizations)
No benchmark data available for Terminal-Bench yet.
Other Benchmarks in Agents & Tools
TAU2-bench
WebArena
All Categories
Language & Knowledge
Coding
Reasoning & Math
Image Generation
Video Generation
Multimodal
Agents & Tools