Leaderboard Compare Benchmarks Methodology Changelog Movers Time Machine

THE AI RACETracking the global AI race

Scores reflect editorial assessment and automated benchmark data. Not investment advice. All trademarks belong to their respective owners.

THE AI RACE

Leaderboard Compare Benchmarks Methodology Changelog Movers Time Machine

THE AI RACETracking the global AI race

Methodology Changelog Benchmarks About

Scores reflect editorial assessment and automated benchmark data. Not investment advice. All trademarks belong to their respective owners.

THE AI RACE

Leaderboard Compare Benchmarks Methodology Changelog Movers Time Machine

← Back to Benchmarks

Terminal-Bench

AI agent CLI task completion in sandboxed Docker environments

Agents & ToolsUnit: %Max: 100Source →

Rankings (6 organizations)

72%

68.5%

68%

48%

5xAI

35%

6Meta AI

32%

Other Benchmarks in Agents & Tools

TAU2-bench WebArena

All Categories

Language & Knowledge Coding Reasoning & Math Image Generation Video Generation Multimodal Agents & Tools

THE AI RACETracking the global AI race

Methodology Changelog Benchmarks About

Scores reflect editorial assessment and automated benchmark data. Not investment advice. All trademarks belong to their respective owners.

THE AI RACE

Leaderboard Compare Benchmarks Methodology Changelog Movers Time Machine

THE AI RACETracking the global AI race

Methodology Changelog Benchmarks About

Scores reflect editorial assessment and automated benchmark data. Not investment advice. All trademarks belong to their respective owners.

THE AI RACE

Leaderboard Compare Benchmarks Methodology Changelog Movers Time Machine

← Back to Benchmarks

Terminal-Bench

AI agent CLI task completion in sandboxed Docker environments

Agents & ToolsUnit: %Max: 100Source →

Rankings (6 organizations)

72%

68.5%

68%

48%

5xAI

35%

6Meta AI

32%

Other Benchmarks in Agents & Tools

TAU2-bench WebArena

All Categories

Language & Knowledge Coding Reasoning & Math Image Generation Video Generation Multimodal Agents & Tools

THE AI RACETracking the global AI race

Methodology Changelog Benchmarks About

Scores reflect editorial assessment and automated benchmark data. Not investment advice. All trademarks belong to their respective owners.