Leaderboard Compare Benchmarks Methodology Changelog Movers Time Machine

THE AI RACETracking the global AI race

Scores reflect editorial assessment and automated benchmark data. Not investment advice. All trademarks belong to their respective owners.

THE AI RACE

Leaderboard Compare Benchmarks Methodology Changelog Movers Time Machine

THE AI RACETracking the global AI race

Methodology Changelog Benchmarks About

Scores reflect editorial assessment and automated benchmark data. Not investment advice. All trademarks belong to their respective owners.

THE AI RACE

Leaderboard Compare Benchmarks Methodology Changelog Movers Time Machine

← Back to Benchmarks

TAU2-bench

Conversational AI agent task completion (retail, airline, telecom)

Agents & ToolsUnit: %Max: 100Source →

Rankings (7 organizations)

65%

62%

58%

42%

5xAI

38%

6Meta AI

35%

7Cohere

32%

Other Benchmarks in Agents & Tools

WebArena Terminal-Bench

All Categories

Language & Knowledge Coding Reasoning & Math Image Generation Video Generation Multimodal Agents & Tools

THE AI RACETracking the global AI race

Methodology Changelog Benchmarks About

Scores reflect editorial assessment and automated benchmark data. Not investment advice. All trademarks belong to their respective owners.

THE AI RACE

Leaderboard Compare Benchmarks Methodology Changelog Movers Time Machine

THE AI RACETracking the global AI race

Methodology Changelog Benchmarks About

Scores reflect editorial assessment and automated benchmark data. Not investment advice. All trademarks belong to their respective owners.

THE AI RACE

Leaderboard Compare Benchmarks Methodology Changelog Movers Time Machine

← Back to Benchmarks

TAU2-bench

Conversational AI agent task completion (retail, airline, telecom)

Agents & ToolsUnit: %Max: 100Source →

Rankings (7 organizations)

65%

62%

58%

42%

5xAI

38%

6Meta AI

35%

7Cohere

32%

Other Benchmarks in Agents & Tools

WebArena Terminal-Bench

All Categories

Language & Knowledge Coding Reasoning & Math Image Generation Video Generation Multimodal Agents & Tools

THE AI RACETracking the global AI race

Methodology Changelog Benchmarks About

Scores reflect editorial assessment and automated benchmark data. Not investment advice. All trademarks belong to their respective owners.