Skip to content
THE AI RACE
Leaderboard
Compare
Benchmarks
Methodology
Changelog
Movers
Time Machine
⌘K
☰
THE AI RACE
Leaderboard
Compare
Benchmarks
Methodology
Changelog
Movers
Time Machine
⌘K
☰
THE AI RACE
Leaderboard
Compare
Benchmarks
Methodology
Changelog
Movers
Time Machine
⌘K
☰
← Back to Benchmarks
SWE-bench Verified
Real-world software engineering — resolving GitHub issues
Coding
Unit: %
Max: 100
Source →
Rankings (10 organizations)
1
Google DeepMind
77.4%
2
Anthropic
76.8%
3
OpenAI
74.4%
4
Zhipu AI
72.8%
5
DeepSeek
70%
6
xAI
48%
7
Meta AI
40.6%
8
Mistral
40%
9
Cohere
32%
10
Alibaba Qwen
9%
Other Benchmarks in Coding
HumanEval+
LiveCodeBench
Aider Polyglot
BigCodeBench
All Categories
Language & Knowledge
Coding
Reasoning & Math
Image Generation
Video Generation
Multimodal
Agents & Tools