State of the Art LLMs
March 2025 Benchmark Rankings
0
25
50
75
100
Reasoning (GPQA)
DeepSeek R1
94
OpenAI o1
92
Claude 3 Opus
88
Coding (SWE Bench)
DeepSeek R1
96
OpenAI o1
93
GPT-4.5
90