Rankings
Arena.ai leaderboards: Elo op basis van blind head-to-head stemmen. Kies een categorie.
Arena.ai leaderboards: Elo op basis van blind head-to-head stemmen. Kies een categorie.
Beeld- en multimodal understanding.
28 mei 2026, 19:45
126 modellen
| # | Model | Elo |
|---|---|---|
claude-opus-4-7-thinking Anthropic 路 Proprietary 卤9 路 6.5k stemmen | 1306 | |
claude-opus-4-7 Anthropic 路 Proprietary 卤9 路 6.8k stemmen | 1304 | |
claude-opus-4-6-thinking Anthropic 路 Proprietary 卤9 路 7.1k stemmen | 1300 | |
| 4 | muse-spark Meta 路 Proprietary 卤10 路 4.5k stemmen | 1296 |
| 5 | claude-opus-4-6 Anthropic 路 Proprietary 卤8 路 8.5k stemmen | 1293 |
| 6 | gemini-3-pro Google 路 Proprietary 卤8 路 13.2k stemmen | 1289 |
| 7 | gpt-5.5 OpenAI 路 Proprietary 卤10 路 4.7k stemmen | 1288 |
| 8 | gpt-5.2-chat-latest-20260210 OpenAI 路 Proprietary 卤8 路 12.5k stemmen | 1280 |
| 9 | gpt-5.5-high OpenAI 路 Proprietary 卤10 路 4.2k stemmen | 1278 |
| 10 | gemini-3.1-pro-preview Google 路 Proprietary 卤7 路 16.8k stemmen | 1277 |
| 11 | gpt-5.4-high OpenAI 路 Proprietary 卤9 路 5.9k stemmen | 1277 |
| 12 | claude-sonnet-4-6 Anthropic 路 Proprietary 卤8 路 8.9k stemmen | 1275 |
| 13 | gpt-5.5-instant OpenAI 路 Proprietary 卤11 路 3.6k stemmen | 1275 |
| 14 | gemini-3-flash Google 路 Proprietary 卤6 路 21.2k stemmen | 1271 |
| 15 | gpt-5.4 OpenAI 路 Proprietary 卤10 路 5.6k stemmen | 1269 |
Elo-score op basis van blind head-to-head stemmen. Hoger is beter. 卤 is het 95% betrouwbaarheidsinterval. Zelfde formaat als Arena.ai.
Bron op Arena.ai