Rankings
Arena.ai leaderboards: Elo op basis van blind head-to-head stemmen. Kies een categorie.
Arena.ai leaderboards: Elo op basis van blind head-to-head stemmen. Kies een categorie.
Chat & taalmodellen (head-to-head stemmen).
28 mei 2026, 19:45
360 modellen
| # | Model | Elo |
|---|---|---|
claude-opus-4-6-thinking Anthropic 路 Proprietary 卤4 路 34.2k stemmen | 1502 | |
claude-opus-4-7-thinking Anthropic 路 Proprietary 卤5 路 20.0k stemmen | 1500 | |
claude-opus-4-6 Anthropic 路 Proprietary 卤4 路 36.5k stemmen | 1498 | |
| 4 | claude-opus-4-7 Anthropic 路 Proprietary 卤5 路 20.7k stemmen | 1494 |
| 5 | muse-spark Meta 路 Proprietary 卤6 路 12.2k stemmen | 1489 |
| 6 | gemini-3.1-pro-preview Google 路 Proprietary 卤4 路 43.7k stemmen | 1487 |
| 7 | gemini-3-pro Google 路 Proprietary 卤4 路 41.3k stemmen | 1486 |
| 8 | gpt-5.5-high OpenAI 路 Proprietary 卤6 路 16.6k stemmen | 1482 |
| 9 | gpt-5.4-high OpenAI 路 Proprietary 卤5 路 28.2k stemmen | 1480 |
| 10 | gemini-3.5-flash Google 路 Proprietary 卤7 路 9.0k stemmen | 1479 |
| 11 | gpt-5.5 OpenAI 路 Proprietary 卤6 路 16.9k stemmen | 1476 |
| 12 | gpt-5.2-chat-latest-20260210 OpenAI 路 Proprietary 卤4 路 32.3k stemmen | 1476 |
| 13 | grok-4.20-beta1 xAI 路 Proprietary 卤5 路 24.5k stemmen | 1476 |
| 14 | grok-4.20-beta-0309-reasoning xAI 路 Proprietary 卤5 路 29.1k stemmen | 1475 |
| 15 | qwen3.7-max-preview Alibaba 路 Proprietary 卤10 路 3.8k stemmen | 1475 |
Elo-score op basis van blind head-to-head stemmen. Hoger is beter. 卤 is het 95% betrouwbaarheidsinterval. Zelfde formaat als Arena.ai.
Bron op Arena.ai