elo · Higher = better
Chatbot Arena Elo
Human-preference Elo rating from the LMSYS Chatbot Arena. It measures conversational quality through pairwise battle votes, and it is the only Phase-1 benchmark based on a human-preference signal rather than a capability metric.
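For readers unfamiliar with how a rating gap maps to vote outcomes, here is a minimal sketch of the classic Elo formulas. It assumes the conventional base-10, 400-point scale and an illustrative K-factor of 32; the function names are hypothetical, and the Arena's own rating computation may differ from this textbook form.

```python
def expected_score(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Probability that A is preferred over B under the classic Elo model.

    Assumes the conventional base-10 logistic curve with a 400-point scale.
    """
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / scale))


def update_ratings(rating_a: float, rating_b: float, outcome_a: float,
                   k: float = 32.0) -> tuple[float, float]:
    """Apply one pairwise vote to both ratings.

    outcome_a is 1.0 if A wins the vote, 0.0 if B wins, 0.5 for a tie.
    k is an illustrative K-factor, not a value taken from the Arena.
    """
    delta = k * (outcome_a - expected_score(rating_a, rating_b))
    # Whatever A gains, B loses, so the rating total is conserved.
    return rating_a + delta, rating_b - delta


if __name__ == "__main__":
    # A 27-point gap (1499 vs 1472) corresponds to roughly a 54% expected win rate.
    print(round(expected_score(1499, 1472), 2))
```

Under this model, small gaps near the top of the table correspond to only a modest edge in head-to-head votes.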
Leaderboard
| Rank | Tool | Score | Run date |
|---|---|---|---|
| 01 | Anthropic API | 1499 | Jun 1, 2026 |
| 02 | OpenAI API | 1472 | Jun 1, 2026 |
| 03 | Mistral API | 1430 | Jun 1, 2026 |
Scores reflect the most recent run per tool. Historical runs are kept for trend tracking. Methodology is public. Send corrections to hello@vybing.dev.
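The "most recent run per tool" rule can be sketched as follows. This is an illustration only: the record layout, the `latest_per_tool` helper, and the sample values (placeholder tool names and scores) are hypothetical and do not come from the leaderboard's data or code.

```python
from datetime import date


def latest_per_tool(runs: list[dict]) -> list[dict]:
    """Keep only the newest run for each tool, then rank by score (higher is better)."""
    latest: dict[str, dict] = {}
    for run in runs:
        tool = run["tool"]
        # Replace the stored run only when this one is strictly newer.
        if tool not in latest or run["run_date"] > latest[tool]["run_date"]:
            latest[tool] = run
    return sorted(latest.values(), key=lambda r: r["score"], reverse=True)


# Placeholder history, not real leaderboard data.
history = [
    {"tool": "tool-a", "score": 1200, "run_date": date(2026, 5, 1)},
    {"tool": "tool-a", "score": 1210, "run_date": date(2026, 6, 1)},
    {"tool": "tool-b", "score": 1250, "run_date": date(2026, 6, 1)},
]

if __name__ == "__main__":
    for rank, run in enumerate(latest_per_tool(history), start=1):
        print(f"{rank:02d} {run['tool']} {run['score']} {run['run_date']}")
```

Older runs stay in `history` untouched, so they remain available for trend tracking.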