percent · Higher = better
SWE-bench Lite
Measures coding-agent ability on a 300-issue subset of real GitHub issues. The score is the end-to-end issue resolution rate: the percentage of issues the agent resolves. Expected: Q3 2026.
Leaderboard
| Rank | Tool | Score | Run date |
|---|---|---|---|
| 01 | Aider | 26.3% | Jun 1, 2026 |
Scores reflect the most recent run per tool. Historical runs are kept for trend tracking. Methodology is public. Send corrections to hello@vybing.dev.
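As a rough illustration of how a score and the "most recent run per tool" rule could be computed, here is a minimal Python sketch. It is not the site's actual pipeline: the `Run` record, the `latest_per_tool` helper, the tool names, and the resolved counts are all hypothetical, and the only figure taken from the description above is the 300-issue subset size.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Run:
    """One benchmark run (hypothetical record layout, for illustration only)."""
    tool: str
    run_date: date
    resolved: int     # issues resolved end to end
    total: int = 300  # size of the issue subset

    @property
    def score(self) -> float:
        # Resolution rate as a percentage, rounded to one decimal place.
        return round(100 * self.resolved / self.total, 1)


def latest_per_tool(runs):
    """Keep only the most recent run per tool, ranked by score (higher is better)."""
    latest = {}
    for run in runs:
        current = latest.get(run.tool)
        if current is None or run.run_date > current.run_date:
            latest[run.tool] = run
    return sorted(latest.values(), key=lambda r: r.score, reverse=True)


# Made-up runs; older runs stay in the list for trend tracking.
runs = [
    Run("tool-a", date(2026, 3, 1), 60),
    Run("tool-a", date(2026, 6, 1), 79),
    Run("tool-b", date(2026, 5, 15), 90),
]

board = latest_per_tool(runs)
for rank, run in enumerate(board, start=1):
    print(f"{rank:02d} | {run.tool} | {run.score}% | {run.run_date:%b %d, %Y}")
```

The full `runs` list is left untouched so historical runs remain available, while `board` holds only the latest run for each tool.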