percent · Higher = better

SWE-bench Lite

Coding-agent ability on a 300-issue subset of real GitHub issues. Measures end-to-end issue resolution rate. Expected: Q3 2026.

Leaderboard

Rank	Tool	Score	Run date
01	Aider	26.3%	Jun 1, 2026

Scores reflect the most recent run per tool. Historical runs are kept for trend tracking. Methodology is public. Corrections to hello@vybing.dev.