By PaddlePaddle
PaddleOCR
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Filed under Document AI & OCR. Status: Published
- Stars: 79k
- Forks: 10.5k
- Open issues: 214
- Last commit: 3d ago
- Stats refreshed: May 30, 2026
On the maker
Pricing
Not yet curated
Field notes
No field notes yet.
Field notes for PaddleOCR will land here when sources support a confident take. They are synthesized from postmortems, vendor retros, dev-team blogs, deeply engaged GitHub issues, and our own builds.
Coverage isn’t promised on every tool; empty sections are honest. Field notes are curated, not generated from vendor copy.
Benchmarks
Scores aren’t in yet.
We’re wiring up SWE-bench, Aider Polyglot, and a custom dev-task suite next. Methodology will be public; vendor pre-notification is 48 hours.
How we make money
This directory is supported by display advertising. Advertisers do not influence editorial rankings, benchmark scoring, or which tools are featured. Tools are ordered by data.