By PaddlePaddle

PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Filed under Document AI & OCR. Status: Published

Stars: 79k
Forks: 10.5k
Open issues: 214
Last commit: 3d ago
Stats refreshed: May 30, 2026

On the maker

PaddlePaddle

Pricing

Not yet curated

Field notes

No field notes yet.

Field notes for PaddleOCR will land here when sources support a confident take. They are synthesized from postmortems, vendor retros, dev-team blogs, deeply engaged GitHub issues, and our own builds.

Coverage isn’t promised on every tool; empty sections are honest. Field notes are curated, not generated from vendor copy.

Benchmarks

Scores aren’t in yet.

We’re wiring up SWE-bench, Aider Polyglot, and a custom dev-task suite next. Methodology will be public; vendor pre-notification is 48 hours.

How we make money

This directory is supported by display advertising. Advertisers do not influence editorial rankings, benchmark scoring, or which tools are featured. Tools are ordered by data.

Editorial independence policy →