Repository brief

PaddlePaddle/PaddleOCR

Read the upstream summary on the left, browse the cached forks below it, and load each fork comparison into the right-hand panel.

Cached analysis
cached 2026-03-30T11:51:16.063Z

PaddlePaddle/PaddleOCR

PaddleOCR is a large, active Python OCR and document-parsing repository from PaddlePaddle. It targets turning PDFs or images into structured data, supports 100+ languages, and ships as an Apache-2.0 project with a CLI package (`paddleocr`) plus documentation and deployment assets.

GitHub
Stars73,404
Forks10,052
Default branchmain
Last pushed2026-03-30T05:36:55Z
Best maintainedNone
Closest to upstreamtimminator/PaddleOCR-Standalone
Most feature-richMicro-sheep/PaddleOCR
Most opinionatedfuture1314/PaddleOCR
Forks

Choose a fork to inspect

10 cached fork briefs
PaddlePaddle/PaddleOCR · Discofork