Repository brief

datalab-to/marker

Read the upstream summary on the left, browse the cached forks below it, and load each fork comparison into the right-hand panel.

Cached analysis
cached 2026-03-31T09:58:09.408Z
1mo ago

datalab-to/marker

Marker is a Python repository for converting documents, especially PDFs, into markdown, JSON, chunks, and HTML with a stated focus on speed and accuracy. It has broad document support, a large community footprint, and active recent maintenance.

GitHub
Loading tags...
Stars33,202
Forks2,298
Default branchmaster
Last pushed2026-03-10T11:23:13Z
Recommended shortcuts

Jump straight into Discofork's strongest cached fork picks, or open a compare view in one click.

Forks

Choose a fork to inspect

10 of 10 fork briefs
Selected

Prefer this fork only if Cog deployment is the main goal and you can live with an old upstream base. For active product use or ongoing development, upstream Marker is the safer default.

Prefer upstream unless you specifically need the 2024 snapshot. This fork does not add functionality, and its main tradeoff is staleness.

Prefer upstream unless you specifically need an old frozen snapshot. This fork adds no visible capabilities and is far behind current Marker, so it is a poor choice for new adoption.

Choose this fork only if its deployment packaging matches your infrastructure needs. For most adopters, upstream is the safer default because this fork is very stale and likely missing recent Marker capabilities and fixes.

Prefer upstream Marker unless you explicitly need this frozen snapshot. This fork adds no visible capabilities and is far behind upstream, so it is a poor choice for adopters who want current accuracy, bug fixes, or active maintenance.

Prefer upstream unless you specifically want a frozen, PDF-only snapshot and are prepared to own maintenance. This fork looks stale and materially behind the current Marker feature set.

Prefer upstream unless you specifically need this older snapshot; the fork adds no visible capabilities and is far behind current Marker.

Choose this fork only if the header-cleaning change is the exact behavior you need and you are prepared to own the maintenance burden. For most adopters, upstream Marker is the better default because this fork is stale and materially behind.

Choose this fork if your main need is smoother Windows installation and packaging. Stick with upstream if you want the broadest, most actively evolving base document-processing feature set and do not need the extra installer workflow.

This fork looks functionally equivalent to upstream with no visible feature additions and one upstream commit lag. Prefer upstream unless you specifically need this fork’s repository ownership or plan to maintain your own changes here.

datalab-to/marker · Discofork