We built the fastest PDF -> markdown parser in the world 🚀⚡️
AND it’s more accurate than any other open-source, model-free parser (pymupdf4llm, opendataloader, pdf-inspector, markitdown) on 3 standardized benchmarks: olmOCR-bench, opendataloader-bench, ParseBench
Introducing LiteParse v2.1. The v2 base version was already the fastest document->text parser on the planet, and with this new release we’ve introduced markdown output.
It is fully open-source (Apache 2.0) and free, is usable from CLI/Rust/Node/Python/WASM, and is also installable as a one-click agent skill.
Come check out LiteParse:
LiteParse v2.1 is here, and it's bringing the fastest markdown output possible.
In this release, we are fulfilling our top request: markdown output. But in the spirit of "lite"-ness, we are doing this completely LLM-free and fast.
Not only is it fast, it also beats all other model-free competitors on 3 separate benchmarks.
Read more about it in our release blog: