Turn any document into structured data for AI agents!
Firecrawl just released a new parse endpoint. Upload local files or non-public documents and get back clean, LLM-ready data.
The parse endpoint converts PDF, DOCX, XLSX, HTML, and other formats into Markdown, JSON, or structured output. Reading order and tables are preserved.
Upload a file via multipart/form-data. The endpoint processes it using a Rust-based engine (up to 5x faster) and returns your chosen format.
Key capabilities:
• Multiple output formats: Markdown, JSON, HTML, summaries, extracted links, or metadata
• Preserves document structure, reading order, and tables
• Extracts metadata automatically (title, description, language)
• Zero data retention option (document not logged or stored)
• Content filtering via includeTags and excludeTags
Built for AI agent pipelines that need clean document data at scale.
I've shared the link in the comments!
显示更多