NanoNets/docstrange
by NanoNets · OCR Engines · updated 7mo ago
Extract and convert data from documents, images, PDFs, Word files, PowerPoint files, or URLs into multiple formats (Markdown, JSON, CSV, HTML), with intelligent structured data extraction and advanced OCR.
Momentum: 35 · Stars: 1,483 · Forks: 133 · Rank: #119
Tags: ai · document-parser · document-parsing · image-to-markdown · llm · markdown · ocr · pdf-parser · pdf-to-json · pdf-to-markdown · structured-data · structured-data-capture