ispras/dedoc

by ispras · PDF Extraction · updated 1mo ago

Dedoc is a library (and service) for automated document parsing that converts documents to a uniform format. It automatically extracts content, logical structure, tables, and metadata from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser

momentum 57 · stars 712 · forks 57 · rank #53
doc · document-analysis · document-content-extraction · documents · docx · docx-parser · excel · html · html-parser · logical-structure-extraction · ocr · odt