The Document Index / Collections / #156
GiftMungmeeprued/document-parsers-list
by GiftMungmeeprued · Collections · updated 11mo ago
A comprehensive list of document parsers, covering PDF-to-text conversion and layout extraction. Each tested for support of tables, equations, handwriting, two-column layouts, and multi-column layouts.
25
momentum
179
stars
2
forks
#156
rank
data-pipelinedocument-image-processingdocument-parserdocument-parsinglangchainocrpdfpdf-to-textpreprocessing
View on GitHub →