The Document Index / PDF Extraction / #192
papercast-dev/papercast
by papercast-dev · PDF Extraction · updated 1y ago
A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines.
19
momentum
53
stars
3
forks
#192
rank
arxivdagdocument-parserdocument-parsinggrobidnlppdf-converterpdf-document-processorpdf-to-textpipelinepodcastpython
View on GitHub →