Kymata Labs/The Living IndexesBuilt by tekvisions ↗
The Document Index / PDF Extraction / #192
papercast-dev

papercast-dev/papercast

by papercast-dev · PDF Extraction · updated 1y ago

A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines.

19
momentum
53
stars
3
forks
#192
rank
arxivdagdocument-parserdocument-parsinggrobidnlppdf-converterpdf-document-processorpdf-to-textpipelinepodcastpython
View on GitHub →