The Document Index

The Document Index https://document.kymatalabs.com The living index of document-AI tooling — OCR, PDF extraction, document parsing, layout, tables and VLM understanding. PaddlePaddle/PaddleOCR — momentum 87https://document.kymatalabs.com/p/paddlepaddle-paddleocr/PaddlePaddle/PaddleOCRTurn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages. opendatalab/MinerU — momentum 86https://document.kymatalabs.com/p/opendatalab-mineru/opendatalab/MinerUTransforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows. docling-project/docling — momentum 86https://document.kymatalabs.com/p/docling-project-docling/docling-project/doclingGet your documents ready for gen AI tesseract-ocr/tesseract — momentum 85https://document.kymatalabs.com/p/tesseract-ocr-tesseract/tesseract-ocr/tesseractTesseract Open Source OCR Engine (main repository) ocrmypdf/OCRmyPDF — momentum 83https://document.kymatalabs.com/p/ocrmypdf-ocrmypdf/ocrmypdf/OCRmyPDFOCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched datalab-to/marker — momentum 82https://document.kymatalabs.com/p/datalab-to-marker/datalab-to/markerConvert PDF to markdown + JSON quickly with high accuracy opendataloader-project/opendataloader-pdf — momentum 81https://document.kymatalabs.com/p/opendataloader-project-opendataloader-pdf/opendataloader-project/opendataloader-pdfPDF Parser for AI-ready data. Automate PDF accessibility. Open-source. datalab-to/surya — momentum 80https://document.kymatalabs.com/p/datalab-to-surya/datalab-to/suryaOCR, layout analysis, reading order, table recognition in 90+ languages naptha/tesseract.js — momentum 78https://document.kymatalabs.com/p/naptha-tesseract-js/naptha/tesseract.jsPure Javascript OCR for more than 100 Languages 📖🎉🖥 Unstructured-IO/unstructured — momentum 78https://document.kymatalabs.com/p/unstructured-io-unstructured/Unstructured-IO/unstructuredConvert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning pymupdf/PyMuPDF — momentum 77https://document.kymatalabs.com/p/pymupdf-pymupdf/pymupdf/PyMuPDFPyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents. run-llama/liteparse — momentum 77https://document.kymatalabs.com/p/run-llama-liteparse/run-llama/liteparseA fast, helpful, and open-source document parser RapidAI/RapidOCR — momentum 75https://document.kymatalabs.com/p/rapidai-rapidocr/RapidAI/RapidOCR📄 Awesome OCR multiple programing languages toolkits based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch. Zipstack/unstract — momentum 75https://document.kymatalabs.com/p/zipstack-unstract/Zipstack/unstractLLM-Driven Extraction of Unstructured Data — Built for API Deployments & ETL Pipeline Workflows PaddlePaddle/PaddleX — momentum 74https://document.kymatalabs.com/p/paddlepaddle-paddlex/PaddlePaddle/PaddleXAll-in-One Development Tool based on PaddlePaddle mindee/doctr — momentum 74https://document.kymatalabs.com/p/mindee-doctr/mindee/doctrdocTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning. DayBreak-u/chineseocr_lite — momentum 73https://document.kymatalabs.com/p/daybreak-u-chineseocr-lite/DayBreak-u/chineseocr_lite超轻量级中文ocr，支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M deepdoctection/deepdoctection — momentum 71https://document.kymatalabs.com/p/deepdoctection-deepdoctection/deepdoctection/deepdoctectionA Repo For Document AI shipfastlabs/parsel — momentum 71https://document.kymatalabs.com/p/shipfastlabs-parsel/shipfastlabs/parselA fast, helpful, and open-source document parser for PHP UglyToad/PdfPig — momentum 70https://document.kymatalabs.com/p/uglytoad-pdfpig/UglyToad/PdfPigRead and extract text and other content from PDFs in C# (port of PDFBox) datalab-to/chandra — momentum 68https://document.kymatalabs.com/p/datalab-to-chandra/datalab-to/chandraOCR model that handles complex tables, forms, handwriting with full layout. Yuliang-Liu/MonkeyOCR — momentum 68https://document.kymatalabs.com/p/yuliang-liu-monkeyocr/Yuliang-Liu/MonkeyOCRA lightweight LMM-based Document Parsing Model run-llama/llama_cloud_services — momentum 68https://document.kymatalabs.com/p/run-llama-llama-cloud-services/run-llama/llama_cloud_servicesKnowledge Agents and Management in the Cloud jingsongliujing/OnnxOCR — momentum 68https://document.kymatalabs.com/p/jingsongliujing-onnxocr/jingsongliujing/OnnxOCR基于PaddleOCR重构，并且脱离PaddlePaddle深度学习训练框架的轻量级OCR，推理速度超快 —— A lightweight OCR system based on PaddleOCR, decoupled from the PaddlePaddle deep learning training framework, with ultra-fast inference speed. shcherbak-ai/contextgem — momentum 67https://document.kymatalabs.com/p/shcherbak-ai-contextgem/shcherbak-ai/contextgemContextGem: Effortless LLM extraction from documents kotaro-kinoshita/yomitoku — momentum 67https://document.kymatalabs.com/p/kotaro-kinoshita-yomitoku/kotaro-kinoshita/yomitokuYomiTokuはAIを活用した日本語文書解析エンジンを提供するPythonパッケージです。 Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language. zai-org/GLM-OCR — momentum 66https://document.kymatalabs.com/p/zai-org-glm-ocr/zai-org/GLM-OCRGLM-OCR: Accurate × Fast × Comprehensive firecrawl/pdf-inspector — momentum 66https://document.kymatalabs.com/p/firecrawl-pdf-inspector/firecrawl/pdf-inspectorFast Rust library for PDF inspection, classification, and text extraction. Intelligently detects scanned vs text-based PDFs to enable smart routing decisions. unjs/unpdf — momentum 66https://document.kymatalabs.com/p/unjs-unpdf/unjs/unpdf📄 PDF extraction and rendering across all JavaScript runtimes YaoFANGUK/video-subtitle-extractor — momentum 65https://document.kymatalabs.com/p/yaofanguk-video-subtitle-extractor/YaoFANGUK/video-subtitle-extractor视频硬字幕提取，生成srt文件。无需申请第三方API，本地实现文本识别。基于深度学习的视频字幕提取框架，包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.