The Document Index / VLM & Understanding / #36
kreuzberg-dev/html-to-markdown
by kreuzberg-dev · VLM & Understanding · updated today
High performance and CommonMark compliant HTML to Markdown converter. Maintained by the Kreuzberg team. Kreuzberg is a fast, polyglot document intelligence engine with a Rust core. It extracts structured data from 56+ document formats using streaming parsers and built-in OCR.
64
momentum
766
stars
58
forks
#36
rank
hocrhtmlhtml-convertermarkdownmarkdown-converterragtext-extractiontext-processing
View on GitHub →