kreuzberg-dev/kreuzberg
A polyglot document intelligence framework with a Rust core. Extract text, metadata, and structured information from PDFs, Office documents, images, and 50+ formats. Available for Rust, Python, Ruby, Go, and TypeScript/Node.js—or use via CLI, REST API, or MCP server.
Language:HTML
Total stars: 2721
Stars trend:
#html
#documentintelligence, #ffi, #golang, #java, #metadataextraction, #node, #pdfextraction, #pdfium, #python, #rag, #ruby, #rust, #tableextraction, #tesseract, #textextraction, #wasm
A polyglot document intelligence framework with a Rust core. Extract text, metadata, and structured information from PDFs, Office documents, images, and 50+ formats. Available for Rust, Python, Ruby, Go, and TypeScript/Node.js—or use via CLI, REST API, or MCP server.
Language:HTML
Total stars: 2721
Stars trend:
15 Dec 2025
8am ▏ +1
9am ▌ +4
10am ▊ +6
11am ▊ +6
12pm ▋ +5
1pm ▍ +3
2pm ▉ +7#html
#documentintelligence, #ffi, #golang, #java, #metadataextraction, #node, #pdfextraction, #pdfium, #python, #rag, #ruby, #rust, #tableextraction, #tesseract, #textextraction, #wasm
❤1