shcherbak-ai/contextgem
ContextGem: Effortless LLM extraction from documents
Language: Python
Total stars: 674
Stars trend:
11 May 2025
1pm █▏ +9
2pm █▍ +11
3pm █ +8
4pm ▊ +6
5pm █ +8
6pm ▍ +3
7pm ▍ +3
8pm ▍ +3
9pm ▍ +3
10pm ▌ +4
11pm █▎ +10
12 May 2025
12am ▉ +7
#python
#ai, #contractanalysis, #dataextraction, #documentintelligence, #docx, #docx2md, #docx2txt, #generativeai, #legaltech, #llm, #llmextraction, #llmframework, #llmpipeline, #llms, #nlp, #promptengineering, #textanalysis, #unstructureddata
kreuzberg-dev/kreuzberg
A polyglot document intelligence framework with a Rust core. Extract text, metadata, and structured information from PDFs, Office documents, images, and 50+ formats. Available for Rust, Python, Ruby, Go, and TypeScript/Node.js—or use via CLI, REST API, or MCP server.
Language: HTML
Total stars: 2721
Stars trend:
15 Dec 2025
8am ▏ +1
9am ▌ +4
10am ▊ +6
11am ▊ +6
12pm ▋ +5
1pm ▍ +3
2pm ▉ +7
#html
#documentintelligence, #ffi, #golang, #java, #metadataextraction, #node, #pdfextraction, #pdfium, #python, #rag, #ruby, #rust, #tableextraction, #tesseract, #textextraction, #wasm