๐ฅ Trending Repository: PaddleOCR
๐ Description: Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
๐ Repository URL: https://github.com/PaddlePaddle/PaddleOCR
๐ Website: https://www.paddleocr.ai
๐ Readme: https://github.com/PaddlePaddle/PaddleOCR#readme
๐ Statistics:
๐ Stars: 53.9K stars
๐ Watchers: 470
๐ด Forks: 8.6K forks
๐ป Programming Languages: Python - C++ - Shell - Java - CMake - Cuda
๐ท๏ธ Related Topics:
==================================
๐ง By: https://t.me/DataScienceM
๐ Description: Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
๐ Repository URL: https://github.com/PaddlePaddle/PaddleOCR
๐ Website: https://www.paddleocr.ai
๐ Readme: https://github.com/PaddlePaddle/PaddleOCR#readme
๐ Statistics:
๐ Stars: 53.9K stars
๐ Watchers: 470
๐ด Forks: 8.6K forks
๐ป Programming Languages: Python - C++ - Shell - Java - CMake - Cuda
๐ท๏ธ Related Topics:
#ocr #db #kie #crnn #document_translation #ocrlite #chineseocr #pp_ocr #document_parsing #pp_structure #pdf2markdown #chatocr
==================================
๐ง By: https://t.me/DataScienceM
๐ฅ Trending Repository: Dolphin
๐ Description: The official repo for โDolphin: Document Image Parsing via Heterogeneous Anchor Promptingโ, ACL, 2025.
๐ Repository URL: https://github.com/bytedance/Dolphin
๐ Readme: https://github.com/bytedance/Dolphin#readme
๐ Statistics:
๐ Stars: 6.3K stars
๐ Watchers: 53
๐ด Forks: 516 forks
๐ป Programming Languages: Python - Shell
๐ท๏ธ Related Topics:
==================================
๐ง By: https://t.me/DataScienceM
๐ Description: The official repo for โDolphin: Document Image Parsing via Heterogeneous Anchor Promptingโ, ACL, 2025.
๐ Repository URL: https://github.com/bytedance/Dolphin
๐ Readme: https://github.com/bytedance/Dolphin#readme
๐ Statistics:
๐ Stars: 6.3K stars
๐ Watchers: 53
๐ด Forks: 516 forks
๐ป Programming Languages: Python - Shell
๐ท๏ธ Related Topics:
#python #pdf #parser #ocr #pdf_converter #document_analysis #pdf_parser #layout_analysis #vlm_ocr
==================================
๐ง By: https://t.me/DataScienceM
๐ฅ Trending Repository: PDFMathTranslate
๐ Description: PDF scientific paper translation with preserved formats - ๅบไบ AI ๅฎๆดไฟ็ๆ็็ PDF ๆๆกฃๅ จๆๅ่ฏญ็ฟป่ฏ๏ผๆฏๆ Google/DeepL/Ollama/OpenAI ็ญๆๅก๏ผๆไพ CLI/GUI/MCP/Docker/Zotero
๐ Repository URL: https://github.com/Byaidu/PDFMathTranslate
๐ Website: https://pdf2zh.com
๐ Readme: https://github.com/Byaidu/PDFMathTranslate#readme
๐ Statistics:
๐ Stars: 28.2K stars
๐ Watchers: 104
๐ด Forks: 2.5K forks
๐ป Programming Languages: Python
๐ท๏ธ Related Topics:
==================================
๐ง By: https://t.me/DataScienceM
๐ Description: PDF scientific paper translation with preserved formats - ๅบไบ AI ๅฎๆดไฟ็ๆ็็ PDF ๆๆกฃๅ จๆๅ่ฏญ็ฟป่ฏ๏ผๆฏๆ Google/DeepL/Ollama/OpenAI ็ญๆๅก๏ผๆไพ CLI/GUI/MCP/Docker/Zotero
๐ Repository URL: https://github.com/Byaidu/PDFMathTranslate
๐ Website: https://pdf2zh.com
๐ Readme: https://github.com/Byaidu/PDFMathTranslate#readme
๐ Statistics:
๐ Stars: 28.2K stars
๐ Watchers: 104
๐ด Forks: 2.5K forks
๐ป Programming Languages: Python
๐ท๏ธ Related Topics:
#python #pdf #latex #translation #math #mcp #japanese #english #openai #translate #document #chinese #edit #modify #russian #korean #zotero #obsidian #pdf2zh
==================================
๐ง By: https://t.me/DataScienceM
๐ฅ Trending Repository: MinerU
๐ Description: Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
๐ Repository URL: https://github.com/opendatalab/MinerU
๐ Website: https://opendatalab.github.io/MinerU/
๐ Readme: https://github.com/opendatalab/MinerU#readme
๐ Statistics:
๐ Stars: 45.7K stars
๐ Watchers: 183
๐ด Forks: 3.8K forks
๐ป Programming Languages: Python - Dockerfile
๐ท๏ธ Related Topics:
==================================
๐ง By: https://t.me/DataScienceM
๐ Description: Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
๐ Repository URL: https://github.com/opendatalab/MinerU
๐ Website: https://opendatalab.github.io/MinerU/
๐ Readme: https://github.com/opendatalab/MinerU#readme
๐ Statistics:
๐ Stars: 45.7K stars
๐ Watchers: 183
๐ด Forks: 3.8K forks
๐ป Programming Languages: Python - Dockerfile
๐ท๏ธ Related Topics:
#python #pdf #parser #ocr #pdf_converter #extract_data #document_analysis #pdf_parser #layout_analysis #ai4science #pdf_extractor_rag #pdf_extractor_llm #pdf_extractor_pretrain
==================================
๐ง By: https://t.me/DataScienceM
โค1
๐ฅ Trending Repository: ragflow
๐ Description: RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
๐ Repository URL: https://github.com/infiniflow/ragflow
๐ Website: https://ragflow.io
๐ Readme: https://github.com/infiniflow/ragflow#readme
๐ Statistics:
๐ Stars: 69.3K stars
๐ Watchers: 310
๐ด Forks: 7.5K forks
๐ป Programming Languages: Python - TypeScript - Less - Shell - HTML - CSS
๐ท๏ธ Related Topics:
==================================
๐ง By: https://t.me/DataScienceM
๐ Description: RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
๐ Repository URL: https://github.com/infiniflow/ragflow
๐ Website: https://ragflow.io
๐ Readme: https://github.com/infiniflow/ragflow#readme
๐ Statistics:
๐ Stars: 69.3K stars
๐ Watchers: 310
๐ด Forks: 7.5K forks
๐ป Programming Languages: Python - TypeScript - Less - Shell - HTML - CSS
๐ท๏ธ Related Topics:
#agent #ai #deep_learning #mcp #multi_agent #openai #document_parser #ai_search #rag #document_understanding #llm #agentic #retrieval_augmented_generation #ollama #deepseek #graphrag #agentic_workflow #agentic_ai #deepseek_r1 #deep_research
==================================
๐ง By: https://t.me/DataScienceM
โค1
๐ฅ Trending Repository: ConvertX
๐ Description: ๐พ Self-hosted online file converter. Supports 1000+ formats โ๏ธ
๐ Repository URL: https://github.com/C4illin/ConvertX
๐ Readme: https://github.com/C4illin/ConvertX#readme
๐ Statistics:
๐ Stars: 10.4K stars
๐ Watchers: 24
๐ด Forks: 533 forks
๐ป Programming Languages: TypeScript - JavaScript - Dockerfile - CSS
๐ท๏ธ Related Topics:
==================================
๐ง By: https://t.me/DataScienceM
๐ Description: ๐พ Self-hosted online file converter. Supports 1000+ formats โ๏ธ
๐ Repository URL: https://github.com/C4illin/ConvertX
๐ Readme: https://github.com/C4illin/ConvertX#readme
๐ Statistics:
๐ Stars: 10.4K stars
๐ Watchers: 24
๐ด Forks: 533 forks
๐ป Programming Languages: TypeScript - JavaScript - Dockerfile - CSS
๐ท๏ธ Related Topics:
#converter #typescript #document_conversion #convert #conversion #pdf_converter #self_hosted #file_converter #file_conversion #hacktoberfest #bun #tailwindcss #elysia
==================================
๐ง By: https://t.me/DataScienceM