Github Top Repositories
12.9K subscribers
377 photos
57 videos
9 files
1.32K links
Top GitHub repositories in one place πŸš€
Explore the best projects in programming, AI, data science, and more.
Download Telegram
πŸ”₯ Trending Repository: Dolphin

πŸ“ Description: The official repo for β€œDolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

πŸ”— Repository URL: https://github.com/bytedance/Dolphin

πŸ“– Readme: https://github.com/bytedance/Dolphin#readme

πŸ“Š Statistics:
🌟 Stars: 6.3K stars
πŸ‘€ Watchers: 53
🍴 Forks: 516 forks

πŸ’» Programming Languages: Python - Shell

🏷️ Related Topics:
#python #pdf #parser #ocr #pdf_converter #document_analysis #pdf_parser #layout_analysis #vlm_ocr


==================================
🧠 By: https://t.me/DataScienceM
πŸ”₯ Trending Repository: MinerU

πŸ“ Description: Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

πŸ”— Repository URL: https://github.com/opendatalab/MinerU

🌐 Website: https://opendatalab.github.io/MinerU/

πŸ“– Readme: https://github.com/opendatalab/MinerU#readme

πŸ“Š Statistics:
🌟 Stars: 45.7K stars
πŸ‘€ Watchers: 183
🍴 Forks: 3.8K forks

πŸ’» Programming Languages: Python - Dockerfile

🏷️ Related Topics:
#python #pdf #parser #ocr #pdf_converter #extract_data #document_analysis #pdf_parser #layout_analysis #ai4science #pdf_extractor_rag #pdf_extractor_llm #pdf_extractor_pretrain


==================================
🧠 By: https://t.me/DataScienceM
❀1
πŸ”₯ Trending Repository: opendataloader-pdf

πŸ“ Description: PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

πŸ”— Repository URL: https://github.com/opendataloader-project/opendataloader-pdf

🌐 Website: https://opendataloader.org

πŸ“– Readme: https://github.com/opendataloader-project/opendataloader-pdf#readme

πŸ“Š Statistics:
🌟 Stars: 4.7k
πŸ‘€ Watchers: 18
🍴 Forks: 355

πŸ’» Programming Languages: Java - Python - MDX - JavaScript - TypeScript - Shell

🏷️ Related Topics:
#html #markdown #pdf #json #ocr #ai #accessibility #a11y #pdf_converter #tables #ocr_recognition #pdf_parser #rag #bounding_box #eaa #pdf_extraction #tagged_pdf #document_parsing #pdf_accessibility #pdf_ua


==================================
🧠 By: https://t.me/DataScienceM
❀2