opendatalab/MinerU
A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF into Markdown and JSON formats.
Language: Python
Total stars: 27107
Stars trend:
3 Mar 2025
3am █▋ +13
4am ▋ +5
5am ▉ +7
6am █▍ +11
7am █▏ +9
8am ▊ +6
9am ▉ +7
10am █ +8
11am ▊ +6
12pm ▊ +6
1pm █ +8
2pm ▉ +7
#python
#ai4science, #documentanalysis, #extractdata, #layoutanalysis, #ocr, #parser, #pdf, #pdfconverter, #pdfextractorllm, #pdfextractorpretrain, #pdfextractorrag, #pdfparser, #python
kotaro-kinoshita/yomitoku
Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.
Language: Python
Total stars: 697
Stars trend:
20 Apr 2025
10am ▊ +6
11am █▎ +10
12pm █▍ +11
1pm █▉ +15
2pm █▊ +14
3pm ▌ +4
4pm █ +8
5pm ▌ +4
6pm +0
7pm +0
8pm ▍ +3
9pm ▌ +4
#python
#deeplearning, #layoutanalysis, #ocr, #python, #pytorch