Code Stars
1.93K subscribers
9.34K photos
9.63K links
Code Stars alerts you to GitHub repos gaining stars rapidly. Stay ahead of the curve and discover trending projects before they go viral! #AI #GitHub #OpenSource #Tech #MachineLearning #Python #Programming #Java #Javascript #React #Docker #Devops
Download Telegram
adithya-s-k/marker-api
Easily deployable ๐Ÿš€ API to convert PDF to markdown quickly with high accuracy.
Language:Python
Total stars: 157
Stars trend:
14 May 2024
10pm โ– +1
11pm +0
15 May 2024
12am โ–ˆโ–‹ +13
1am โ–ˆโ–ˆโ–ˆโ–Š +30
2am โ–ˆโ–ˆโ–ˆ +24
3am โ–ˆโ–Œ +12

#python
#api, #fastapi, #marker, #pdfconverter, #pdffiles, #pdfparser, #pdfparsing, #restapi
yobix-ai/extractous
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Language:Rust
Total stars: 777
Stars trend:
29 Jan 2025
10pm โ–ˆโ– +9
11pm โ–Œ +4
30 Jan 2025
12am โ–ˆโ–Ž +10
1am โ–‹ +5
2am โ–ˆโ– +9
3am โ–Š +6
4am โ–‰ +7
5am โ–ˆ +8
6am โ–‰ +7
7am โ–ˆ +8
8am โ–‹ +5
9am โ–ˆ +8

#rust
#datapipelines, #docx, #etl, #etlpipelines, #extraction, #llm, #machinelearning, #naturallanguageprocessing, #nlp, #ocr, #pdf, #pdfparser, #rag, #rust, #tika, #unstructured, #unstructureddata
opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.ไธ€็ซ™ๅผๅผ€ๆบ้ซ˜่ดจ้‡ๆ•ฐๆฎๆๅ–ๅทฅๅ…ท๏ผŒๅฐ†PDF่ฝฌๆขๆˆMarkdownๅ’ŒJSONๆ ผๅผใ€‚
Language:Python
Total stars: 27107
Stars trend:
3 Mar 2025
3am โ–ˆโ–‹ +13
4am โ–‹ +5
5am โ–‰ +7
6am โ–ˆโ– +11
7am โ–ˆโ– +9
8am โ–Š +6
9am โ–‰ +7
10am โ–ˆ +8
11am โ–Š +6
12pm โ–Š +6
1pm โ–ˆ +8
2pm โ–‰ +7

#python
#ai4science, #documentanalysis, #extractdata, #layoutanalysis, #ocr, #parser, #pdf, #pdfconverter, #pdfextractorllm, #pdfextractorpretrain, #pdfextractorrag, #pdfparser, #python
โค2
opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.ไธ€็ซ™ๅผๅผ€ๆบ้ซ˜่ดจ้‡ๆ•ฐๆฎๆๅ–ๅทฅๅ…ท๏ผŒๅฐ†PDF่ฝฌๆขๆˆMarkdownๅ’ŒJSONๆ ผๅผใ€‚
Language:Python
Total stars: 35977
Stars trend:
22 Jun 2025
3pm โ– +3
4pm โ–Ž +2
5pm โ–ˆโ–ˆโ– +19
6pm โ–ˆโ–ˆโ– +19
7pm โ–ˆโ– +9
8pm โ– +3
9pm โ– +3
10pm โ–ˆโ–Œ +12
11pm โ–ˆโ–ˆโ– +19
23 Jun 2025
12am โ–ˆโ–ˆโ–ˆ +24
1am โ–ˆโ–ˆโ–ˆโ–ˆโ–‹ +37
2am โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–Š +46

#python
#ai4science, #documentanalysis, #extractdata, #layoutanalysis, #ocr, #parser, #pdf, #pdfconverter, #pdfextractorllm, #pdfextractorpretrain, #pdfextractorrag, #pdfparser, #python
PaddlePaddle/PaddleOCR
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 80+ languages.
Language:Python
Total stars: 54086
Stars trend:
16 Sep 2025
8am โ–Ž +2
9am โ– +1
10am โ–‹ +5
11am โ–ˆโ– +11
12pm โ–ˆโ– +9
1pm โ–ˆโ–Œ +12
2pm โ–ˆโ–ˆโ– +19
3pm โ–ˆ +8
4pm โ–ˆโ– +9
5pm โ–Œ +4
6pm โ–ˆโ–Š +14
7pm โ–Œ +4

#python
#ai4science, #chineseocr, #documentparsing, #documenttranslation, #kie, #ocr, #pdfextractorrag, #pdfparser, #pdf2markdown, #ppocr, #ppstructure, #rag