Code Stars
1.92K subscribers
9.38K photos
9.67K links
Code Stars alerts you to GitHub repos gaining stars rapidly. Stay ahead of the curve and discover trending projects before they go viral! #AI #GitHub #OpenSource #Tech #MachineLearning #Python #Programming #Java #Javascript #React #Docker #Devops
Download Telegram
Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Language:HTML
Total stars: 6025
Stars trend:
17 Apr 2024
5pm ▎ +2
6pm ▌ +4
7pm ▍ +3
8pm ▋ +5
9pm ▊ +6
10pm ▋ +5
11pm ▋ +5
18 Apr 2024
12am ▉ +7
1am █▏ +9
2am █▋ +13
3am █▎ +10
4am ██▏ +17

#html
#datapipelines, #deeplearning, #documentimageanalysis, #documentimageprocessing, #documentparser, #documentparsing, #docx, #donut, #informationretrieval, #langchain, #llm, #machinelearning, #ml, #naturallanguageprocessing, #nlp, #ocr, #pdf, #pdftojson, #pdftotext, #preprocessing
gotenberg/gotenberg
A developer-friendly API for converting numerous document formats into PDF files, and more!
Language:Go
Total stars: 6772
Stars trend:
18 Apr 2024
9pm ▌ +4
10pm ▏ +1
11pm ▍ +3
19 Apr 2024
12am ▏ +1
1am ▏ +1
2am +0
3am ▍ +3
4am ▍ +3
5am █▍ +11
6am ███▍ +27
7am ███▎ +26

#go
#api, #chrome, #chromium, #converter, #csv, #docker, #docx, #excel, #html, #http2, #libreoffice, #markdown, #pdf, #pdftk, #pptx, #puppeteer, #unoconv, #wkhtmltopdf, #word, #xlsx
👍1
koodo-reader/koodo-reader
A modern ebook manager and reader with sync and backup capacities for Windows, macOS, Linux and Web
Language:JavaScript
Total stars: 16335
Stars trend:
28 May 2024
12am ▎ +2
1am █▊ +14
2am █▉ +15
3am ▉ +7
4am ▊ +6
5am █ +8
6am █▏ +9
7am ▏ +1
8am ▎ +2
9am ▋ +5
10am ▍ +3
11am ▍ +3

#javascript
#book, #cb7, #cbr, #cbt, #cbz, #comic, #docx, #ebook, #epub, #fb2, #html, #markdown, #mobi, #pdf, #reader, #rtf, #txt, #xml
👍1
brsloan/warewoolf
A minimalist novel-writing system/rich text editor designed to be usable without a mouse. For desktop and standalone word processors/digital typewriters/writerDecks.
Language:JavaScript
Total stars: 116
Stars trend:
14 Sep 2024
11pm ▎ +2
15 Sep 2024
12am █ +8
1am █▍ +11
2am █▋ +13
3am █ +8
4am ▋ +5
5am ▏ +1
6am █ +8
7am ▊ +6
8am ▍ +3
9am ▍ +3
10am █ +8

#javascript
#docx, #editor, #fiction, #markdown, #novelwriting, #quill, #richtexteditor, #texteditor, #wordprocessor, #writerdeck, #writingsoftware
QuivrHQ/MegaParse
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Language:Python
Total stars: 807
Stars trend:
3 Dec 2024
6am ▏ +1
7am +0
8am ██▋ +21
9am █▌ +12
10am █▍ +11
11am █▎ +10
12pm █▏ +9
1pm ▉ +7
2pm █▎ +10

#python
#docx, #llm, #parser, #pdf, #powerpoint
DS4SD/docling
Get your documents ready for gen AI
Language:Python
Total stars: 18111
Stars trend:
13 Jan 2025
11pm ▌ +4
14 Jan 2025
12am ▏ +1
1am ▍ +3
2am ▎ +2
3am ▋ +5
4am ▊ +6
5am ██▌ +20
6am █▋ +13
7am ▉ +7
8am █▏ +9
9am ▉ +7
10am ▊ +6

#python
#ai, #convert, #documentparser, #documentparsing, #documents, #docx, #html, #markdown, #pdf, #pdfconverter, #pdftojson, #pdftotext, #pptx, #tables, #xlsx
👍1
yobix-ai/extractous
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Language:Rust
Total stars: 777
Stars trend:
29 Jan 2025
10pm █▏ +9
11pm ▌ +4
30 Jan 2025
12am █▎ +10
1am ▋ +5
2am █▏ +9
3am ▊ +6
4am ▉ +7
5am █ +8
6am ▉ +7
7am █ +8
8am ▋ +5
9am █ +8

#rust
#datapipelines, #docx, #etl, #etlpipelines, #extraction, #llm, #machinelearning, #naturallanguageprocessing, #nlp, #ocr, #pdf, #pdfparser, #rag, #rust, #tika, #unstructured, #unstructureddata
Goldziher/kreuzberg
A text extraction library supporting PDFs, images, office documents and more
Language:Python
Total stars: 304
Stars trend:
15 Feb 2025
12am █ +8
1am ▋ +5
2am █ +8
3am ▊ +6
4am ▉ +7
5am ▉ +7
6am ▊ +6
7am ▎ +2
8am █ +8
9am █ +8
10am █▋ +13

#python
#asyncio, #docx, #ocr, #pdf, #textextraction
koodo-reader/koodo-reader
A modern ebook manager and reader with sync and backup capacities for Windows, macOS, Linux and Web
Language:JavaScript
Total stars: 21188
Stars trend:
4 Mar 2025
12am █▉ +15
1am ▌ +4
2am █▏ +9
3am █▏ +9
4am ▎ +2
5am █▏ +9
6am ▊ +6
7am ▍ +3
8am ▌ +4
9am ▋ +5
10am ▉ +7
11am ▋ +5

#javascript
#book, #cb7, #cbr, #cbt, #cbz, #comic, #docx, #ebook, #epub, #fb2, #html, #markdown, #mobi, #pdf, #reader, #rtf, #txt, #xml
docling-project/docling
Get your documents ready for gen AI
Language:Python
Total stars: 26148
Stars trend:
6 Apr 2025
11am ▏ +1
12pm +0
1pm ▍ +3
2pm ▏ +1
3pm ▌ +4
4pm ▌ +4
5pm ██▏ +17
6pm █ +8
7pm ▊ +6
8pm █▏ +9
9pm █▌ +12
10pm ██ +16

#python
#ai, #convert, #documentparser, #documentparsing, #documents, #docx, #html, #markdown, #pdf, #pdfconverter, #pdftojson, #pdftotext, #pptx, #tables, #xlsx
QuivrHQ/MegaParse
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Language:Python
Total stars: 6094
Stars trend:
25 Apr 2025
3pm █▎ +10
4pm █▍ +11
5pm █▏ +9
6pm █▍ +11
7pm ▌ +4
8pm █ +8
9pm ▉ +7
10pm ▋ +5
11pm ▉ +7
26 Apr 2025
12am █▎ +10
1am █▎ +10

#python
#docx, #llm, #parser, #pdf, #powerpoint
👍1
shcherbak-ai/contextgem
ContextGem: Effortless LLM extraction from documents
Language:Python
Total stars: 674
Stars trend:
11 May 2025
1pm █▏ +9
2pm █▍ +11
3pm █ +8
4pm ▊ +6
5pm █ +8
6pm ▍ +3
7pm ▍ +3
8pm ▍ +3
9pm ▍ +3
10pm ▌ +4
11pm █▎ +10
12 May 2025
12am ▉ +7

#python
#ai, #contractanalysis, #dataextraction, #documentintelligence, #docx, #docx2md, #docx2txt, #generativeai, #legaltech, #llm, #llmextraction, #llmframework, #llmpipeline, #llms, #nlp, #promptengineering, #textanalysis, #unstructureddata
docling-project/docling
Get your documents ready for gen AI
Language:Python
Total stars: 33012
Stars trend:
29 Jun 2025
6am ▎ +2
7am ▉ +7
8am ▎ +2
9am ▌ +4
10am █ +8
11am █▎ +10
12pm █ +8
1pm █▋ +13
2pm █▎ +10
3pm █▊ +14
4pm █▋ +13
5pm ▌ +4

#python
#ai, #convert, #documentparser, #documentparsing, #documents, #docx, #html, #markdown, #pdf, #pdfconverter, #pdftojson, #pdftotext, #pptx, #tables, #xlsx
Doycsk/Microsoft-Word
Download Microsoft Word 2025: Advanced Document Editing, Real-Time Collaboration, AI-Powered Formatting, and Cloud Integration for Productivity
Language:
Total stars: 459
Stars trend:
15 Jul 2025
2am ███▏ +25
3am ███▌ +28
4am ████▏ +33
5am ██▊ +22
6am ███▉ +31
7am ██▌ +20


#documentediting, #docx, #microsoftword, #msword, #office365, #vbaword, #wordaddins, #wordautomation, #worddocument, #wordformatting, #wordforms, #wordmacros, #wordmailmerge, #wordplugins, #wordprocessing, #wordshortcuts, #wordstyles, #wordtemplates, #wordtips, #wordtricks
bgreenwell/doxx
Expose the contents of .docx files without leaving your terminal. Fast, safe, and smart — no Office required!
Language:Rust
Total stars: 1366
Stars trend:
21 Aug 2025
9am █ +8
10am ▉ +7
11am █ +8
12pm ▋ +5
1pm █ +8
2pm ▋ +5
3pm ▌ +4
4pm ▉ +7
5pm ▉ +7
6pm █▎ +10
7pm ▊ +6
8pm ▊ +6

#rust
#cli, #docx, #msword, #render, #rust, #terminal, #tui