Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Language:HTML
Total stars: 6025
Stars trend:
#html
#datapipelines, #deeplearning, #documentimageanalysis, #documentimageprocessing, #documentparser, #documentparsing, #docx, #donut, #informationretrieval, #langchain, #llm, #machinelearning, #ml, #naturallanguageprocessing, #nlp, #ocr, #pdf, #pdftojson, #pdftotext, #preprocessing
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Language:HTML
Total stars: 6025
Stars trend:
17 Apr 2024
5pm ▎ +2
6pm ▌ +4
7pm ▍ +3
8pm ▋ +5
9pm ▊ +6
10pm ▋ +5
11pm ▋ +5
18 Apr 2024
12am ▉ +7
1am █▏ +9
2am █▋ +13
3am █▎ +10
4am ██▏ +17
#html
#datapipelines, #deeplearning, #documentimageanalysis, #documentimageprocessing, #documentparser, #documentparsing, #docx, #donut, #informationretrieval, #langchain, #llm, #machinelearning, #ml, #naturallanguageprocessing, #nlp, #ocr, #pdf, #pdftojson, #pdftotext, #preprocessing
mindspore-lab/mindnlp
Easy-to-use and high-performance NLP and LLM framework based on MindSpore, compatible with models and datasets of 🤗Huggingface.
Language:Python
Total stars: 409
Stars trend:
#python
#deeplearning, #largelanguagemodels, #llm, #mindspore, #naturallanguageprocessing, #nlp, #nlplibrary, #python
Easy-to-use and high-performance NLP and LLM framework based on MindSpore, compatible with models and datasets of 🤗Huggingface.
Language:Python
Total stars: 409
Stars trend:
10 May 2024
1am █ +8
2am █████▋ +45
3am ████████████████▉ +135
#python
#deeplearning, #largelanguagemodels, #llm, #mindspore, #naturallanguageprocessing, #nlp, #nlplibrary, #python
eugeneyan/applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Language:
Total stars: 26245
Stars trend:
#applieddatascience, #appliedmachinelearning, #computervision, #datadiscovery, #dataengineering, #dataquality, #datascience, #deeplearning, #machinelearning, #naturallanguageprocessing, #production, #recsys, #reinforcementlearning, #search
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Language:
Total stars: 26245
Stars trend:
18 Jun 2024
6pm ▍ +3
7pm █▌ +12
8pm █▍ +11
9pm █▎ +10
10pm ▌ +4
11pm █▏ +9
19 Jun 2024
12am ▋ +5
1am ▊ +6
2am ▋ +5
3am ▊ +6
4am █ +8
#applieddatascience, #appliedmachinelearning, #computervision, #datadiscovery, #dataengineering, #dataquality, #datascience, #deeplearning, #machinelearning, #naturallanguageprocessing, #production, #recsys, #reinforcementlearning, #search
GokuMohandas/Made-With-ML
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Language:Jupyter Notebook
Total stars: 36368
Stars trend:
#jupyternotebook
#dataengineering, #dataquality, #datascience, #deeplearning, #distributedml, #distributedtraining, #llms, #machinelearning, #mlops, #naturallanguageprocessing, #python, #pytorch, #ray
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Language:Jupyter Notebook
Total stars: 36368
Stars trend:
19 Jun 2024
1am ▌ +4
2am ▍ +3
3am ▎ +2
4am ▉ +7
5am ▉ +7
6am ▉ +7
7am █▍ +11
8am █▎ +10
9am ▋ +5
10am ▋ +5
11am █▎ +10
12pm ▊ +6
#jupyternotebook
#dataengineering, #dataquality, #datascience, #deeplearning, #distributedml, #distributedtraining, #llms, #machinelearning, #mlops, #naturallanguageprocessing, #python, #pytorch, #ray
argilla-io/argilla
Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.
Language:Python
Total stars: 3309
Stars trend:
#python
#activelearning, #ai, #annotationtool, #developertools, #gpt4, #humanintheloop, #langchain, #llm, #machinelearning, #mlops, #naturallanguageprocessing, #nlp, #rlhf, #textannotation, #textlabeling, #weaksupervision, #weaklysupervisedlearning
Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.
Language:Python
Total stars: 3309
Stars trend:
19 Jun 2024
11am ▉ +7
12pm █▎ +10
1pm █▌ +12
2pm █ +8
3pm █ +8
4pm █▏ +9
5pm ▌ +4
6pm ▍ +3
7pm █▏ +9
8pm █▍ +11
#python
#activelearning, #ai, #annotationtool, #developertools, #gpt4, #humanintheloop, #langchain, #llm, #machinelearning, #mlops, #naturallanguageprocessing, #nlp, #rlhf, #textannotation, #textlabeling, #weaksupervision, #weaklysupervisedlearning
languagetool-org/languagetool
Style and Grammar Checker for 25+ Languages
Language:Java
Total stars: 12152
Stars trend:
#java
#grammar, #naturallanguage, #naturallanguageprocessing, #proofreading, #spellcheck, #stylechecker
Style and Grammar Checker for 25+ Languages
Language:Java
Total stars: 12152
Stars trend:
18 Sep 2024
5am ▏ +1
6am ██▉ +23
7am ███ +24
8am ███▍ +27
9am ██▋ +21
#java
#grammar, #naturallanguage, #naturallanguageprocessing, #proofreading, #spellcheck, #stylechecker
kmario23/deep-learning-drizzle
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Language:HTML
Total stars: 12193
Stars trend:
#html
#artificialintelligencealgorithms, #artificialneuralnetworks, #bayesianstatistics, #computervision, #deeplearning, #deepneuralnetworks, #deepreinforcementlearning, #explainableai, #geometricdeeplearning, #graphneuralnetworks, #machinelearning, #medicalimaging, #naturallanguageprocessing, #optimization, #patternrecognition, #probabilisticgraphicalmodels, #probability, #reinforcementlearning, #speechrecognition, #visualrecognition
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Language:HTML
Total stars: 12193
Stars trend:
8 Oct 2024
3pm +1
4pm +0
5pm +0
6pm +0
7pm +0
8pm +0
9pm +0
10pm +0
11pm +0
9 Oct 2024
12am +0
1am +0
2am ██████████████████▊ +162
#html
#artificialintelligencealgorithms, #artificialneuralnetworks, #bayesianstatistics, #computervision, #deeplearning, #deepneuralnetworks, #deepreinforcementlearning, #explainableai, #geometricdeeplearning, #graphneuralnetworks, #machinelearning, #medicalimaging, #naturallanguageprocessing, #optimization, #patternrecognition, #probabilisticgraphicalmodels, #probability, #reinforcementlearning, #speechrecognition, #visualrecognition
❤1
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language:Python
Total stars: 15699
Stars trend:
#python
#chinese, #flashattention, #largelanguagemodels, #llm, #naturallanguageprocessing, #pretrainedmodels
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language:Python
Total stars: 15699
Stars trend:
29 Jan 2025
8am ▉ +7
9am ▍ +3
10am ▊ +6
11am ▊ +6
12pm ▏ +1
1pm ▍ +3
2pm █▎ +10
3pm █ +8
4pm █▎ +10
5pm █▌ +12
6pm ▊ +6
7pm ▌ +4
#python
#chinese, #flashattention, #largelanguagemodels, #llm, #naturallanguageprocessing, #pretrainedmodels
yobix-ai/extractous
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Language:Rust
Total stars: 777
Stars trend:
#rust
#datapipelines, #docx, #etl, #etlpipelines, #extraction, #llm, #machinelearning, #naturallanguageprocessing, #nlp, #ocr, #pdf, #pdfparser, #rag, #rust, #tika, #unstructured, #unstructureddata
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Language:Rust
Total stars: 777
Stars trend:
29 Jan 2025
10pm █▏ +9
11pm ▌ +4
30 Jan 2025
12am █▎ +10
1am ▋ +5
2am █▏ +9
3am ▊ +6
4am ▉ +7
5am █ +8
6am ▉ +7
7am █ +8
8am ▋ +5
9am █ +8
#rust
#datapipelines, #docx, #etl, #etlpipelines, #extraction, #llm, #machinelearning, #naturallanguageprocessing, #nlp, #ocr, #pdf, #pdfparser, #rag, #rust, #tika, #unstructured, #unstructureddata
camel-ai/camel
🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org
Language:Python
Total stars: 6696
Stars trend:
#python
#agent, #aisocieties, #artificialintelligence, #communicativeai, #cooperativeai, #deeplearning, #largelanguagemodels, #multiagentsystems, #naturallanguageprocessing
🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org
Language:Python
Total stars: 6696
Stars trend:
6 Mar 2025
4pm ▏ +1
5pm +0
6pm ▏ +1
7pm ▏ +1
8pm ▏ +1
9pm +0
10pm +0
11pm +0
7 Mar 2025
12am ▎ +2
1am ██▎ +18
2am ███████▌ +60
3am ██████▉ +55
#python
#agent, #aisocieties, #artificialintelligence, #communicativeai, #cooperativeai, #deeplearning, #largelanguagemodels, #multiagentsystems, #naturallanguageprocessing
girafe-ai/ml-course
Open Machine Learning course
Language:Jupyter Notebook
Total stars: 2606
Stars trend:
#jupyternotebook
#computervision, #course, #deeplearning, #machinelearning, #materials, #naturallanguageprocessing, #python, #pytorch, #reinforcementlearning, #seminars
Open Machine Learning course
Language:Jupyter Notebook
Total stars: 2606
Stars trend:
9 Apr 2025
11am ▌ +4
12pm ▉ +7
1pm █▏ +9
2pm █ +8
3pm ▊ +6
4pm ▏ +1
5pm ▎ +2
6pm ▌ +4
7pm █▎ +10
8pm ▉ +7
9pm █▏ +9
10pm █▏ +9
#jupyternotebook
#computervision, #course, #deeplearning, #machinelearning, #materials, #naturallanguageprocessing, #python, #pytorch, #reinforcementlearning, #seminars
bee-san/Ciphey
⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡
Language:Python
Total stars: 19141
Stars trend:
#python
#artificialintelligence, #cipher, #cpp, #cryptography, #ctf, #ctftools, #cyberchefmagic, #decryption, #deepneuralnetwork, #encodings, #encryptions, #hacking, #hacktoberfest, #hashes, #naturallanguageprocessing, #pentesting, #python
⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡
Language:Python
Total stars: 19141
Stars trend:
7 May 2025
2pm █▋ +13
3pm ▊ +6
4pm ▋ +5
5pm ▌ +4
6pm ▌ +4
7pm +0
8pm ▏ +1
9pm ▌ +4
10pm ▎ +2
11pm ▉ +7
8 May 2025
12am █▉ +15
1am ██▌ +20
#python
#artificialintelligence, #cipher, #cpp, #cryptography, #ctf, #ctftools, #cyberchefmagic, #decryption, #deepneuralnetwork, #encodings, #encryptions, #hacking, #hacktoberfest, #hashes, #naturallanguageprocessing, #pentesting, #python
arc53/DocsGPT
DocsGPT is an open-source genAI tool that helps users get reliable answers from knowledge source, while avoiding hallucinations. It enables private and reliable information retrieval, with tooling and agentic system capability built in.
Language:TypeScript
Total stars: 16162
Stars trend:
#typescript
#ai, #chatgpt, #docsgpt, #hacktoberfest, #informationretrieval, #languagemodel, #llm, #machinelearning, #naturallanguageprocessing, #python, #pytorch, #rag, #react, #semanticsearch, #transformers, #webapp
DocsGPT is an open-source genAI tool that helps users get reliable answers from knowledge source, while avoiding hallucinations. It enables private and reliable information retrieval, with tooling and agentic system capability built in.
Language:TypeScript
Total stars: 16162
Stars trend:
16 Jul 2025
5pm █▏ +9
6pm ▊ +6
7pm ▋ +5
8pm ▌ +4
9pm ▍ +3
10pm ▍ +3
11pm █▏ +9
17 Jul 2025
12am ▊ +6
1am ██ +16
2am ██▏ +17
3am █▊ +14
4am ▊ +6
#typescript
#ai, #chatgpt, #docsgpt, #hacktoberfest, #informationretrieval, #languagemodel, #llm, #machinelearning, #naturallanguageprocessing, #python, #pytorch, #rag, #react, #semanticsearch, #transformers, #webapp
srbhr/Resume-Matcher
Improve your resumes with Resume Matcher. Get insights, keyword suggestions and tune your resumes to job descriptions.
Language:TypeScript
Total stars: 9283
Stars trend:
#typescript
#applicanttrackingsystem, #ats, #hacktoberfest, #machinelearning, #naturallanguageprocessing, #nextjs, #python, #resume, #resumebuilder, #resumeparser, #textsimilarity, #typescript, #vectorsearch, #wordembeddings
Improve your resumes with Resume Matcher. Get insights, keyword suggestions and tune your resumes to job descriptions.
Language:TypeScript
Total stars: 9283
Stars trend:
19 Jul 2025
7am ▌ +4
8am ▏ +1
9am ▍ +3
10am ▍ +3
11am █▏ +9
12pm ▋ +5
1pm █▍ +11
2pm ▎ +2
3pm █▎ +10
4pm █▌ +12
5pm █▎ +10
6pm █▍ +11
#typescript
#applicanttrackingsystem, #ats, #hacktoberfest, #machinelearning, #naturallanguageprocessing, #nextjs, #python, #resume, #resumebuilder, #resumeparser, #textsimilarity, #typescript, #vectorsearch, #wordembeddings