cleanlab/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Language:Python
Total stars: 8281
Stars trend:
#python
#activelearning, #annotation, #dataanalysis, #datacentricai, #datacleaning, #datacuration, #datalabeling, #dataprofiling, #dataquality, #datascience, #datavalidation, #dataops, #dataquality, #datasets, #labeling, #llms, #noisylabels, #outofdistributiondetection, #outlierdetection, #weaksupervision
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Language:Python
Total stars: 8281
Stars trend:
8 Apr 2024
12pm ▋ +5
1pm █▉ +15
2pm █▏ +9
3pm ▊ +6
4pm █▋ +13
5pm ▌ +4
6pm ▍ +3
7pm ▊ +6
8pm ▋ +5
9pm ▊ +6
10pm ▋ +5#python
#activelearning, #annotation, #dataanalysis, #datacentricai, #datacleaning, #datacuration, #datalabeling, #dataprofiling, #dataquality, #datascience, #datavalidation, #dataops, #dataquality, #datasets, #labeling, #llms, #noisylabels, #outofdistributiondetection, #outlierdetection, #weaksupervision
RUC-NLPIR/FlashRAG
⚡FlashRAG: A Python Toolkit for Efficient RAG Research
Language:Python
Total stars: 270
Stars trend:
#python
#benchmark, #datasets, #largelanguagemodels, #rag
⚡FlashRAG: A Python Toolkit for Efficient RAG Research
Language:Python
Total stars: 270
Stars trend:
28 May 2024
12am █ +8
1am ██ +16
2am █▋ +13
3am █▋ +13
4am ▉ +7
5am ▎ +2
6am ▊ +6
7am ▉ +7
8am ▉ +7#python
#benchmark, #datasets, #largelanguagemodels, #rag
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Language:JavaScript
Total stars: 18573
Stars trend:
#javascript
#annotation, #annotationtool, #annotations, #boundingbox, #computervision, #datalabeling, #dataset, #datasets, #deeplearning, #imageannotation, #imageclassification, #imagelabeling, #imagelabellingtool, #labelstudio, #labeling, #labelingtool, #mlops, #semanticsegmentation, #textannotation, #yolo
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Language:JavaScript
Total stars: 18573
Stars trend:
9 Oct 2024
7am ▍ +3
8am ▏ +1
9am ▏ +1
10am ▎ +2
11am ██▍ +19
12pm █ +8
1pm ▊ +6
2pm █▊ +14
3pm █ +8
4pm █▋ +13
5pm ▉ +7#javascript
#annotation, #annotationtool, #annotations, #boundingbox, #computervision, #datalabeling, #dataset, #datasets, #deeplearning, #imageannotation, #imageclassification, #imagelabeling, #imagelabellingtool, #labelstudio, #labeling, #labelingtool, #mlops, #semanticsegmentation, #textannotation, #yolo
❤1👍1
microsoft/torchgeo
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Language:Python
Total stars: 2948
Stars trend:
#python
#computervision, #datasets, #deeplearning, #earthobservation, #geospatial, #models, #pytorch, #remotesensing, #satelliteimagery, #torchvision, #transforms
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Language:Python
Total stars: 2948
Stars trend:
9 Dec 2024
7pm ▎ +2
8pm ▍ +3
9pm ▌ +4
10pm ▌ +4
11pm █ +8
10 Dec 2024
12am ▏ +1
1am █▌ +12
2am ▊ +6
3am █▉ +15
4am ▊ +6
5am ▌ +4
6am █▍ +11#python
#computervision, #datasets, #deeplearning, #earthobservation, #geospatial, #models, #pytorch, #remotesensing, #satelliteimagery, #torchvision, #transforms
langwatch/langwatch
The ultimate LLM Ops platform - Monitoring, Analytics, Evaluations, Datasets and Prompt Optimization ✨
Language:TypeScript
Total stars: 822
Stars trend:
#typescript
#ai, #analytics, #datasets, #dspy, #evaluation, #gpt, #llm, #llmops, #lowcode, #observability, #openai, #promptengineering
The ultimate LLM Ops platform - Monitoring, Analytics, Evaluations, Datasets and Prompt Optimization ✨
Language:TypeScript
Total stars: 822
Stars trend:
16 Jan 2025
4pm ▊ +6
5pm █▍ +11
6pm █▌ +12
7pm █▎ +10
8pm ▋ +5
9pm ▉ +7
10pm █ +8
11pm ▎ +2
17 Jan 2025
12am ▌ +4
1am ▊ +6
2am ▋ +5#typescript
#ai, #analytics, #datasets, #dspy, #evaluation, #gpt, #llm, #llmops, #lowcode, #observability, #openai, #promptengineering
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Language:JavaScript
Total stars: 23667
Stars trend:
#javascript
#annotation, #annotationtool, #annotations, #boundingbox, #computervision, #datalabeling, #dataset, #datasets, #deeplearning, #imageannotation, #imageclassification, #imagelabeling, #imagelabellingtool, #labelstudio, #labeling, #labelingtool, #mlops, #semanticsegmentation, #textannotation, #yolo
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Language:JavaScript
Total stars: 23667
Stars trend:
24 Jul 2025
1pm █ +8
2pm ▉ +7
3pm ▉ +7
4pm ▍ +3
5pm ▉ +7
6pm ▍ +3
7pm ▋ +5
8pm █▏ +9
9pm ▋ +5
10pm ▋ +5
11pm █ +8
25 Jul 2025
12am █▏ +9#javascript
#annotation, #annotationtool, #annotations, #boundingbox, #computervision, #datalabeling, #dataset, #datasets, #deeplearning, #imageannotation, #imageclassification, #imagelabeling, #imagelabellingtool, #labelstudio, #labeling, #labelingtool, #mlops, #semanticsegmentation, #textannotation, #yolo
awesomedata/awesome-public-datasets
A topic-centric list of HQ open datasets.
Language:
Total stars: 65037
Stars trend:
#aaronswartz, #awesomepublicdatasets, #datasets, #opendata
A topic-centric list of HQ open datasets.
Language:
Total stars: 65037
Stars trend:
30 Aug 2025
5am ▌ +4
6am ▊ +6
7am ▏ +1
8am ▍ +3
9am ▌ +4
10am ▎ +2
11am █▎ +10
12pm █▋ +13
1pm █▎ +10
2pm █▏ +9
3pm █▎ +10
4pm ██ +16#aaronswartz, #awesomepublicdatasets, #datasets, #opendata
Olcmyk/HuChenFeng
收集户晨风的所有内容
Language:
Total stars: 143
Stars trend:
#archiving, #contentanalysis, #datasets, #internetculture, #livestream, #socialmedia, #speechtotext, #textanalysis, #transcript
收集户晨风的所有内容
Language:
Total stars: 143
Stars trend:
18 Oct 2025
2am █▍ +11
3am ▉ +7
4am ▌ +4
5am █▊ +14
6am ▎ +2#archiving, #contentanalysis, #datasets, #internetculture, #livestream, #socialmedia, #speechtotext, #textanalysis, #transcript
Arize-ai/phoenix
AI Observability & Evaluation
Language:Jupyter Notebook
Total stars: 7431
Stars trend:
#jupyternotebook
#agents, #aimonitoring, #aiobservability, #aiengineering, #anthropic, #datasets, #evals, #langchain, #llamaindex, #llmeval, #llmevaluation, #llmops, #llms, #openai, #promptengineering, #smolagents
AI Observability & Evaluation
Language:Jupyter Notebook
Total stars: 7431
Stars trend:
24 Oct 2025
9pm ▎ +2
10pm ██ +16
11pm █ +8
25 Oct 2025
12am ███▍ +27
1am █▍ +11#jupyternotebook
#agents, #aimonitoring, #aiobservability, #aiengineering, #anthropic, #datasets, #evals, #langchain, #llamaindex, #llmeval, #llmevaluation, #llmops, #llms, #openai, #promptengineering, #smolagents
bytewax/awesome-public-real-time-datasets
A list of publicly available datasets with real-time data maintained by the team at bytewax.io
Language:
Total stars: 1480
Stars trend:
#awesomelist, #data, #datascience, #datavisualization, #datasets, #realtime, #streaming
A list of publicly available datasets with real-time data maintained by the team at bytewax.io
Language:
Total stars: 1480
Stars trend:
13 Nov 2025
5pm ▏ +1
6pm +0
7pm ▏ +1
8pm +0
9pm +0
10pm ▏ +1
11pm ▏ +1
14 Nov 2025
12am ▎ +2
1am ▎ +2
2am █▏ +9
3am █▎ +10
4am ▍ +3#awesomelist, #data, #datascience, #datavisualization, #datasets, #realtime, #streaming