Kaggle Data Hub
29.3K subscribers
1.03K photos
15 videos
309 files
1.29K links
Your go-to hub for Kaggle datasets – explore, analyze, and leverage data for Machine Learning and Data Science projects.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
πŸ“Š Brazilian E-Commerce Public Dataset by Olist

πŸ“ Description:
Welcome! This is a Brazilian ecommerce public dataset of orders made at Olist Store. The dataset has information of 100k orders from 2016 to 2018 made at multiple marketplaces in Brazil. Its features allows viewing an order from multiple dimensions: from order status, price, payment and freight perf...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 9
β€’ Downloads: 201

πŸ““ Popular Notebooks:
1. /thiagopanini/e-commerce-sentiment-analysis-eda-viz-nlp
2. /andresionek/geospatial-analysis-of-brazilian-e-commerce
3. /anshumoudgil/olist-ecommerce-analytics-quasi-poisson-poly-regs

πŸ”— Powered by @datasets1
πŸ‘1
πŸ“Š PL prediction score for fantasy game *New feature

πŸ“ Description:
New feature is implemented! Calibration function is added in PredictedCompositeScore, each player calculates their own score deviation between previous week predicted score and actual result. The delta will be considered in the next gameweek prediction. ##New version is released at 8 Nov##, added mo...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 3
β€’ Downloads: 523

πŸ““ Popular Notebooks:
1. /devraai/fpl-player-analysis-and-points-prediction
2. /devraai/fpl-player-analysis-and-points-prediction
3. /devraai/fpl-player-analysis-and-points-prediction

πŸ”— Powered by @datasets1
πŸ“Š Brain Tumor MRI Dataset

πŸ“ Description:
The application of deep learning approaches in context to improve health diagnosis is providing impactful solutions. According to the World Health Organization (WHO), proper brain tumor diagnosis involves detection, brain tumor location identification, and classification of the tumor on the basis of...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 300
β€’ Downloads: 9

πŸ““ Popular Notebooks:
1. /yousefmohamed20/brain-tumor-mri-accuracy-99
2. /guslovesmath/cnn-brain-tumor-classification-99-accuracy
3. /abdallahwagih/brain-tumor-classification-pytorch

πŸ”— Powered by @datasets1
❀1
πŸ“Š Bollywood Fame Stats

πŸ“ Description:
This dataset provides information about popular Bollywood actors, including their movie appearances, total ratings, and popularity measured by Google hits. It highlights both the cinematic performance (movie count, rating scores) and the online fame (search popularity and ranking) of each actor. The...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 44

πŸ““ Popular Notebooks:
1. /eshummalik/the-spotlight-index
2. /devraai/bollywood-actor-ranking-analysis-and-prediction
3. /copperhorse/assignment

πŸ”— Powered by @datasets1
πŸ“Š Healthy Eating Dataset

πŸ“ Description:
This dataset contains 2000 synthetic meal records with detailed nutritional information, preparation details, and health indicators. It is designed for computer vision, nutrition analytics, and health-related machine learning tasks. Training CV models to recognize food images from nutritional labels...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 39

πŸ““ Popular Notebooks:
1. /jockeroika/life-style-collection-data
2. /meteyildirim2005/pandas-tutorial
3. /sumedh1507/predicting-healthy-diet

πŸ”— Powered by @datasets1
πŸ“Š Salary.Data

πŸ“ Description:
This dataset provides a comprehensive view of employee demographics, qualifications, job roles, and compensation levels. It captures crucial factors that influence salary trends, such as age, gender, education level, years of professional experience, and job title. With over 6,700 records, it enable...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 29,400

πŸ““ Popular Notebooks:
1. /ahmadrazakashif/salary-dataset
2. /chandradeepnarra3/salary-data-insights-gender-wise
3. /tahertaha/salary-analysis

πŸ”— Powered by @datasets1
❀1
πŸ“Š Premium Watch Dataset

πŸ“ Description:
This dataset contains comprehensive information about 1,670 premium watches scraped from SamiWatches, one of Turkey's leading luxury watch retailers. The dataset includes detailed specifications, pricing, and technical features of watches from 22 prestigious brands. This dataset contains comprehensi...

πŸ“₯ Download:
β€’ Size: Memory usage: 3.23 MB (optimized)
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 4
β€’ Downloads: 16

πŸ““ Popular Notebooks:
1. /devraai/premium-watch-data-analysis-and-price-modeling
2. /devraai/premium-watch-data-analysis-and-price-modeling
3. /devraai/premium-watch-data-analysis-and-price-modeling

πŸ”— Powered by @datasets1
❀4
πŸ“Š Agro Pest-12: Image Dataset for Crop Pest Detection

πŸ“ Description:
The dataset consists of high-quality pest images captured in natural agricultural environments, along with annotation files in .txt format (class x_center y_center width height). This makes it suitable for training and benchmarking deep learning models such as YOLOv5, YOLOv8, YOLOv11, Faster R-CNN, ...

πŸ“₯ Download:
β€’ Size: 563.48 MB
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 26,300
β€’ Downloads: 232

πŸ““ Popular Notebooks:
1. /rupankarmajumdar/yolo-pest-detection-classification-example
2. /miadul/crop-pest-classification-with-efficientnetb0
3. /rupankarmajumdar/pest-detection-classification-v11n

πŸ”— Powered by @datasets1
πŸ“Š Gold-Silver Price VS Geopolitical Risk (1985–2025)

πŸ“ Description:
The inspiration for compiling this dataset comes from the fact that precious metals are often considered safe-haven assets. Investors frequently turn to Gold and Silver in times of geopolitical uncertainty. By bringing these datasets together, it becomes possible to explore whether increases in glob...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 4
β€’ Downloads: 898

πŸ““ Popular Notebooks:
1. /shreyanshdangi/gold-s-grace-vs-gprd-s-gamble-risk-analysis
2. /shreyanshdangi/merging-aligning-multiple-dataset
3. /muhammedaliyilmazz/lstm-vs-arima-30-180-day-forecasts-comparison

πŸ”— Powered by @datasets1
πŸ“Š Breast Cancer Wisconsin (Diagnostic) Data Set

πŸ“ Description:
a) radius (mean of distances from center to points on the perimeter) b) texture (standard deviation of gray-scale values) c) perimeter d) area e) smoothness (local variation in radius lengths) f) compactness (perimeter^2 / area - 1.0) g) concavity (severity of concave portions of the contour) h) con...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 503

πŸ““ Popular Notebooks:
1. /kanncaa1/feature-selection-and-data-visualization
2. /buddhiniw/breast-cancer-prediction
3. /kanncaa1/statistical-learning-tutorial-for-beginners

πŸ”— Powered by @datasets1
πŸ“Š Grand_Data_Auto_Vice City

πŸ“ Description:
This dataset is about Grand City Games. It helps people to analyze the game easily. The dataset has 52099 rows and 16 columns, and there are no missing values. Grand City is a famous open-world game. With this dataset, you can explore game details, find patterns, and understand the Grand City Games ...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 60

πŸ““ Popular Notebooks:
1. /eshummalik/vicecity-analytics-journey
2. /lukhilaksh/gta-v-eda-and-prediction
3. /eshummalik/vicecity-analytics-journey

πŸ”— Powered by @datasets1
πŸ“Š NVIDIA Stock Data: Multi-Timeframe Analysis

πŸ“ Description:
This comprehensive dataset contains 26 years of NVIDIA Corporation (NVDA) stock market data spanning from 1999 to 2025, across 5 different timeframes. Perfect for technical analysis, algorithmic trading, machine learning predictions, and financial research. This comprehensive dataset contains 26 yea...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 5
β€’ Downloads: 66

πŸ““ Popular Notebooks:
1. /ibrahimqasimi/nvidia-multi-timeframe-stock-analysis-1999-2025
2. /stpeteishii/nvidia-stock-price-forecasting-with-lgbm
3. /sumedh1507/nvidia-stock-price-insights

πŸ”— Powered by @datasets1
πŸ“Š Ecommerce Dataset for Analysis

πŸ“ Description:
This dataset contains customer, product, order, and review data from an online retail platform. Each row represents an order transaction with associated customer and product details. Customer segmentation (high spenders, frequent buyers, churn risk)

πŸ“₯ Download:
β€’ Size: 1.65 MB
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 85

πŸ““ Popular Notebooks:
1. /nabihazahid/ecommerce-dataset-analysis-with-sql
2. /devraai/ecommerce-data-analysis-and-order-status-predic
3. /alexkhr/full-analysis-sentiment-with-data-errors

πŸ”— Powered by @datasets1
❀1
πŸ“Š Mall Customer Segmentation Data

πŸ“ Description:
You are owing a supermarket mall and through membership cards , you have some basic data about your customers like Customer ID, age, gender, annual income and spending score. Spending Score is something you assign to the customer based on your defined parameters like customer behavior and purchasing...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 596

πŸ““ Popular Notebooks:
1. /kushal1996/customer-segmentation-k-means-analysis
2. /vjchoudhary7/kmeans-clustering-in-customer-segmentation
3. /heeraldedhia/kmeans-clustering-for-customer-data

πŸ”— Powered by @datasets1
πŸ“Š Teo Me Why Education Platform

πŸ“ Description:
Neste dataset vocΓͺ entrarΓ‘ dados como o catΓ‘logo de todos os cursos e episΓ³dios, bem como a evoluΓ§Γ£o dos alunos. AlΓ©m disso, o aluno pode preencher suas habilidades e o nΓ­vel respectivo. VocΓͺ pode cruzar esses dados com o dataset do nosso sistema de pontos por meio da tabela usuarios_tmw.

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 9
β€’ Downloads: 14

πŸ““ Popular Notebooks:
1. /devraai/teomewhy-education-data-analysis-skill-prediction
2. /teocalvo/read-files
3. /devraai/teomewhy-education-data-analysis-skill-prediction

πŸ”— Powered by @datasets1
πŸ“Š Car_sales_info

πŸ“ Description:
This dataset contains information about car sales including details such as manufacturer, model, engine size, feul type, year of manufacture and mileage. It is useful for analyzing trends in car sales , understanding the impact of different features on pricing and building machine learning models fo...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 54

πŸ““ Popular Notebooks:
1. /minahilfatima12328/eda-car-sales-dataset
2. /zohairahm/car-data-sales
3. /osmansaid/car-sales-dataset-insights-and-visualization

πŸ”— Powered by @datasets1
❀1
πŸ“Š huggingface-deberta-v3-variants

πŸ“ Description:
You will find the standard models and also some models that I have found to be useful in different NLP competitions. Feel free to run one model first the standard and running it on a derived model to create an ensemble. This dataset contains language models released by Microsoft and fetched from the...

πŸ“₯ Download:
β€’ Size: 371.15 MB
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 97
β€’ Downloads: 8

πŸ““ Popular Notebooks:
1. /cdeotte/deberta-v3-small-starter-cv-0-820-lb-0-800
2. /thedrcat/detectai-transformers-baseline
3. /itahiro/deberta-large-2epochs-1hr

πŸ”— Powered by @datasets1
πŸ“‚ Mental Health & Social Media Balance Dataset

ℹ️ About Dataset:
This dataset captures the delicate relationship between social media habits and mental well-being. It combines variables such as screen time, stress level, sleep quality, digital detox days, and happiness index. Ideal for regression, correlation, or mental health prediction tasks. This dataset captu...

πŸ“Š Statistics:
β”œ πŸ’Ύ Size: Size not specified
β”œ πŸ—‚ Files: 1
β”” ⬇️ Downloads: 377

🧠 Top Notebooks:
πŸ”Ή mubeenshehzadi/mental-health-vs-social-media
πŸ”Ή devraai/mental-health-and-social-media-data-analysis
πŸ”Ή mahmoudredagamail/mental-health-social-media-balance-dataset

━━━━━━━━━━━━━━━━━
πŸ€– Powered by @datasets1
❀2
πŸ“‚ Post-COVID Conditions Dataset

ℹ️ About Dataset:
This dataset provides detailed estimates on Post-COVID Conditions, often referred to as "Long COVID," across the United States. The data is sourced from the Household Pulse Survey, a rapid response survey from the U.S. Census Bureau in partnership with the National Center for Health Statistics (NCHS...

πŸ“Š Statistics:
β”œ πŸ’Ύ Size: Size not specified
β”œ πŸ—‚ Files: 1
β”” ⬇️ Downloads: 11

🧠 Top Notebooks:
πŸ”Ή devraai/postcovid-conditions-analysis-and-modeling
πŸ”Ή alexkhr/post-covid-conditions-eda-statistics
πŸ”Ή devraai/postcovid-conditions-analysis-and-modeling

━━━━━━━━━━━━━━━━━
πŸ€– Powered by @datasets1
❀1
πŸ“‚ San Francisco Building Permits

ℹ️ About Dataset:
A building permit is an official approval document issued by a governmental agency that allows you or your contractor to proceed with a construction or remodeling project on one's property. For more details go to https://www.thespruce.com/what-is-a-building-permit-1398344. Each city or county has it...

πŸ“Š Statistics:
β”œ πŸ’Ύ Size: Size not specified
β”œ πŸ—‚ Files: 2
β”” ⬇️ Downloads: 48

🧠 Top Notebooks:
πŸ”Ή alexisbcook/exercise-handling-missing-values
πŸ”Ή rtatman/data-cleaning-challenge-handling-missing-values
πŸ”Ή chrisbow/cleaning-data-with-python-challenge-day-1

━━━━━━━━━━━━━━━━━
πŸ€– Powered by @datasets1
❀1