Kaggle Data Hub
29.2K subscribers
996 photos
15 videos
309 files
1.26K links
Your go-to hub for Kaggle datasets – explore, analyze, and leverage data for Machine Learning and Data Science projects.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
πŸ“Š AI Hallucination Cases Data 2025

πŸ“ Description:
Damien Charlotin’s AI Hallucination Cases database β€œtracks legal decisions in cases where generative AI produced hallucinated content – typically fake citations, but also other types of arguments.” Charlotin, who teaches a course called β€œLarge Language Models and the Future of the Legal Profession”,...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 696

πŸ““ Popular Notebooks:
1. /devraai/ai-hallucination-cases-analysis-and-prediction
2. /sumedh1507/predicting-professional-sanction
3. /alexkhr/ai-hallucination-cases-dashboard-statistics

πŸ”— Powered by @datasets1
πŸ”₯1
πŸ“Š Healthcare Risk Factors Dataset

πŸ“ Description:
This dataset contains 30,000 records and 20 features related to individuals’ health conditions, along with some noisy/random columns. Medical Condition: Reported health condition (e.g., Diabetes, Hypertension, Asthma, Obesity, Healthy).

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 88

πŸ““ Popular Notebooks:
1. /alicansah1n/medicalconditionclassification-rf-lr-dt-knn-gnb
2. /esraamohamed2003/healthcare-risk-factors-dataset-visualization-task
3. /renjiabarai/medical-condition-prediction

πŸ”— Powered by @datasets1
πŸ“Š Chest X-Ray Dataset

πŸ“ Description:
This dataset merges multiple public chest X-ray sources into one classification set with three classes:

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 25,600
β€’ Downloads: 44

πŸ““ Popular Notebooks:
1. /zulqarnain11/x-ray-vision-cnn-for-lung-diseases
2. /hareekrishnavs/transfer-learning-auc
3. /stpeteishii/chest-x-ray-efficientnet-classifier

πŸ”— Powered by @datasets1
πŸ”₯1
πŸ“Š Health & Lifestyle Dataset

πŸ“ Description:
This dataset is a synthetically generated health & lifestyle dataset containing 100,000 records of individuals with various lifestyle and health indicators. It is designed to support machine learning, data science, and statistical modeling tasks such as: The dataset does not contain real personal in...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 94

πŸ““ Popular Notebooks:
1. /devraai/health-and-lifestyle-data-analysis-prediction
2. /emre21/kmeans-vs-breathe-kmeans
3. /sonawanelalitsunil/health-lifestyle-dataset-ml-75-17

πŸ”— Powered by @datasets1
πŸ“Š All Computer Prices

πŸ“ Description:
This is dataset contains all of my computer prices datasets merged into one dataset. With 100k rows of data, I have released a cleaned version of my original dataset. The columns should be self-explanatory, and your mission is to predict the price based on the features. If you are interested in this...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 63

πŸ““ Popular Notebooks:
1. /mahmoudredagamail/all-computer-prices
2. /paperxd/xgboost-starter-notebook
3. /mariamessam14/computer-price-prediction

πŸ”— Powered by @datasets1
πŸ“Š Steam - All games data

πŸ“ Description:
This dataset contains information on games that are tracked by Steam Spy and are published on Steam. You may find documentation on the APIs used to gather this data on: Data collected iterating through all pages from SteamSpy's API using the "request:all" parameter.

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 3
β€’ Downloads: 338

πŸ““ Popular Notebooks:
1. /lunthu/steam-genre-trends-pyspark-plotly
2. /lukhilaksh/steam-eda-and-pridiction
3. /lunthu/rise-of-strategy-games-trends-text-analysis

πŸ”— Powered by @datasets1
❀1
πŸ“Š Credit Card Fraud Detection

πŸ“ Description:
It contains only numerical input variables which are the result of a PCA transformation. Unfortunately, due to confidentiality issues, we cannot provide the original features and more background information about the data. Features V1, V2, … V28 are the principal components obtained with PCA, the on...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 276

πŸ““ Popular Notebooks:
1. /janiobachmann/credit-fraud-dealing-with-imbalanced-datasets
2. /gpreda/credit-card-fraud-detection-predictive-models
3. /joparga3/in-depth-skewed-data-classif-93-recall-acc-now

πŸ”— Powered by @datasets1
πŸ‘1
πŸ“Š Coffee_Sales _Dataset

πŸ“ Description:
This dataset contains coffee shop transaction records, including details about sales, payment type, time of purchase, and customer preferences. It is specifically curated for data visualization, dashboard building, and business analytics projects in tools like Power BI, Tableau, and Python visualiza...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 509

πŸ““ Popular Notebooks:
1. /tan5577/eda-coffee-sales
2. /devraai/coffee-sales-analysis-and-prediction
3. /mahmoudredagamail/coffee-sales-datasety

πŸ”— Powered by @datasets1
❀1
πŸ“Š Generative AI Market 2025

πŸ“ Description:
An organized summary of the quickly expanding generative artificial intelligence technology ecosystem can be found in the Generative AI Tools – Platforms 2025 dataset. It compiles data on a variety of platforms and tools for creating text, images, videos, code, and audio, emphasizing their features,...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 42

πŸ““ Popular Notebooks:
1. /sidraaazam/generative-ai-software-platforms
2. /alexkhr/genai-market-2025-open-source-prediction-acc-0-78
3. /alexkhr/generative-ai-market-2025-eda-statistics

πŸ”— Powered by @datasets1
πŸ“Š Recipes Dataset : 64k Dishes

πŸ“ Description:
This dataset is ideal for: recipe recommendation systems, NLP tasks, ingredient analysis, cooking apps, and food-related ML projects. All data is cleaned, consistent, and ready for use in both CSV and JSON formats. A curated recipe dataset with 64,000 recipes across 320+ categories. Includes ingredi...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 2
β€’ Downloads: 54

πŸ““ Popular Notebooks:
1. /wafaaelhusseini/augmenting-recipes-dataset-64k-dishes
2. /prashantsingh001/exploratory-data-analysis-of-64k-recipes-dataset
3. /alexkhr/recipes-dataset-clustering

πŸ”— Powered by @datasets1
πŸ“Š Dhaka Air Quality 2000-2025 - Synthetic Dataset

πŸ“ Description:
This dataset contains synthetically generated air quality data for Dhaka, Bangladesh, spanning 25 years from January 1, 2000 to October 4, 2025. The data was created to simulate realistic air quality patterns based on known characteristics of urban air pollution in South Asian megacities. This is sy...

πŸ“₯ Download:
β€’ Size: 50.1 MB
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 102

πŸ““ Popular Notebooks:
1. /shakil10945/dhaka-air-quality-2000-2025-synthetic-dataset-ge
2. /sumedh1507/dhaka-air-quality-analysis
3. /mrgulia/dhaka-air-quality-insights-and-model-accuracy-99-9

πŸ”— Powered by @datasets1
❀3
πŸ“Š Job Market - A 2025 Dataset

πŸ“₯ Download:
β€’ Size: File size: ~ (add actual size, e.g., 12 MB)
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 660

πŸ““ Popular Notebooks:
1. /mrgulia/basic
2. /udayrajkarki/job-market-basic-analysis
3. /devraai/job-market-data-analysis-and-salary-prediction

πŸ”— Powered by @datasets1
❀1
πŸ“Š Global House Purchase Decision Dataset

πŸ“ Description:
This dataset provides a global property purchase decisions with 200,000 records across 20+ countries and major cities. The dataset was generated with realistic distributions across global cities, making it useful for:

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 26

πŸ““ Popular Notebooks:
1. /mohankrishnathalla/house-prices-purchase-predicition
2. /devraai/global-house-purchase-decision-analysis
3. /harshjagda/globalhousepurchasedecisionanalysis

πŸ”— Powered by @datasets1
❀1
πŸ“Š Daily Coffee Transactions

πŸ“ Description:
For many people, coffee shops have become an integral part of their everyday lives, acting as both gathering places and places to stop for a quick refreshment. Knowing when customers come in, what they want to buy, and how they pay is crucial for operating a successful cafe. Examining sales trends, ...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 440

πŸ““ Popular Notebooks:
1. /mohammedmokhtar77/eda-of-coffee-transactions
2. /lunthu/coffee-sales-predictions-regression-rf-boosting
3. /minahilfatima12328/cafe-revenue-report

πŸ”— Powered by @datasets1
❀2
πŸ“Š Palmer Archipelago (Antarctica) penguin data

πŸ“ Description:
Please cite this data using: Gorman KB, Williams TD, Fraser WR (2014) Ecological Sexual Dimorphism and Environmental Variability within a Community of Antarctic Penguins (Genus Pygoscelis). PLoS ONE 9(3): e90081. doi:10.1371/journal.pone.0090081 Thank you to Dr. Gorman, Palmer Station LTER and the L...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 2
β€’ Downloads: 295

πŸ““ Popular Notebooks:
1. /jessemostipak/dive-into-dplyr-tutorial-1
2. /parulpandey/penguin-dataset-the-new-iris
3. /osobaseejale/dive-into-dplyr-tutorial-1

πŸ”— Powered by @datasets1
πŸ“Š GDP per Country 2020–2025

πŸ“ Description:
This dataset contains annual Gross Domestic Product (GDP) data for all recognized countries covering the years 2020 to 2025. Disputed territories are excluded. The dataset is compiled from IMF sources and is well-suited for studying global economic trends, forecasting, and analyzing the growth or de...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 110

πŸ““ Popular Notebooks:
1. /gpreda/world-countries-gdp-2020-2025
2. /yarendalkran/gdp-analysis-and-prediction-2020-2026
3. /sonawanelalitsunil/global-gdp-trends-2020-2025-ml-99-34

πŸ”— Powered by @datasets1
❀1
πŸ“Š Predicting Road Accident Risk | Vault

πŸ“ Description:
This vault collects the best-performing public submissions, offering optimized approaches, feature strategies, and model architectures from top competitors.

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 2
β€’ Downloads: 179

πŸ““ Popular Notebooks:
1. /anthonytherrien/road-accident-risk-blend
2. /anthonytherrien/s5e10-nn-stacking
3. /astitwaagarwal/predicting-road-accident-risk

πŸ”— Powered by @datasets1
Unlock premium learning without spending a dime! ⭐️ @DataScienceC is the first Telegram channel dishing out free Udemy coupons dailyβ€”grab courses on data science, coding, AI, and beyond. Join the revolution and boost your skills for free today! πŸ“•

What topic are you itching to learn next? 😊
https://t.me/DataScienceC 🌟
Please open Telegram to view this post
VIEW IN TELEGRAM
πŸ“Š Exploring Coffee Sales with EDA & Visualization

πŸ“ Description:
This dataset contains 3,547 coffee shop sales record collected over different days and months. It includes details such as the type of coffee purchased, transaction amount, payment method, date, and time-related attributes (hour, weekday, and month). The data can be used for sales trend analysis, cu...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 901

πŸ““ Popular Notebooks:
1. /wardabilal/eda-coffee-sales
2. /alexkhr/offee-sales-comprehensive-analysis-prediction
3. /devraai/coffee-sales-eda-and-linear-regression-model

πŸ”— Powered by @datasets1
πŸ“Š Adult Census Income Dataset

πŸ“ Description:
The popular Adult Income dataset from the UCI Machine Learning Repository has been converted into the file Adult Census Income (564 Γ— 284). This version of the dataset has 564 records and 284 attributes, as opposed to the original's 32,561 records and 15 atributes. Because feature enginering and one...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 232

πŸ““ Popular Notebooks:
1. /emanfatima2025/eda-of-adult-census-income-data
2. /sumedh1507/adult-income-prediction
3. /devraai/adult-census-income-analysis-and-prediction

πŸ”— Powered by @datasets1
πŸ“Š Top 100 Hits songs of 2025

πŸ“ Description:
The Top 100 Songs 2025 dataset provides a collection of information about the most popular songs of the year 2025. This dataset is designed for music analysts, researchers, marketers, etc who want to explore trends in music consuption, genre popularity, and artist performance. It collect data from m...

πŸ“₯ Download:
β€’ Size: Size not specified
β€’ Direct API: Download Link

πŸ“Š Stats:
β€’ Files: 1
β€’ Downloads: 245

πŸ““ Popular Notebooks:
1. /shivamja/top-100-songs-eda-2025
2. /mubeenshehzadi/eda-of-hits-songs-2025
3. /sonawanelalitsunil/top-100-hits-of-2025-ml-0-75

πŸ”— Powered by @datasets1