Kaggle Data Hub
29.2K subscribers
962 photos
15 videos
309 files
1.23K links
Your go-to hub for Kaggle datasets โ€“ explore, analyze, and leverage data for Machine Learning and Data Science projects.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
Dataset Name: Indicators of Heart Disease (2022 UPDATE)
Basic Description: 2022 annual CDC survey data of 400k+ adults related to their health status

๐Ÿ“– FULL DATASET DESCRIPTION:
==================================
According to the CDC, heart disease is a leading cause of death for people of most races in the U.S. (African Americans, American Indians and Alaska Natives, and whites). About half of all Americans (47%) have at least 1 of 3 major risk factors for heart disease: high blood pressure, high cholesterol, and smoking. Other key indicators include diabetes status, obesity (high BMI), not getting enough physical activity, or drinking too much alcohol. Identifying and preventing the factors that have the greatest impact on heart disease is very important in healthcare. In turn, developments in computing allow the application of machine learning methods to detect "patterns" in the data that can predict a patient's condition.
The dataset originally comes from the CDC and is a major part of the Behavioral Risk Factor Surveillance System (BRFSS), which conducts annual telephone surveys to collect data on the health status of U.S. residents. As described by the CDC: "Established in 1984 with 15 states, BRFSS now collects data in all 50 states, the District of Columbia, and three U.S. territories. BRFSS completes more than 400,000 adult interviews each year, making it the largest continuously conducted health survey system in the world. The most recent dataset includes data from 2023. In this dataset, I noticed many factors (questions) that directly or indirectly influence heart disease, so I decided to select the most relevant variables from it. I also decided to share with you two versions of the most recent dataset: with NaNs and without it.

๐Ÿ“ฅ DATASET DOWNLOAD INFORMATION
==================================

๐Ÿ”ด Dataset Size: Download dataset as zip (22 MB)

๐Ÿ”ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/kamilpytlak/personal-key-indicators-of-heart-disease

๐Ÿ“Š Additional information:
==================================
File count not found
Views: 503,000
Downloads: 90,500

๐Ÿ“š RELATED NOTEBOOKS:
==================================
1. Heart Disease. Exploratory data analysis. | Upvotes: 981
URL: https://www.kaggle.com/code/georgyzubkov/heart-disease-exploratory-data-analysis

2. Diabetes Health Indicators Dataset | Upvotes: 771
URL: https://www.kaggle.com/datasets/alexteboul/diabetes-health-indicators-dataset

3. Heart Disease Prediction | Upvotes: 736
URL: https://www.kaggle.com/code/andls555/heart-disease-prediction

4. Advance Data Preprocessing | Upvotes: 711
URL: https://www.kaggle.com/code/nkitgupta/advance-data-preprocessing

5. Coronary Heart Disease Prediction in Ten Years | Upvotes: 2
URL: https://www.kaggle.com/datasets/palakdoshijain/coronary-heart-disease-prediction-in-ten-years

==================================
โญ๏ธ By: https://t.me/datasets1
โค1
Forwarded from Machine Learning
๐Ÿ”ฅ Trending Repository: awesome-public-datasets

๐Ÿ“ Description: A topic-centric list of HQ open datasets.

๐Ÿ”— Repository URL: https://github.com/awesomedata/awesome-public-datasets

๐ŸŒ Website: https://awesomedataworld.slack.com

๐Ÿ“– Readme: https://github.com/awesomedata/awesome-public-datasets#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 64.6K stars
๐Ÿ‘€ Watchers: 2.3k
๐Ÿด Forks: 10.3K forks

๐Ÿ’ป Programming Languages: Not available

๐Ÿท๏ธ Related Topics:
#opendata #datasets #aaron_swartz #awesome_public_datasets


==================================
๐Ÿง  By: https://t.me/DataScienceM
โค1
Dataset Name: Energy consumption of the Netherlands
Basic Description: Electricity and Gas consumed in the Netherlands every year

๐Ÿ“– FULL DATASET DESCRIPTION:
==================================
The energy network of the Netherlands is managed by a few companies. Every year, these companies release on their websites a table with the energy consumption of the areas under their administration. The companies are
The data are anonymized by aggregating the Zipcodes so that every entry describes at least 10 connections.
This market is not competitive, meaning that the zones are assigned. This means that every year they roughly provide energy to the same zipcodes. Small changes can happen from year to year either for a change of management or for a different aggregation of zipcodes.
Every file contains information about groups of zipcodes managed by one of the three companies for a specific year.

๐Ÿ“ฅ DATASET DOWNLOAD INFORMATION
==================================

๐Ÿ”ด Dataset Size: Download dataset as zip (146 MB)

๐Ÿ”ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/lucabasa/dutch-energy

๐Ÿ“Š Additional information:
==================================
File count not found
Views: 139,000
Downloads: 111,000

๐Ÿ“š RELATED NOTEBOOKS:
==================================
1. ใ€ฝ๏ธ|3๏ธโƒฃWays to Deal with Time Series Forecasting | Upvotes: 296
URL: https://www.kaggle.com/code/mfaaris/3-ways-to-deal-with-time-series-forecasting

2. Dutch electricity: EDA, FS, clustering, maps | Upvotes: 199
URL: https://www.kaggle.com/code/bberghuis/dutch-electricity-eda-fs-clustering-maps

3. โšก๏ธEnergy EDA๐Ÿ“Š, Segmentation and Prediction๐Ÿงฎ | Upvotes: 86
URL: https://www.kaggle.com/code/ihsncnkz/energy-eda-segmentation-and-prediction

4. Western Europe Power Consumption | Upvotes: 34
URL: https://www.kaggle.com/datasets/francoisraucent/western-europe-power-consumption

5. Monthly Electricity Production in GWh [2010-2022] | Upvotes: 26
URL: https://www.kaggle.com/datasets/ccanb23/iea-monthly-electricity-statistics

==================================
โญ๏ธ By: https://t.me/datasets1
โค6
Dataset Name: Tuberculosis (TB) Chest X-ray Database
Basic Description: The largest TB Chest X-ray Database

๐Ÿ“– FULL DATASET DESCRIPTION:
==================================
Tuberculosis (TB) Chest X-ray Database A team of researchers from Qatar University, Doha, Qatar, and the University of Dhaka, Bangladesh along with their collaborators from Malaysia in collaboration with medical doctors from Hamad Medical Corporation and Bangladesh have created a database of chest X-ray images for Tuberculosis (TB) positive cases along with Normal images. In our current release, there are 700 TB images publicly accessible and 2800 TB images can be downloaded from NIAID TB portal[3] by signing an agreement, and 3500 normal images.
Note: -The research team managed to classify TB and Normal Chest X-ray images with an accuracy of 98.3%. This scholarly work is published in IEEE Access. Please make sure you give credit to us while using the dataset, code, and trained models.
Credit should go to the following: Tawsifur Rahman, Amith Khandakar, Muhammad A. Kadir, Khandaker R. Islam, Khandaker F. Islam, Zaid B. Mahbub, Mohamed Arselene Ayari, Muhammad E. H. Chowdhury. (2020) "Reliable Tuberculosis Detection using Chest X-ray with Deep Learning, Segmentation and Visualization". IEEE Access, Vol. 8, pp 191586 - 191601. DOI. 10.1109/ACCESS.2020.3031384. Paper Link
To view images please check image folders and references of each image are provided in the metadata.csv.
Research Team members and their affiliation Muhammad E. H. Chowdhury, PhD (mchowdhury@qu.edu.qa) Department of Electrical Engineering, Qatar University, Doha-2713, Qatar Tawsifur Rahman (tawsifurrahman.1426@gmail.com) Department of Electrical Engineering, Qatar University, Doha-2713, Qatar Amith Khandakar (amitk@qu.edu.qa) Department of Electrical Engineering, Qatar University, Doha-2713, Qatar Rashid Mazhar, MD Thoracic Surgery, Hamad General Hospital, Doha-3050, Qatar Muhammad Abdul Kadir, PhD Department of Biomedical Physics & Technology, University of Dhaka, Dhaka-1000, Bangladesh Zaid Bin Mahbub, PhD Department of Mathematics and Physics, North South University, Dhaka-1229, Bangladesh Khandakar R. Islam, MD Department of Orthodontics, Bangabandhu Sheikh Mujib Medical University, Dhaka-1000, Bangladesh

๐Ÿ“ฅ DATASET DOWNLOAD INFORMATION
==================================

๐Ÿ”ด Dataset Size: Download dataset as zip (696 MB)

๐Ÿ”ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/tawsifurrahman/tuberculosis-tb-chest-xray-dataset

๐Ÿ“Š Additional information:
==================================
File count not found
Views: 171,000
Downloads: 29,200

๐Ÿ“š RELATED NOTEBOOKS:
==================================
1. Tuberculosis_classification_DenseNet121_GradCAM | Upvotes: 219
URL: https://www.kaggle.com/code/sanphats/tuberculosis-classification-densenet121-gradcam

2. Pneumonia Detections Using Deep Learning | Upvotes: 214
URL: https://www.kaggle.com/code/chanchal24/pneumonia-detections-using-deep-learning

3. tuberculosis_0.99 accuracy | Upvotes: 168
URL: https://www.kaggle.com/code/akshayr009/tuberculosis-0-99-accuracy

4. DA and DB - TB Chest X-ray Datasets | Upvotes: 4
URL: https://www.kaggle.com/datasets/vbookshelf/da-and-db-tb-chest-x-ray-datasets

5. Dataset (Covid-Bacterial-Viral-Normal-Emphysema) | Upvotes: 1
URL: https://www.kaggle.com/datasets/minhnhat232/dataset-covid-bacterial-viral-normal-emphysema

==================================
โญ๏ธ By: https://t.me/datasets1
โค5
Dataset Name: NBA Database
Basic Description: Daily Updated SQLite Database โ€” 64,000+ Games, 4800+ Players, and 30 Teams ๐Ÿ€

๐Ÿ“– FULL DATASET DESCRIPTION:
==================================
This dataset is updated daily and includes:
โฎ• View the and โฎ• Sponsor project:

๐Ÿ“ฅ DATASET DOWNLOAD INFORMATION
==================================

๐Ÿ”ด Dataset Size: Download dataset as zip (731 MB)

๐Ÿ”ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/wyattowalsh/basketball

๐Ÿ“Š Additional information:
==================================
File count not found
Views: 353,000
Downloads: 46,700

๐Ÿ“š RELATED NOTEBOOKS:
==================================
1. Historic NBA Drafting, Game, and Player Analysis | Upvotes: 263
URL: https://www.kaggle.com/code/agilesifaka/historic-nba-drafting-game-and-player-analysis

2. NBA Stats (1947-present) | Upvotes: 195
URL: https://www.kaggle.com/datasets/sumitrodatta/nba-aba-baa-stats

3. Database Updater (Daily) | Upvotes: 58
URL: https://www.kaggle.com/code/wyattowalsh/database-updater-daily

4. Using SQL | Upvotes: 58
URL: https://www.kaggle.com/code/wyattowalsh/using-sql

5. NBA Games Box Score Since 1949 | Upvotes: 11
URL: https://www.kaggle.com/datasets/rafaelgreca/nba-games-box-score-since-1949

==================================
โญ๏ธ By: https://t.me/datasets1
โค8
๐Ÿ”ฅ Trending Repository: public-apis

๐Ÿ“ Description: A collective list of free APIs

๐Ÿ”— Repository URL: https://github.com/public-apis/public-apis

๐ŸŒ Website: https://APILayer.com/?utm_source=Github&utm_medium=Referral&utm_campaign=Public-apis-repo

๐Ÿ“– Readme: https://github.com/public-apis/public-apis#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 369K stars
๐Ÿ‘€ Watchers: 4.4k
๐Ÿด Forks: 38.8K forks

๐Ÿ’ป Programming Languages: Python - Shell

๐Ÿท๏ธ Related Topics:
#api #lists #open_source #list #development #public #resources #dataset #free #software #apis #public_api #public_apis


==================================
๐Ÿง  By: https://t.me/DataScienceM
โค5
Forwarded from Machine Learning
๐Ÿ“Œ Creating a Streamlit App for Satellite Imagery Visualization: A Step-by-Step Guide

๐Ÿ—‚ Category: DATA VISUALIZATION

๐Ÿ•’ Date: 2024-06-21 | โฑ๏ธ Read time: 11 min read

Explore any point on Earth at any time using satellite data with Streamlit
โค4
Enjoy our content? Advertise on this channel and reach a highly engaged audience! ๐Ÿ‘‰๐Ÿป

It's easy with Telega.io. As the leading platform for native ads and integrations on Telegram, it provides user-friendly and efficient tools for quick and automated ad launches.

โšก๏ธ Place your ad here in three simple steps:

Sign up

2 Top up the balance in a convenient way

3 Create your advertising post

If your ad aligns with our content, weโ€™ll gladly publish it.

Start your promotion journey now!
โค4
Tomorrow we will republish all of Data Set, are you excited?
๐Ÿ”ฅ11โค1
Forwarded from Free Online Courses
โญ๏ธ Hello my advertiser friend!

Iโ€™m Eng. Hussein Sheikho ๐Ÿ‘‹ and Iโ€™m excited to share our special promotional offer with you! ๐ŸŽฏ

๐Ÿ’ฅ Promo Offer:
Promote your ad across all our listed channels for only $35! ๐Ÿ’ฐ
๐Ÿ“ข We accept all types and formats of advertisements.

โœ… Publishing Plan:
Your ad will be published for 20 days across all our channels,
plus it will be pinned for 7 days ๐Ÿ”

๐Ÿง‘โ€๐Ÿ’ป For Programming Channel Owners Only:
Want your tech channel to grow fast? ๐Ÿš€
You can add your channel to our promo folder for just $20/month โ€”
average growth rate 2000+ subscribers/month ๐Ÿ“ˆ

๐Ÿ“ฉ Contact me for more details:
๐Ÿ‘‰ t.me/HusseinSheikho

๐ŸŒฑ Letโ€™s grow together!

Our Share folder (our channels) ๐Ÿ‘‡
https://t.me/addlist/8_rRW2scgfRhOTc0
Please open Telegram to view this post
VIEW IN TELEGRAM
โค3
Kaggle Data Hub pinned ยซโญ๏ธ Hello my advertiser friend! Iโ€™m Eng. Hussein Sheikho ๐Ÿ‘‹ and Iโ€™m excited to share our special promotional offer with you! ๐ŸŽฏ ๐Ÿ’ฅ Promo Offer: Promote your ad across all our listed channels for only $35! ๐Ÿ’ฐ ๐Ÿ“ข We accept all types and formats of advertisements.โ€ฆยป
๐Ÿ“Š Global Earthquake-Tsunami Risk Assessment Dataset

๐Ÿ“ Description:
The Global Earthquake-Tsunami Risk Assessment Dataset is a comprehensive, machine learning-ready dataset containing seismic characteristics and tsunami potential indicators for 782 significant earthquakes recorded globally from 2001 to 2022. This dataset is specifically designed for tsunami risk pre...

๐Ÿ“ฅ Download:
โ€ข Size: Size not specified
โ€ข Direct API: Download Link

๐Ÿ“Š Stats:
โ€ข Files: 1
โ€ข Downloads: 216,000

๐Ÿ““ Popular Notebooks:
1. /timurtaepov/tsunami-risk-prediction-with-geo-rbf
2. /lightonkalumba/predicting-tsunamis-a-deep-dive-2001-2022
3. /hsantaymas/tsunami-prediction-debunking-94-accuracy

๐Ÿ”— Powered by @datasets1
โค4๐Ÿ”ฅ1
๐Ÿ“Š Hospital Beds Management

๐Ÿ“ Description:
This collection of synthetic hospital datasets is designed to simulate real-world operations for a medium-sized hospital, focusing on staffing, patient admissions, and bed allocation among services. The data allows for exploration and analysis of hospital resource distribution, including personnel d...

๐Ÿ“ฅ Download:
โ€ข Size: Size not specified
โ€ข Direct API: Download Link

๐Ÿ“Š Stats:
โ€ข Files: 4
โ€ข Downloads: 490,000

๐Ÿ““ Popular Notebooks:
1. /calebboen/predicting-patient-satisfaction
2. /khadijaisse/managing-hospital-beds
3. /moneebarifbbs/hospitalmangment

๐Ÿ”— Powered by @datasets1
โค5๐Ÿ‘1
๐Ÿ“Š Life Style Data

๐Ÿ“ฅ Download:
โ€ข Size: Size not specified
โ€ข Direct API: Download Link

๐Ÿ“Š Stats:
โ€ข Files: 2
โ€ข Downloads: 216,000

๐Ÿ““ Popular Notebooks:
1. /jockeroika/life-style-analysis
2. /jockeroika/health-lifestyle-recommendation-system
3. /jockeroika/workout-recommendation-system

๐Ÿ”— Powered by @datasets1
โค3
๐Ÿ“Š BMW Worldwide Sales Records (2010โ€“2024)

๐Ÿ“ Description:
This dataset provides detailed sales information for BMW vehicles from 2010 to 2024 across global regions. It includes attributes such as model, year, engine size, mileage, transmission type, fuel type, price, and sales volume. Scholars and analysts can use it to explore market trends, pricing appro...

๐Ÿ“ฅ Download:
โ€ข Size: Size not specified
โ€ข Direct API: Download Link

๐Ÿ“Š Stats:
โ€ข Files: 1
โ€ข Downloads: 414,000

๐Ÿ““ Popular Notebooks:
1. /devraai/bmw-sales-analysis-and-prediction-20102024
2. /sonawanelalitsunil/bmw-global-sales-trends-2010-24-ml-100
3. /elmarhagverdiyev/bmw-sales-analysis-2010-2024

๐Ÿ”— Powered by @datasets1
โค3
๐Ÿ“Š Student exam score dataset analysis

๐Ÿ“ Description:
This dataset shows all information about student performance in exam. so exam score related with student study habits and background to support analysis of student performance. This dataset use in college, school and university ect, for student exam score student are pass or fail. This dataset are c...

๐Ÿ“ฅ Download:
โ€ข Size: Size not specified
โ€ข Direct API: Download Link

๐Ÿ“Š Stats:
โ€ข Files: 1
โ€ข Downloads: 216,000

๐Ÿ““ Popular Notebooks:
1. /ahmedelawady8/student-exam-score-prediction
2. /chitranshusinha/student-exam-score-prediction
3. /quetanit/student-examination-lsm-84-68

๐Ÿ”— Powered by @datasets1
โค2๐Ÿ”ฅ1
๐Ÿ“Š Shopping_Behavior_Dataset

๐Ÿ“ Description:
This dataset contains detailed information about customer demographics and shopping behavior. It includes attributes such as gender, age group, time period income, and spending score, providing insights into how consumer characteristics influence purchasing decisions. The data can be used for market...

๐Ÿ“ฅ Download:
โ€ข Size: Size not specified
โ€ข Direct API: Download Link

๐Ÿ“Š Stats:
โ€ข Files: 1
โ€ข Downloads: 414,000

๐Ÿ““ Popular Notebooks:
1. /devraai/shopping-behavior-analysis-and-purchase-prediction
2. /sumedh1507/shopping-behaviour-insights
3. /alexkhr/omprehensive-multi-method-clustering-and-insights

๐Ÿ”— Powered by @datasets1
โค3๐Ÿ”ฅ1
๐Ÿ“Š Health Lifestyle Dataset

๐Ÿ“ Description:
This dataset is useful for students, researchers, and data enthusiasts who want to practice data analysis, visualization, and machine learning in the health domain. โœ… This dataset can be used for health analysis, disease prediction, and exploring lifestyle impacts on human health.

๐Ÿ“ฅ Download:
โ€ข Size: Size not specified
โ€ข Direct API: Download Link

๐Ÿ“Š Stats:
โ€ข Files: 1
โ€ข Downloads: 264,000

๐Ÿ““ Popular Notebooks:
1. /tan5577/eda-for-healthy-life-style-2025
2. /rehan497/health-lifestyle-data-analysis
3. /devraai/health-and-lifestyle-data-analysis-and-prediction

๐Ÿ”— Powered by @datasets1
๐Ÿ”ฅ1
๐Ÿ“Š Student Success: Factors & Insights

๐Ÿ“ Description:
This dataset contains information of about 6,590 students and the factors that may affect their academic performance. It includes variables such as study habits, attendance, parental involvement, access to resources, extracurricular activities, sleep hours, motivation, and socio-economic background....

๐Ÿ“ฅ Download:
โ€ข Size: Size not specified
โ€ข Direct API: Download Link

๐Ÿ“Š Stats:
โ€ข Files: 1
โ€ข Downloads: 348,000

๐Ÿ““ Popular Notebooks:
1. /iqranaz240/data-visualization-pandas
2. /sonawanelalitsunil/student-success-insights-eda-ml-94-10
3. /sumedh1507/predicting-exam-score

๐Ÿ”— Powered by @datasets1
โค2๐Ÿ”ฅ1
๐Ÿ“Š Mental Health & Social Media Balance Dataset

๐Ÿ“ Description:
This dataset captures the delicate relationship between social media habits and mental well-being. It combines variables such as screen time, stress level, sleep quality, digital detox days, and happiness index. Ideal for regression, correlation, or mental health prediction tasks. This dataset captu...

๐Ÿ“ฅ Download:
โ€ข Size: Size not specified
โ€ข Direct API: Download Link

๐Ÿ“Š Stats:
โ€ข Files: 1
โ€ข Downloads: 137,000

๐Ÿ““ Popular Notebooks:
1. /devraai/mental-health-and-social-media-data-analysis
2. /mahmoudredagamail/mental-health-social-media-balance-dataset
3. /miadul/mental-health-social-media-behavior-analysis

๐Ÿ”— Powered by @datasets1
๐Ÿ”ฅ1
๐Ÿ“Š ๐ŸŒ World Population by Country 2025 (Latest)

๐Ÿ“ Description:
๐Ÿ“ˆ Whether youโ€™re analyzing Asiaโ€™s population boom, Europeโ€™s aging curve, or Africaโ€™s youthful surge โ€” this dataset gives you a complete view of the worldโ€™s demographic balance in 2025. ๐ŸŒŽ With 233 rows and 12 insightful columns, itโ€™s ready for your next EDA, visualization, or predictive modeling proj...

๐Ÿ“ฅ Download:
โ€ข Size: Size not specified
โ€ข Direct API: Download Link

๐Ÿ“Š Stats:
โ€ข Files: 1
โ€ข Downloads: 216,000

๐Ÿ““ Popular Notebooks:
1. /wafaaelhusseini/analyzing-the-world-population-2025
2. /devraai/world-population-2025-analysis-and-prediction
3. /yigitcanbltc/the-human-development-matrix

๐Ÿ”— Powered by @datasets1
โค4๐Ÿ”ฅ1