Dataset Name: Linked In Job Postings (2023 - 2024)
Basic Description: LinkedIn Job Postings (2023 - 2024)
๐ FULL DATASET DESCRIPTION:
==================================
Scraper Code - https://github.com/ArshKA/LinkedIn-Job-Scraper
Every day, thousands of companies and individuals turn to LinkedIn in search of talent. This dataset contains a nearly comprehensive record of 124,000+ job postings listed in 2023 and 2024. Each individual posting contains dozens of valuable attributes for both postings and companies, including the title, job description, salary, location, application URL, and work-types (remote, contract, etc), in addition to separate files containing the benefits, skills, and industries associated with each posting. The majority of jobs are also linked to a company, which are all listed in another csv file containing attributes such as the company description, headquarters location, and number of employees, and follower count.
With so many datapoints, the potential for exploration of this dataset is vast and includes exploring the highest compensated titles, companies, and locations; predicting salaries/benefits through NLP; and examining how industries and companies vary through their internship offerings and benefits. Future updates will permit further exploration into time-based trends, including company growth, prevalence of remote jobs, and demand of individual job titles over time.
Thank you to @zoeyyuzou for scraping an additional 100,000 jobs
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (166 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/arshkon/linkedin-job-postings
๐ Additional information:
==================================
File count not found
Views: 126,000
Downloads: 53,100
๐ RELATED NOTEBOOKS:
==================================
1. "Decoding the Job Market: An In-depth Exploration | Upvotes: 84
URL: https://www.kaggle.com/code/pratul007/decoding-the-job-market-an-in-depth-exploration
2. LinkedIn Job Postings 2023 Data Analysis | Upvotes: 58
URL: https://www.kaggle.com/code/enricofindley/linkedin-job-postings-2023-data-analysis
3. LinkedIn_job_data | Upvotes: 49
URL: https://www.kaggle.com/datasets/shashankshukla123123/linkedin-job-data
4. Text-based Resume Filtering Tool | Upvotes: 46
URL: https://www.kaggle.com/code/hasanz9/text-based-resume-filtering-tool
5. International Job Postings September 2021 | Upvotes: 19
URL: https://www.kaggle.com/datasets/techmap/international-job-postings-september-2021
==================================
โญ๏ธ By: https://t.me/datasets1
Basic Description: LinkedIn Job Postings (2023 - 2024)
๐ FULL DATASET DESCRIPTION:
==================================
Scraper Code - https://github.com/ArshKA/LinkedIn-Job-Scraper
Every day, thousands of companies and individuals turn to LinkedIn in search of talent. This dataset contains a nearly comprehensive record of 124,000+ job postings listed in 2023 and 2024. Each individual posting contains dozens of valuable attributes for both postings and companies, including the title, job description, salary, location, application URL, and work-types (remote, contract, etc), in addition to separate files containing the benefits, skills, and industries associated with each posting. The majority of jobs are also linked to a company, which are all listed in another csv file containing attributes such as the company description, headquarters location, and number of employees, and follower count.
With so many datapoints, the potential for exploration of this dataset is vast and includes exploring the highest compensated titles, companies, and locations; predicting salaries/benefits through NLP; and examining how industries and companies vary through their internship offerings and benefits. Future updates will permit further exploration into time-based trends, including company growth, prevalence of remote jobs, and demand of individual job titles over time.
Thank you to @zoeyyuzou for scraping an additional 100,000 jobs
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (166 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/arshkon/linkedin-job-postings
๐ Additional information:
==================================
File count not found
Views: 126,000
Downloads: 53,100
๐ RELATED NOTEBOOKS:
==================================
1. "Decoding the Job Market: An In-depth Exploration | Upvotes: 84
URL: https://www.kaggle.com/code/pratul007/decoding-the-job-market-an-in-depth-exploration
2. LinkedIn Job Postings 2023 Data Analysis | Upvotes: 58
URL: https://www.kaggle.com/code/enricofindley/linkedin-job-postings-2023-data-analysis
3. LinkedIn_job_data | Upvotes: 49
URL: https://www.kaggle.com/datasets/shashankshukla123123/linkedin-job-data
4. Text-based Resume Filtering Tool | Upvotes: 46
URL: https://www.kaggle.com/code/hasanz9/text-based-resume-filtering-tool
5. International Job Postings September 2021 | Upvotes: 19
URL: https://www.kaggle.com/datasets/techmap/international-job-postings-september-2021
==================================
โญ๏ธ By: https://t.me/datasets1
Dataset Name: Online Payments Fraud Detection Dataset
Basic Description: Online payment fraud big dataset for testing and practice purpose
๐ FULL DATASET DESCRIPTION:
==================================
The below column reference:
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (186 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/rupakroy/online-payments-fraud-detection-dataset
๐ Additional information:
==================================
File count not found
Views: 161,000
Downloads: 27,300
๐ RELATED NOTEBOOKS:
==================================
1. Feature Engineering and Feature Selection | Upvotes: 358
URL: https://www.kaggle.com/code/nkitgupta/feature-engineering-and-feature-selection
2. Online Payments Fraud Detection | Upvotes: 242
URL: https://www.kaggle.com/code/ananthu19/online-payments-fraud-detection
3. Online Payments Fraud Detection Project | Upvotes: 90
URL: https://www.kaggle.com/code/nehahatti/online-payments-fraud-detection-project
4. credit card fraud dataset | Upvotes: 2
URL: https://www.kaggle.com/datasets/visheshbairwa/credit-card-fraud-dataset
5. Fraud Transactions Dataset | Upvotes: 1
URL: https://www.kaggle.com/datasets/tanayatipre/fraud-transactions-dataset
==================================
โญ๏ธ By: https://t.me/datasets1
Basic Description: Online payment fraud big dataset for testing and practice purpose
๐ FULL DATASET DESCRIPTION:
==================================
The below column reference:
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (186 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/rupakroy/online-payments-fraud-detection-dataset
๐ Additional information:
==================================
File count not found
Views: 161,000
Downloads: 27,300
๐ RELATED NOTEBOOKS:
==================================
1. Feature Engineering and Feature Selection | Upvotes: 358
URL: https://www.kaggle.com/code/nkitgupta/feature-engineering-and-feature-selection
2. Online Payments Fraud Detection | Upvotes: 242
URL: https://www.kaggle.com/code/ananthu19/online-payments-fraud-detection
3. Online Payments Fraud Detection Project | Upvotes: 90
URL: https://www.kaggle.com/code/nehahatti/online-payments-fraud-detection-project
4. credit card fraud dataset | Upvotes: 2
URL: https://www.kaggle.com/datasets/visheshbairwa/credit-card-fraud-dataset
5. Fraud Transactions Dataset | Upvotes: 1
URL: https://www.kaggle.com/datasets/tanayatipre/fraud-transactions-dataset
==================================
โญ๏ธ By: https://t.me/datasets1
โค4๐2
Dataset Name: Gallstone Dataset (UCI)
Basic Description: Gallstone Dataset (UCI Machine Learning Repository)
๐ FULL DATASET DESCRIPTION:
==================================
Overview
The clinical dataset was collected from the Internal Medicine Outpatient Clinic of Ankara VM Medical Park Hospital and includes data from 319 individuals (June 2022โJune 2023), 161 of whom were diagnosed with gallstone disease. It contains 38 features, including demographic, bioimpedance, and laboratory data, and was ethically approved by the Ankara City Hospital Ethics Committee (E2-23-4632). Demographic variables are age, sex, height, weight, and BMI. Bioimpedance data includes total, extracellular, and intracellular water; muscle and fat mass; protein; visceral fat area; and hepatic fat. Laboratory features are glucose, total cholesterol, HDL, LDL, triglycerides, AST, ALT, ALP, creatinine, GFR, CRP, hemoglobin, and vitamin D. The dataset is complete, with no missing values, and balanced in terms of disease status, eliminating the need for additional preprocessing. It provides a strong foundation for machine learning-based gallstone prediction using non-imaging features.
Context
Gallstone disease is a common gastrointestinal disorder characterized by the formation of solid particles (gallstones) in the gallbladder. These stones can lead to complications like inflammation, infection, and obstruction of the bile ducts. Understanding the medical, demographic, and lifestyle factors associated with gallstone formation is crucial for early diagnosis and prevention strategies.
This dataset has been curated to support research, exploratory data analysis (EDA), and machine learning tasks focused on gallstone risk prediction and clinical decision-making.
Dataset Details
Size: The dataset has 319 rows and 40 columns.
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (81 kB)
๐ฐ Direct dataset download link:
URL not found
๐ Additional information:
==================================
File count not found
Views: 1,128
Downloads: 246
๐ RELATED NOTEBOOKS:
==================================
1. Heart Attack Risk Prediction Dataset | Upvotes: 274
URL: https://www.kaggle.com/datasets/iamsouravbanerjee/heart-attack-prediction-dataset
2. Diabetes_prediction_dataset | Upvotes: 88
URL: https://www.kaggle.com/datasets/marshalpatel3558/diabetes-prediction-dataset
3. ๐ฉบ Chronic Kidney Disease Dataset ๐ฉบ | Upvotes: 60
URL: https://www.kaggle.com/datasets/rabieelkharoua/chronic-kidney-disease-dataset-analysis
4. Getting_Started_with_Gallstone| EDA | ML | Upvotes: 10
URL: https://www.kaggle.com/code/xixama/getting-started-with-gallstone-eda-ml
5. XGBoost Gallstone Prediction Model (FIV) | Upvotes: 9
URL: https://www.kaggle.com/code/rafipratamag/xgboost-gallstone-prediction-model-fiv
==================================
โญ๏ธ By: https://t.me/datasets1
Basic Description: Gallstone Dataset (UCI Machine Learning Repository)
๐ FULL DATASET DESCRIPTION:
==================================
Overview
The clinical dataset was collected from the Internal Medicine Outpatient Clinic of Ankara VM Medical Park Hospital and includes data from 319 individuals (June 2022โJune 2023), 161 of whom were diagnosed with gallstone disease. It contains 38 features, including demographic, bioimpedance, and laboratory data, and was ethically approved by the Ankara City Hospital Ethics Committee (E2-23-4632). Demographic variables are age, sex, height, weight, and BMI. Bioimpedance data includes total, extracellular, and intracellular water; muscle and fat mass; protein; visceral fat area; and hepatic fat. Laboratory features are glucose, total cholesterol, HDL, LDL, triglycerides, AST, ALT, ALP, creatinine, GFR, CRP, hemoglobin, and vitamin D. The dataset is complete, with no missing values, and balanced in terms of disease status, eliminating the need for additional preprocessing. It provides a strong foundation for machine learning-based gallstone prediction using non-imaging features.
Context
Gallstone disease is a common gastrointestinal disorder characterized by the formation of solid particles (gallstones) in the gallbladder. These stones can lead to complications like inflammation, infection, and obstruction of the bile ducts. Understanding the medical, demographic, and lifestyle factors associated with gallstone formation is crucial for early diagnosis and prevention strategies.
This dataset has been curated to support research, exploratory data analysis (EDA), and machine learning tasks focused on gallstone risk prediction and clinical decision-making.
Dataset Details
Size: The dataset has 319 rows and 40 columns.
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (81 kB)
๐ฐ Direct dataset download link:
URL not found
๐ Additional information:
==================================
File count not found
Views: 1,128
Downloads: 246
๐ RELATED NOTEBOOKS:
==================================
1. Heart Attack Risk Prediction Dataset | Upvotes: 274
URL: https://www.kaggle.com/datasets/iamsouravbanerjee/heart-attack-prediction-dataset
2. Diabetes_prediction_dataset | Upvotes: 88
URL: https://www.kaggle.com/datasets/marshalpatel3558/diabetes-prediction-dataset
3. ๐ฉบ Chronic Kidney Disease Dataset ๐ฉบ | Upvotes: 60
URL: https://www.kaggle.com/datasets/rabieelkharoua/chronic-kidney-disease-dataset-analysis
4. Getting_Started_with_Gallstone| EDA | ML | Upvotes: 10
URL: https://www.kaggle.com/code/xixama/getting-started-with-gallstone-eda-ml
5. XGBoost Gallstone Prediction Model (FIV) | Upvotes: 9
URL: https://www.kaggle.com/code/rafipratamag/xgboost-gallstone-prediction-model-fiv
==================================
โญ๏ธ By: https://t.me/datasets1
โค3
Dataset Name: Glaucoma Fundus Imaging Datasets
Basic Description: Fundus images and OD/OC masks from ORIGA, REFUGE, and G1020 datasets
๐ FULL DATASET DESCRIPTION:
==================================
Contains fundus images and corresponding optic disc/cup segmentations and Glaucoma diagnosis information from three datasets: ORIGA, REFUGE, and G1020. Links to the corresponding papers are below.
ORIGA: https://pubmed.ncbi.nlm.nih.gov/21095735/ REFUGE: https://ieee-dataport.org/documents/refuge-retinal-fundus-glaucoma-challenge G1020: https://arxiv.org/abs/2006.09158
The datasets were modified to add fundus images cropped to the region of the optic disc and cup, and the corresponding masks (Images_Cropped and Masks_Cropped).
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (6 GB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/arnavjain1/glaucoma-datasets
๐ Additional information:
==================================
Total files: 21,600
Views: 50,600
Downloads: 8,813
๐ RELATED NOTEBOOKS:
==================================
1. UNET Segmentation of OC/OD | Upvotes: 155
URL: https://www.kaggle.com/code/arnavjain1/unet-segmentation-of-oc-od
2. u_NET_GLAUCOMA DETECTION_FUDNDUS_PYTORCH | Upvotes: 41
URL: https://www.kaggle.com/code/chukwuebukaanulunko/u-net-glaucoma-detection-fudndus-pytorch
3. OC/OD segmentation using keras | Upvotes: 38
URL: https://www.kaggle.com/code/vuppalaadithyasairam/oc-od-segmentation-using-keras
4. Glaucoma OCT Scans (Origa) Augmented Dataset | Upvotes: 10
URL: https://www.kaggle.com/datasets/scipygaurav/glaucoma-oct-scans-origa-augmented-dataset
5. Glaucoma - OD/CD segmentation for YOLO. | Upvotes: 0
URL: https://www.kaggle.com/datasets/nhnthanhj/glaucoma-odcd-segmentation-for-yolo
==================================
โญ๏ธ By: https://t.me/datasets1
Basic Description: Fundus images and OD/OC masks from ORIGA, REFUGE, and G1020 datasets
๐ FULL DATASET DESCRIPTION:
==================================
Contains fundus images and corresponding optic disc/cup segmentations and Glaucoma diagnosis information from three datasets: ORIGA, REFUGE, and G1020. Links to the corresponding papers are below.
ORIGA: https://pubmed.ncbi.nlm.nih.gov/21095735/ REFUGE: https://ieee-dataport.org/documents/refuge-retinal-fundus-glaucoma-challenge G1020: https://arxiv.org/abs/2006.09158
The datasets were modified to add fundus images cropped to the region of the optic disc and cup, and the corresponding masks (Images_Cropped and Masks_Cropped).
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (6 GB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/arnavjain1/glaucoma-datasets
๐ Additional information:
==================================
Total files: 21,600
Views: 50,600
Downloads: 8,813
๐ RELATED NOTEBOOKS:
==================================
1. UNET Segmentation of OC/OD | Upvotes: 155
URL: https://www.kaggle.com/code/arnavjain1/unet-segmentation-of-oc-od
2. u_NET_GLAUCOMA DETECTION_FUDNDUS_PYTORCH | Upvotes: 41
URL: https://www.kaggle.com/code/chukwuebukaanulunko/u-net-glaucoma-detection-fudndus-pytorch
3. OC/OD segmentation using keras | Upvotes: 38
URL: https://www.kaggle.com/code/vuppalaadithyasairam/oc-od-segmentation-using-keras
4. Glaucoma OCT Scans (Origa) Augmented Dataset | Upvotes: 10
URL: https://www.kaggle.com/datasets/scipygaurav/glaucoma-oct-scans-origa-augmented-dataset
5. Glaucoma - OD/CD segmentation for YOLO. | Upvotes: 0
URL: https://www.kaggle.com/datasets/nhnthanhj/glaucoma-odcd-segmentation-for-yolo
==================================
โญ๏ธ By: https://t.me/datasets1
Forwarded from ENG. Hussein Sheikho
Thordata provides the perfect solution for all your data scraping needs!
Enjoy secure, uninterrupted scraping with our rotating and sticky IPs.
Perfect for avoiding blocks and handling high-volume requests.
Access over 195 countries with advanced targeting options to pinpoint your ideal IPs, whether by country, state, city, or ASN.
Get access to unlimited bandwidth and a 99.9% uptime guaranteeโideal for seamless, fast data collection.
Support for SOCKS5 and HTTP(S) protocols, ensuring compatibility with all your favorite scraping tools and services.
๐ง Start experiencing Thordataโs https://www.thordata.com/?ls=DhthVzyG&lk=Data power in your data science workflows today!
Whether itโs market research, machine learning, or competitive analysisโThordata is your trusted partner in efficient, scalable data scraping.
Please open Telegram to view this post
VIEW IN TELEGRAM
Thordata
Thordata - High-Quality Proxy Service for Web Data Scraping
Thordata's precision proxy solution was chosen to ensure seamless data collection. Enjoy the best prices and services tailored to your needs.
Kaggle Data Hub pinned ยซ๐ Searching for fast, reliable proxies for your data science and machine learning projects? Thordata provides the perfect solution for all your data scraping needs! ๐ https://www.thordata.com/?ls=DhthVzyG&lk=Data โจ Why Choose Thordata? โ
Rotating & Stickyโฆยป
Dataset Name: LFW - People (Face Recognition)
Basic Description: The Labeled Faces in the Wild face recognition dataset.
๐ FULL DATASET DESCRIPTION:
==================================
Welcome to Labeled Faces in the Wild, a database of face photographs designed for studying the problem of unconstrained face recognition. The data set contains more than 13,000 images of faces collected from the web. Each face has been labeled with the name of the person pictured. 1680 of the people pictured have two or more distinct photos in the data set. The only constraint on these faces is that they were detected by the Viola-Jones face detector.
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (244 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/atulanandjha/lfwpeople
๐ Additional information:
==================================
File count not found
Views: 268,000
Downloads: 47,300
๐ RELATED NOTEBOOKS:
==================================
1. Face Detection with OpenCV | Upvotes: 1,975
URL: https://www.kaggle.com/code/serkanpeldek/face-detection-with-opencv
2. Face_Verification_and_Recognition | Upvotes: 578
URL: https://www.kaggle.com/code/diaaessam/face-verification-and-recognition
3. Face recognition - part 1 | Upvotes: 567
URL: https://www.kaggle.com/code/saidakbarp/face-recognition-part-1
4. Labelled Faces in the Wild with cropped faces | Upvotes: 4
URL: https://www.kaggle.com/datasets/jonathanloscalzo/lfw-cropped-faces
5. Labelled Faces in the Wild | Upvotes: 3
URL: https://www.kaggle.com/datasets/ashfaqsyed/labelled-faces-in-the-wild
==================================
โญ๏ธ By: https://t.me/datasets1
Basic Description: The Labeled Faces in the Wild face recognition dataset.
๐ FULL DATASET DESCRIPTION:
==================================
Welcome to Labeled Faces in the Wild, a database of face photographs designed for studying the problem of unconstrained face recognition. The data set contains more than 13,000 images of faces collected from the web. Each face has been labeled with the name of the person pictured. 1680 of the people pictured have two or more distinct photos in the data set. The only constraint on these faces is that they were detected by the Viola-Jones face detector.
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (244 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/atulanandjha/lfwpeople
๐ Additional information:
==================================
File count not found
Views: 268,000
Downloads: 47,300
๐ RELATED NOTEBOOKS:
==================================
1. Face Detection with OpenCV | Upvotes: 1,975
URL: https://www.kaggle.com/code/serkanpeldek/face-detection-with-opencv
2. Face_Verification_and_Recognition | Upvotes: 578
URL: https://www.kaggle.com/code/diaaessam/face-verification-and-recognition
3. Face recognition - part 1 | Upvotes: 567
URL: https://www.kaggle.com/code/saidakbarp/face-recognition-part-1
4. Labelled Faces in the Wild with cropped faces | Upvotes: 4
URL: https://www.kaggle.com/datasets/jonathanloscalzo/lfw-cropped-faces
5. Labelled Faces in the Wild | Upvotes: 3
URL: https://www.kaggle.com/datasets/ashfaqsyed/labelled-faces-in-the-wild
==================================
โญ๏ธ By: https://t.me/datasets1
โค2๐1
Dataset Name: Cards Image Dataset-Classification
Basic Description: 53 classes 7624 train, 265 test, 265 validation images 224 X 224 X 3 jpg format
๐ FULL DATASET DESCRIPTION:
==================================
This is a very high quality dataset of playing card images. All images are 224 X 224 X 3 in jpg format. All images in the dataset have been cropped so that only the image of a single card is present and the card occupies well over 50% of the pixels in the image. There are 7624 training images, 265 test images and 265 validation images. The train, test and validation directories are partitioned into 53 sub directories , one for each of the 53 types of cards. The dataset also includes a csv file which can be used to load the datasets.
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (404 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/gpiosenka/cards-image-datasetclassification
๐ Additional information:
==================================
File count not found
Views: 93,600
Downloads: 45,800
๐ RELATED NOTEBOOKS:
==================================
1. Train Your first PyTorch Model [Card Classifier] | Upvotes: 3,086
URL: https://www.kaggle.com/code/robikscube/train-your-first-pytorch-model-card-classifier
2. Fruit Classification | Upvotes: 194
URL: https://www.kaggle.com/datasets/sshikamaru/fruit-recognition
3. Nike, Adidas and Converse Shoes Images | Upvotes: 105
URL: https://www.kaggle.com/datasets/die9origephit/nike-adidas-and-converse-imaged
4. EffecientNetB3-Cards-Classification-98.85 | Upvotes: 73
URL: https://www.kaggle.com/code/abdallahwagih/effecientnetb3-cards-classification-98-85
5. Card Classification | Pytorch | Upvotes: 55
URL: https://www.kaggle.com/code/youssefelbadry10/card-classification-pytorch
==================================
โญ๏ธ By: https://t.me/datasets1
Basic Description: 53 classes 7624 train, 265 test, 265 validation images 224 X 224 X 3 jpg format
๐ FULL DATASET DESCRIPTION:
==================================
This is a very high quality dataset of playing card images. All images are 224 X 224 X 3 in jpg format. All images in the dataset have been cropped so that only the image of a single card is present and the card occupies well over 50% of the pixels in the image. There are 7624 training images, 265 test images and 265 validation images. The train, test and validation directories are partitioned into 53 sub directories , one for each of the 53 types of cards. The dataset also includes a csv file which can be used to load the datasets.
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (404 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/gpiosenka/cards-image-datasetclassification
๐ Additional information:
==================================
File count not found
Views: 93,600
Downloads: 45,800
๐ RELATED NOTEBOOKS:
==================================
1. Train Your first PyTorch Model [Card Classifier] | Upvotes: 3,086
URL: https://www.kaggle.com/code/robikscube/train-your-first-pytorch-model-card-classifier
2. Fruit Classification | Upvotes: 194
URL: https://www.kaggle.com/datasets/sshikamaru/fruit-recognition
3. Nike, Adidas and Converse Shoes Images | Upvotes: 105
URL: https://www.kaggle.com/datasets/die9origephit/nike-adidas-and-converse-imaged
4. EffecientNetB3-Cards-Classification-98.85 | Upvotes: 73
URL: https://www.kaggle.com/code/abdallahwagih/effecientnetb3-cards-classification-98-85
5. Card Classification | Pytorch | Upvotes: 55
URL: https://www.kaggle.com/code/youssefelbadry10/card-classification-pytorch
==================================
โญ๏ธ By: https://t.me/datasets1
โค1
Dataset Name: Video Emotion Recognition Dataset - 1,000+ Video
Basic Description: Dataset contains videos of various facial and inner emotions from 1000+ people
๐ FULL DATASET DESCRIPTION:
==================================
Dataset comprises 1,000+ videos featuring 11 facial emotions and 15 inner emotions expressed by individuals from diverse backgrounds, including various races, genders, and ages. It is designed for emotion recognition research, focusing on emotion detection and emotion classification tasks.
By utilizing this dataset, researchers can explore advanced emotion analysis techniques and develop robust recognition models that can accurately identify and classify facial expressions and emotional categories. - Get the data
The dataset includes a wide range of emotional expressions, allowing for comprehensive studies in emotion predictions and expression recognition.
Variables in .csv files:
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (6 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/unidpro/video-emotion-recognition-dataset
๐ Additional information:
==================================
File count not found
Views: 419
Downloads: 51
๐ RELATED NOTEBOOKS:
==================================
1. Speech Emotion Recognition Voice Dataset | Upvotes: 27
URL: https://www.kaggle.com/datasets/tapakah68/emotions-on-audio-dataset
2. Micro-expression Video Data | Upvotes: 3
URL: https://www.kaggle.com/datasets/nexdatafrank/micro-expression-video-data
3. Emotion Video Dataset (700 Speakers, Voice) | Upvotes: 1
URL: https://www.kaggle.com/datasets/maratdv/emotion-video-dataset-700-speakers-voice
==================================
โญ๏ธ By: https://t.me/datasets1
Basic Description: Dataset contains videos of various facial and inner emotions from 1000+ people
๐ FULL DATASET DESCRIPTION:
==================================
Dataset comprises 1,000+ videos featuring 11 facial emotions and 15 inner emotions expressed by individuals from diverse backgrounds, including various races, genders, and ages. It is designed for emotion recognition research, focusing on emotion detection and emotion classification tasks.
By utilizing this dataset, researchers can explore advanced emotion analysis techniques and develop robust recognition models that can accurately identify and classify facial expressions and emotional categories. - Get the data
The dataset includes a wide range of emotional expressions, allowing for comprehensive studies in emotion predictions and expression recognition.
Variables in .csv files:
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (6 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/unidpro/video-emotion-recognition-dataset
๐ Additional information:
==================================
File count not found
Views: 419
Downloads: 51
๐ RELATED NOTEBOOKS:
==================================
1. Speech Emotion Recognition Voice Dataset | Upvotes: 27
URL: https://www.kaggle.com/datasets/tapakah68/emotions-on-audio-dataset
2. Micro-expression Video Data | Upvotes: 3
URL: https://www.kaggle.com/datasets/nexdatafrank/micro-expression-video-data
3. Emotion Video Dataset (700 Speakers, Voice) | Upvotes: 1
URL: https://www.kaggle.com/datasets/maratdv/emotion-video-dataset-700-speakers-voice
==================================
โญ๏ธ By: https://t.me/datasets1
โค1
Dataset Name: [Neur IPS 2020] Data Science for COVID-19 (DS4C)
Basic Description: [NeurIPS 2020] Data Science for COVID-19 (DS4C)
๐ FULL DATASET DESCRIPTION:
==================================
COVID-19 has infected more than 10,000 people in South Korea. KCDC (Korea Centers for Disease Control & Prevention) announces the information of COVID-19 quickly and transparently. We make a structured dataset based on the report materials of KCDC and local governments. Also, we analyze and visualize the data using various data mining or visualization techniques.
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (7 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/kimjihoo/coronavirusdataset
๐ Additional information:
==================================
File count not found
Views: 673,000
Downloads: 143,000
๐ RELATED NOTEBOOKS:
==================================
1. COVID19๐ฆ Explained through Visualizations | Upvotes: 685
URL: https://www.kaggle.com/code/anshuls235/covid19-explained-through-visualizations
2. Covid-19 Detection from Lung X-rays | Upvotes: 587
URL: https://www.kaggle.com/code/eswarchandt/covid-19-detection-from-lung-x-rays
3. Panorama do COVID-19 no Brasil | Upvotes: 388
URL: https://www.kaggle.com/code/elloaguedes/panorama-do-covid-19-no-brasil
4. COVID-19 Country Data | Upvotes: 32
URL: https://www.kaggle.com/datasets/bitsnpieces/covid19-country-data
5. COVID-19 in Korea dataset | Upvotes: 13
URL: https://www.kaggle.com/datasets/hongsean/covid19-in-korea-dataset
==================================
โญ๏ธ By: https://t.me/datasets1
Basic Description: [NeurIPS 2020] Data Science for COVID-19 (DS4C)
๐ FULL DATASET DESCRIPTION:
==================================
COVID-19 has infected more than 10,000 people in South Korea. KCDC (Korea Centers for Disease Control & Prevention) announces the information of COVID-19 quickly and transparently. We make a structured dataset based on the report materials of KCDC and local governments. Also, we analyze and visualize the data using various data mining or visualization techniques.
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (7 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/kimjihoo/coronavirusdataset
๐ Additional information:
==================================
File count not found
Views: 673,000
Downloads: 143,000
๐ RELATED NOTEBOOKS:
==================================
1. COVID19๐ฆ Explained through Visualizations | Upvotes: 685
URL: https://www.kaggle.com/code/anshuls235/covid19-explained-through-visualizations
2. Covid-19 Detection from Lung X-rays | Upvotes: 587
URL: https://www.kaggle.com/code/eswarchandt/covid-19-detection-from-lung-x-rays
3. Panorama do COVID-19 no Brasil | Upvotes: 388
URL: https://www.kaggle.com/code/elloaguedes/panorama-do-covid-19-no-brasil
4. COVID-19 Country Data | Upvotes: 32
URL: https://www.kaggle.com/datasets/bitsnpieces/covid19-country-data
5. COVID-19 in Korea dataset | Upvotes: 13
URL: https://www.kaggle.com/datasets/hongsean/covid19-in-korea-dataset
==================================
โญ๏ธ By: https://t.me/datasets1
โค1
Dataset Name: Fashion Product Images Dataset
Basic Description: 44k products with multiple category labels, descriptions and high-res images.
๐ FULL DATASET DESCRIPTION:
==================================
Thr growing e-commerce industry presents us with a large dataset waiting to be scraped and researched upon. In addition to professionally shot high resolution product images, we also have multiple label attributes describing the product which was manually entered while cataloging. To add to this, we also have descriptive text that comments on the product characteristics.
Each product is identified by an ID like 42431. You will find a map to all the products in styles.csv. From here, you can fetch the image for this product from images/42431.jpg and the complete metadata from styles/42431.json.
To get started easily, we also have exposed some of the key product categories and it's display name in styles.csv.
If this dataset is too large, you can start with a smaller (280MB) version here: https://www.kaggle.com/paramaggarwal/fashion-product-images-small
So what can you try building? Here are some suggestions:
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (25 GB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/paramaggarwal/fashion-product-images-dataset
๐ Additional information:
==================================
Total files: 88,900
Views: 389,000
Downloads: 58,100
๐ RELATED NOTEBOOKS:
==================================
1. Building a Recommendation System Using CNN - v2 | Upvotes: 593
URL: https://www.kaggle.com/code/marlesson/building-a-recommendation-system-using-cnn-v2
2. Visually Similar Product Recommendation | Upvotes: 277
URL: https://www.kaggle.com/code/quadeer15sh/visually-similar-product-recommendation
3. Fashion Products Recommendation System | Upvotes: 259
URL: https://www.kaggle.com/code/basel99/fashion-products-recommendation-system
4. E-commerce Product Images | Upvotes: 83
URL: https://www.kaggle.com/datasets/vikashrajluhaniwal/fashion-images
5. numpy Weights for fashion product images dataset | Upvotes: 3
URL: https://www.kaggle.com/datasets/kalashj16/numpy-weights-for-fashion-product-images-dataset
==================================
โญ๏ธ By: https://t.me/datasets1
Basic Description: 44k products with multiple category labels, descriptions and high-res images.
๐ FULL DATASET DESCRIPTION:
==================================
Thr growing e-commerce industry presents us with a large dataset waiting to be scraped and researched upon. In addition to professionally shot high resolution product images, we also have multiple label attributes describing the product which was manually entered while cataloging. To add to this, we also have descriptive text that comments on the product characteristics.
Each product is identified by an ID like 42431. You will find a map to all the products in styles.csv. From here, you can fetch the image for this product from images/42431.jpg and the complete metadata from styles/42431.json.
To get started easily, we also have exposed some of the key product categories and it's display name in styles.csv.
If this dataset is too large, you can start with a smaller (280MB) version here: https://www.kaggle.com/paramaggarwal/fashion-product-images-small
So what can you try building? Here are some suggestions:
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (25 GB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/paramaggarwal/fashion-product-images-dataset
๐ Additional information:
==================================
Total files: 88,900
Views: 389,000
Downloads: 58,100
๐ RELATED NOTEBOOKS:
==================================
1. Building a Recommendation System Using CNN - v2 | Upvotes: 593
URL: https://www.kaggle.com/code/marlesson/building-a-recommendation-system-using-cnn-v2
2. Visually Similar Product Recommendation | Upvotes: 277
URL: https://www.kaggle.com/code/quadeer15sh/visually-similar-product-recommendation
3. Fashion Products Recommendation System | Upvotes: 259
URL: https://www.kaggle.com/code/basel99/fashion-products-recommendation-system
4. E-commerce Product Images | Upvotes: 83
URL: https://www.kaggle.com/datasets/vikashrajluhaniwal/fashion-images
5. numpy Weights for fashion product images dataset | Upvotes: 3
URL: https://www.kaggle.com/datasets/kalashj16/numpy-weights-for-fashion-product-images-dataset
==================================
โญ๏ธ By: https://t.me/datasets1
โค1๐ฅ1
๐โ๏ธTODAY FREEโ๏ธ๐
Entry to our VIP channel is completely free today. Tomorrow it will cost $500! ๐ฅ
JOIN ๐
https://t.me/+Gc5luJUbfjRkMTk5
https://t.me/+Gc5luJUbfjRkMTk5
https://t.me/+Gc5luJUbfjRkMTk5
Entry to our VIP channel is completely free today. Tomorrow it will cost $500! ๐ฅ
JOIN ๐
https://t.me/+Gc5luJUbfjRkMTk5
https://t.me/+Gc5luJUbfjRkMTk5
https://t.me/+Gc5luJUbfjRkMTk5
Dataset Name: Real / Fake Job Posting Prediction
Basic Description: Dataset of real and fake job postings
๐ FULL DATASET DESCRIPTION:
==================================
This dataset contains 18K job descriptions out of which about 800 are fake. The data consists of both textual information and meta-information about the jobs. The dataset can be used to create classification models which can learn the job descriptions which are fraudulent.
The University of the Aegean | Laboratory of Information & Communication Systems Security http://emscad.samos.aegean.gr/
The dataset is very valuable as it can be used to answer the following questions:
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (17 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/shivamb/real-or-fake-fake-jobposting-prediction
๐ Additional information:
==================================
File count not found
Views: 341,000
Downloads: 41,400
๐ RELATED NOTEBOOKS:
==================================
1. Text Classification using Keras/NB(97% Accuracy) | Upvotes: 292
URL: https://www.kaggle.com/code/madz2000/text-classification-using-keras-nb-97-accuracy
2. Fake Job Post Prediction: Countvec, GloVe, Bert | Upvotes: 193
URL: https://www.kaggle.com/code/vikassingh1996/fake-job-post-prediction-countvec-glove-bert
3. NLP(98%acc.) EDA with model using Spacy & Pipeline | Upvotes: 99
URL: https://www.kaggle.com/code/shivamburnwal/nlp-98-acc-eda-with-model-using-spacy-pipeline
4. Real OR Fake Jobs | Upvotes: 37
URL: https://www.kaggle.com/datasets/whenamancodes/real-or-fake-jobs
5. Job Postings Dataset | Upvotes: 33
URL: https://www.kaggle.com/datasets/moyukhbiswas/job-postings-dataset
==================================
โญ๏ธ By: https://t.me/datasets1
Basic Description: Dataset of real and fake job postings
๐ FULL DATASET DESCRIPTION:
==================================
This dataset contains 18K job descriptions out of which about 800 are fake. The data consists of both textual information and meta-information about the jobs. The dataset can be used to create classification models which can learn the job descriptions which are fraudulent.
The University of the Aegean | Laboratory of Information & Communication Systems Security http://emscad.samos.aegean.gr/
The dataset is very valuable as it can be used to answer the following questions:
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (17 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/shivamb/real-or-fake-fake-jobposting-prediction
๐ Additional information:
==================================
File count not found
Views: 341,000
Downloads: 41,400
๐ RELATED NOTEBOOKS:
==================================
1. Text Classification using Keras/NB(97% Accuracy) | Upvotes: 292
URL: https://www.kaggle.com/code/madz2000/text-classification-using-keras-nb-97-accuracy
2. Fake Job Post Prediction: Countvec, GloVe, Bert | Upvotes: 193
URL: https://www.kaggle.com/code/vikassingh1996/fake-job-post-prediction-countvec-glove-bert
3. NLP(98%acc.) EDA with model using Spacy & Pipeline | Upvotes: 99
URL: https://www.kaggle.com/code/shivamburnwal/nlp-98-acc-eda-with-model-using-spacy-pipeline
4. Real OR Fake Jobs | Upvotes: 37
URL: https://www.kaggle.com/datasets/whenamancodes/real-or-fake-jobs
5. Job Postings Dataset | Upvotes: 33
URL: https://www.kaggle.com/datasets/moyukhbiswas/job-postings-dataset
==================================
โญ๏ธ By: https://t.me/datasets1
โค3
Dataset Name: Chest CT Segmentation
Basic Description: Chest CT scans together with segmentation masks for lung, heart, and trachea.
๐ FULL DATASET DESCRIPTION:
==================================
This dataset was be modified from Lung segmentation dataset by Kรณnya et al., 2020 , https://www.kaggle.com/sandorkonya/ct-lung-heart-trachea-segmentation
The original nrrd files were re-saved in single tensor format with masks corresponding to labels: (lungs, heart, trachea) as numpy arrays using pickle.
Each tensor has the following shape: number of slices, width, height, number of classes, where the width and height number of slices are individual parameters of each tensor id, and number of classes = 3.
In addition, the data was re-saved as RGB images, where each image corresponds to one ID slice, and their mask-images have channels corresponding to three classes: (lung, heart, trachea).
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (2 GB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/polomarco/chest-ct-segmentation
๐ Additional information:
==================================
Total files: 33,700
Views: 35,000
Downloads: 5,442
๐ RELATED NOTEBOOKS:
==================================
1. Chest CT Segmentation | Lung & Heart & Trachea. | Upvotes: 152
URL: https://www.kaggle.com/code/polomarco/chest-ct-segmentation-lung-heart-trachea
2. Chest CT Segmentation Unet | Upvotes: 61
URL: https://www.kaggle.com/code/hossamfakher/chest-ct-segmentation-unet
3. UNET Lung Segmentation Weights for Chest X Rays | Upvotes: 48
URL: https://www.kaggle.com/datasets/farhanhaikhan/unet-lung-segmentation-weights-for-chest-x-rays
4. Aorta Grad | Upvotes: 16
URL: https://www.kaggle.com/code/yousefaborizk/aorta-grad
5. Labeled Internal Features for Chest Xrays | Upvotes: 3
URL: https://www.kaggle.com/datasets/jasoncastiglione/labeled-internal-features-for-chest-xrays
==================================
โญ๏ธ By: https://t.me/datasets1
Basic Description: Chest CT scans together with segmentation masks for lung, heart, and trachea.
๐ FULL DATASET DESCRIPTION:
==================================
This dataset was be modified from Lung segmentation dataset by Kรณnya et al., 2020 , https://www.kaggle.com/sandorkonya/ct-lung-heart-trachea-segmentation
The original nrrd files were re-saved in single tensor format with masks corresponding to labels: (lungs, heart, trachea) as numpy arrays using pickle.
Each tensor has the following shape: number of slices, width, height, number of classes, where the width and height number of slices are individual parameters of each tensor id, and number of classes = 3.
In addition, the data was re-saved as RGB images, where each image corresponds to one ID slice, and their mask-images have channels corresponding to three classes: (lung, heart, trachea).
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (2 GB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/polomarco/chest-ct-segmentation
๐ Additional information:
==================================
Total files: 33,700
Views: 35,000
Downloads: 5,442
๐ RELATED NOTEBOOKS:
==================================
1. Chest CT Segmentation | Lung & Heart & Trachea. | Upvotes: 152
URL: https://www.kaggle.com/code/polomarco/chest-ct-segmentation-lung-heart-trachea
2. Chest CT Segmentation Unet | Upvotes: 61
URL: https://www.kaggle.com/code/hossamfakher/chest-ct-segmentation-unet
3. UNET Lung Segmentation Weights for Chest X Rays | Upvotes: 48
URL: https://www.kaggle.com/datasets/farhanhaikhan/unet-lung-segmentation-weights-for-chest-x-rays
4. Aorta Grad | Upvotes: 16
URL: https://www.kaggle.com/code/yousefaborizk/aorta-grad
5. Labeled Internal Features for Chest Xrays | Upvotes: 3
URL: https://www.kaggle.com/datasets/jasoncastiglione/labeled-internal-features-for-chest-xrays
==================================
โญ๏ธ By: https://t.me/datasets1
โค2
Dataset Name: Fit Bit Fitness Tracker Data
Basic Description: FitBit Fitness Tracker Data
๐ FULL DATASET DESCRIPTION:
==================================
This dataset generated by respondents to a distributed survey via Amazon Mechanical Turk between 03.12.2016-05.12.2016. Thirty eligible Fitbit users consented to the submission of personal tracker data, including minute-level output for physical activity, heart rate, and sleep monitoring. Individual reports can be parsed by export session ID (column A) or timestamp (column B). Variation between output represents use of different types of Fitbit trackers and individual tracking behaviors / preferences.
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (45 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/arashnic/fitbit
๐ Additional information:
==================================
File count not found
Views: 892,000
Downloads: 180,000
๐ RELATED NOTEBOOKS:
==================================
1. Bellabeat Case Study with R | Upvotes: 1,582
URL: https://www.kaggle.com/code/chebotinaa/bellabeat-case-study-with-r
2. Capstone - Case Study Bellabeat | Upvotes: 1,404
URL: https://www.kaggle.com/code/macarenalacasa/capstone-case-study-bellabeat
3. Bellabeat - Case study | Upvotes: 645
URL: https://www.kaggle.com/code/julenaranguren/bellabeat-case-study
4. Fitabase Combined Data 3/12/16-5/12/16 | Upvotes: 17
URL: https://www.kaggle.com/datasets/michaelpetersen2022/fitabase-combined-data-31216-51216
5. Bellabeat Casestudy | Upvotes: 1
URL: https://www.kaggle.com/datasets/yosefsisay/bellabeat-casestudy
==================================
โญ๏ธ By: https://t.me/datasets1
Basic Description: FitBit Fitness Tracker Data
๐ FULL DATASET DESCRIPTION:
==================================
This dataset generated by respondents to a distributed survey via Amazon Mechanical Turk between 03.12.2016-05.12.2016. Thirty eligible Fitbit users consented to the submission of personal tracker data, including minute-level output for physical activity, heart rate, and sleep monitoring. Individual reports can be parsed by export session ID (column A) or timestamp (column B). Variation between output represents use of different types of Fitbit trackers and individual tracking behaviors / preferences.
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (45 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/arashnic/fitbit
๐ Additional information:
==================================
File count not found
Views: 892,000
Downloads: 180,000
๐ RELATED NOTEBOOKS:
==================================
1. Bellabeat Case Study with R | Upvotes: 1,582
URL: https://www.kaggle.com/code/chebotinaa/bellabeat-case-study-with-r
2. Capstone - Case Study Bellabeat | Upvotes: 1,404
URL: https://www.kaggle.com/code/macarenalacasa/capstone-case-study-bellabeat
3. Bellabeat - Case study | Upvotes: 645
URL: https://www.kaggle.com/code/julenaranguren/bellabeat-case-study
4. Fitabase Combined Data 3/12/16-5/12/16 | Upvotes: 17
URL: https://www.kaggle.com/datasets/michaelpetersen2022/fitabase-combined-data-31216-51216
5. Bellabeat Casestudy | Upvotes: 1
URL: https://www.kaggle.com/datasets/yosefsisay/bellabeat-casestudy
==================================
โญ๏ธ By: https://t.me/datasets1
โค4
Forwarded from Machine Learning with Python
This channels is for Programmers, Coders, Software Engineers.
0๏ธโฃ Python
1๏ธโฃ Data Science
2๏ธโฃ Machine Learning
3๏ธโฃ Data Visualization
4๏ธโฃ Artificial Intelligence
5๏ธโฃ Data Analysis
6๏ธโฃ Statistics
7๏ธโฃ Deep Learning
8๏ธโฃ programming Languages
โ
https://t.me/addlist/8_rRW2scgfRhOTc0
โ
https://t.me/Codeprogrammer
Please open Telegram to view this post
VIEW IN TELEGRAM
Dataset Name: Indicators of Heart Disease (2022 UPDATE)
Basic Description: 2022 annual CDC survey data of 400k+ adults related to their health status
๐ FULL DATASET DESCRIPTION:
==================================
According to the CDC, heart disease is a leading cause of death for people of most races in the U.S. (African Americans, American Indians and Alaska Natives, and whites). About half of all Americans (47%) have at least 1 of 3 major risk factors for heart disease: high blood pressure, high cholesterol, and smoking. Other key indicators include diabetes status, obesity (high BMI), not getting enough physical activity, or drinking too much alcohol. Identifying and preventing the factors that have the greatest impact on heart disease is very important in healthcare. In turn, developments in computing allow the application of machine learning methods to detect "patterns" in the data that can predict a patient's condition.
The dataset originally comes from the CDC and is a major part of the Behavioral Risk Factor Surveillance System (BRFSS), which conducts annual telephone surveys to collect data on the health status of U.S. residents. As described by the CDC: "Established in 1984 with 15 states, BRFSS now collects data in all 50 states, the District of Columbia, and three U.S. territories. BRFSS completes more than 400,000 adult interviews each year, making it the largest continuously conducted health survey system in the world. The most recent dataset includes data from 2023. In this dataset, I noticed many factors (questions) that directly or indirectly influence heart disease, so I decided to select the most relevant variables from it. I also decided to share with you two versions of the most recent dataset: with NaNs and without it.
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (22 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/kamilpytlak/personal-key-indicators-of-heart-disease
๐ Additional information:
==================================
File count not found
Views: 503,000
Downloads: 90,500
๐ RELATED NOTEBOOKS:
==================================
1. Heart Disease. Exploratory data analysis. | Upvotes: 981
URL: https://www.kaggle.com/code/georgyzubkov/heart-disease-exploratory-data-analysis
2. Diabetes Health Indicators Dataset | Upvotes: 771
URL: https://www.kaggle.com/datasets/alexteboul/diabetes-health-indicators-dataset
3. Heart Disease Prediction | Upvotes: 736
URL: https://www.kaggle.com/code/andls555/heart-disease-prediction
4. Advance Data Preprocessing | Upvotes: 711
URL: https://www.kaggle.com/code/nkitgupta/advance-data-preprocessing
5. Coronary Heart Disease Prediction in Ten Years | Upvotes: 2
URL: https://www.kaggle.com/datasets/palakdoshijain/coronary-heart-disease-prediction-in-ten-years
==================================
โญ๏ธ By: https://t.me/datasets1
Basic Description: 2022 annual CDC survey data of 400k+ adults related to their health status
๐ FULL DATASET DESCRIPTION:
==================================
According to the CDC, heart disease is a leading cause of death for people of most races in the U.S. (African Americans, American Indians and Alaska Natives, and whites). About half of all Americans (47%) have at least 1 of 3 major risk factors for heart disease: high blood pressure, high cholesterol, and smoking. Other key indicators include diabetes status, obesity (high BMI), not getting enough physical activity, or drinking too much alcohol. Identifying and preventing the factors that have the greatest impact on heart disease is very important in healthcare. In turn, developments in computing allow the application of machine learning methods to detect "patterns" in the data that can predict a patient's condition.
The dataset originally comes from the CDC and is a major part of the Behavioral Risk Factor Surveillance System (BRFSS), which conducts annual telephone surveys to collect data on the health status of U.S. residents. As described by the CDC: "Established in 1984 with 15 states, BRFSS now collects data in all 50 states, the District of Columbia, and three U.S. territories. BRFSS completes more than 400,000 adult interviews each year, making it the largest continuously conducted health survey system in the world. The most recent dataset includes data from 2023. In this dataset, I noticed many factors (questions) that directly or indirectly influence heart disease, so I decided to select the most relevant variables from it. I also decided to share with you two versions of the most recent dataset: with NaNs and without it.
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (22 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/kamilpytlak/personal-key-indicators-of-heart-disease
๐ Additional information:
==================================
File count not found
Views: 503,000
Downloads: 90,500
๐ RELATED NOTEBOOKS:
==================================
1. Heart Disease. Exploratory data analysis. | Upvotes: 981
URL: https://www.kaggle.com/code/georgyzubkov/heart-disease-exploratory-data-analysis
2. Diabetes Health Indicators Dataset | Upvotes: 771
URL: https://www.kaggle.com/datasets/alexteboul/diabetes-health-indicators-dataset
3. Heart Disease Prediction | Upvotes: 736
URL: https://www.kaggle.com/code/andls555/heart-disease-prediction
4. Advance Data Preprocessing | Upvotes: 711
URL: https://www.kaggle.com/code/nkitgupta/advance-data-preprocessing
5. Coronary Heart Disease Prediction in Ten Years | Upvotes: 2
URL: https://www.kaggle.com/datasets/palakdoshijain/coronary-heart-disease-prediction-in-ten-years
==================================
โญ๏ธ By: https://t.me/datasets1
โค1
Forwarded from Machine Learning
๐ฅ Trending Repository: awesome-public-datasets
๐ Description: A topic-centric list of HQ open datasets.
๐ Repository URL: https://github.com/awesomedata/awesome-public-datasets
๐ Website: https://awesomedataworld.slack.com
๐ Readme: https://github.com/awesomedata/awesome-public-datasets#readme
๐ Statistics:
๐ Stars: 64.6K stars
๐ Watchers: 2.3k
๐ด Forks: 10.3K forks
๐ป Programming Languages: Not available
๐ท๏ธ Related Topics:
==================================
๐ง By: https://t.me/DataScienceM
๐ Description: A topic-centric list of HQ open datasets.
๐ Repository URL: https://github.com/awesomedata/awesome-public-datasets
๐ Website: https://awesomedataworld.slack.com
๐ Readme: https://github.com/awesomedata/awesome-public-datasets#readme
๐ Statistics:
๐ Stars: 64.6K stars
๐ Watchers: 2.3k
๐ด Forks: 10.3K forks
๐ป Programming Languages: Not available
๐ท๏ธ Related Topics:
#opendata #datasets #aaron_swartz #awesome_public_datasets
==================================
๐ง By: https://t.me/DataScienceM
โค1
Dataset Name: Energy consumption of the Netherlands
Basic Description: Electricity and Gas consumed in the Netherlands every year
๐ FULL DATASET DESCRIPTION:
==================================
The energy network of the Netherlands is managed by a few companies. Every year, these companies release on their websites a table with the energy consumption of the areas under their administration. The companies are
The data are anonymized by aggregating the Zipcodes so that every entry describes at least 10 connections.
This market is not competitive, meaning that the zones are assigned. This means that every year they roughly provide energy to the same zipcodes. Small changes can happen from year to year either for a change of management or for a different aggregation of zipcodes.
Every file contains information about groups of zipcodes managed by one of the three companies for a specific year.
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (146 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/lucabasa/dutch-energy
๐ Additional information:
==================================
File count not found
Views: 139,000
Downloads: 111,000
๐ RELATED NOTEBOOKS:
==================================
1. ใฝ๏ธ|3๏ธโฃWays to Deal with Time Series Forecasting | Upvotes: 296
URL: https://www.kaggle.com/code/mfaaris/3-ways-to-deal-with-time-series-forecasting
2. Dutch electricity: EDA, FS, clustering, maps | Upvotes: 199
URL: https://www.kaggle.com/code/bberghuis/dutch-electricity-eda-fs-clustering-maps
3. โก๏ธEnergy EDA๐, Segmentation and Prediction๐งฎ | Upvotes: 86
URL: https://www.kaggle.com/code/ihsncnkz/energy-eda-segmentation-and-prediction
4. Western Europe Power Consumption | Upvotes: 34
URL: https://www.kaggle.com/datasets/francoisraucent/western-europe-power-consumption
5. Monthly Electricity Production in GWh [2010-2022] | Upvotes: 26
URL: https://www.kaggle.com/datasets/ccanb23/iea-monthly-electricity-statistics
==================================
โญ๏ธ By: https://t.me/datasets1
Basic Description: Electricity and Gas consumed in the Netherlands every year
๐ FULL DATASET DESCRIPTION:
==================================
The energy network of the Netherlands is managed by a few companies. Every year, these companies release on their websites a table with the energy consumption of the areas under their administration. The companies are
The data are anonymized by aggregating the Zipcodes so that every entry describes at least 10 connections.
This market is not competitive, meaning that the zones are assigned. This means that every year they roughly provide energy to the same zipcodes. Small changes can happen from year to year either for a change of management or for a different aggregation of zipcodes.
Every file contains information about groups of zipcodes managed by one of the three companies for a specific year.
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (146 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/lucabasa/dutch-energy
๐ Additional information:
==================================
File count not found
Views: 139,000
Downloads: 111,000
๐ RELATED NOTEBOOKS:
==================================
1. ใฝ๏ธ|3๏ธโฃWays to Deal with Time Series Forecasting | Upvotes: 296
URL: https://www.kaggle.com/code/mfaaris/3-ways-to-deal-with-time-series-forecasting
2. Dutch electricity: EDA, FS, clustering, maps | Upvotes: 199
URL: https://www.kaggle.com/code/bberghuis/dutch-electricity-eda-fs-clustering-maps
3. โก๏ธEnergy EDA๐, Segmentation and Prediction๐งฎ | Upvotes: 86
URL: https://www.kaggle.com/code/ihsncnkz/energy-eda-segmentation-and-prediction
4. Western Europe Power Consumption | Upvotes: 34
URL: https://www.kaggle.com/datasets/francoisraucent/western-europe-power-consumption
5. Monthly Electricity Production in GWh [2010-2022] | Upvotes: 26
URL: https://www.kaggle.com/datasets/ccanb23/iea-monthly-electricity-statistics
==================================
โญ๏ธ By: https://t.me/datasets1
โค6
Dataset Name: Tuberculosis (TB) Chest X-ray Database
Basic Description: The largest TB Chest X-ray Database
๐ FULL DATASET DESCRIPTION:
==================================
Tuberculosis (TB) Chest X-ray Database A team of researchers from Qatar University, Doha, Qatar, and the University of Dhaka, Bangladesh along with their collaborators from Malaysia in collaboration with medical doctors from Hamad Medical Corporation and Bangladesh have created a database of chest X-ray images for Tuberculosis (TB) positive cases along with Normal images. In our current release, there are 700 TB images publicly accessible and 2800 TB images can be downloaded from NIAID TB portal[3] by signing an agreement, and 3500 normal images.
Note: -The research team managed to classify TB and Normal Chest X-ray images with an accuracy of 98.3%. This scholarly work is published in IEEE Access. Please make sure you give credit to us while using the dataset, code, and trained models.
Credit should go to the following: Tawsifur Rahman, Amith Khandakar, Muhammad A. Kadir, Khandaker R. Islam, Khandaker F. Islam, Zaid B. Mahbub, Mohamed Arselene Ayari, Muhammad E. H. Chowdhury. (2020) "Reliable Tuberculosis Detection using Chest X-ray with Deep Learning, Segmentation and Visualization". IEEE Access, Vol. 8, pp 191586 - 191601. DOI. 10.1109/ACCESS.2020.3031384. Paper Link
To view images please check image folders and references of each image are provided in the metadata.csv.
Research Team members and their affiliation Muhammad E. H. Chowdhury, PhD (mchowdhury@qu.edu.qa) Department of Electrical Engineering, Qatar University, Doha-2713, Qatar Tawsifur Rahman (tawsifurrahman.1426@gmail.com) Department of Electrical Engineering, Qatar University, Doha-2713, Qatar Amith Khandakar (amitk@qu.edu.qa) Department of Electrical Engineering, Qatar University, Doha-2713, Qatar Rashid Mazhar, MD Thoracic Surgery, Hamad General Hospital, Doha-3050, Qatar Muhammad Abdul Kadir, PhD Department of Biomedical Physics & Technology, University of Dhaka, Dhaka-1000, Bangladesh Zaid Bin Mahbub, PhD Department of Mathematics and Physics, North South University, Dhaka-1229, Bangladesh Khandakar R. Islam, MD Department of Orthodontics, Bangabandhu Sheikh Mujib Medical University, Dhaka-1000, Bangladesh
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (696 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/tawsifurrahman/tuberculosis-tb-chest-xray-dataset
๐ Additional information:
==================================
File count not found
Views: 171,000
Downloads: 29,200
๐ RELATED NOTEBOOKS:
==================================
1. Tuberculosis_classification_DenseNet121_GradCAM | Upvotes: 219
URL: https://www.kaggle.com/code/sanphats/tuberculosis-classification-densenet121-gradcam
2. Pneumonia Detections Using Deep Learning | Upvotes: 214
URL: https://www.kaggle.com/code/chanchal24/pneumonia-detections-using-deep-learning
3. tuberculosis_0.99 accuracy | Upvotes: 168
URL: https://www.kaggle.com/code/akshayr009/tuberculosis-0-99-accuracy
4. DA and DB - TB Chest X-ray Datasets | Upvotes: 4
URL: https://www.kaggle.com/datasets/vbookshelf/da-and-db-tb-chest-x-ray-datasets
5. Dataset (Covid-Bacterial-Viral-Normal-Emphysema) | Upvotes: 1
URL: https://www.kaggle.com/datasets/minhnhat232/dataset-covid-bacterial-viral-normal-emphysema
==================================
โญ๏ธ By: https://t.me/datasets1
Basic Description: The largest TB Chest X-ray Database
๐ FULL DATASET DESCRIPTION:
==================================
Tuberculosis (TB) Chest X-ray Database A team of researchers from Qatar University, Doha, Qatar, and the University of Dhaka, Bangladesh along with their collaborators from Malaysia in collaboration with medical doctors from Hamad Medical Corporation and Bangladesh have created a database of chest X-ray images for Tuberculosis (TB) positive cases along with Normal images. In our current release, there are 700 TB images publicly accessible and 2800 TB images can be downloaded from NIAID TB portal[3] by signing an agreement, and 3500 normal images.
Note: -The research team managed to classify TB and Normal Chest X-ray images with an accuracy of 98.3%. This scholarly work is published in IEEE Access. Please make sure you give credit to us while using the dataset, code, and trained models.
Credit should go to the following: Tawsifur Rahman, Amith Khandakar, Muhammad A. Kadir, Khandaker R. Islam, Khandaker F. Islam, Zaid B. Mahbub, Mohamed Arselene Ayari, Muhammad E. H. Chowdhury. (2020) "Reliable Tuberculosis Detection using Chest X-ray with Deep Learning, Segmentation and Visualization". IEEE Access, Vol. 8, pp 191586 - 191601. DOI. 10.1109/ACCESS.2020.3031384. Paper Link
To view images please check image folders and references of each image are provided in the metadata.csv.
Research Team members and their affiliation Muhammad E. H. Chowdhury, PhD (mchowdhury@qu.edu.qa) Department of Electrical Engineering, Qatar University, Doha-2713, Qatar Tawsifur Rahman (tawsifurrahman.1426@gmail.com) Department of Electrical Engineering, Qatar University, Doha-2713, Qatar Amith Khandakar (amitk@qu.edu.qa) Department of Electrical Engineering, Qatar University, Doha-2713, Qatar Rashid Mazhar, MD Thoracic Surgery, Hamad General Hospital, Doha-3050, Qatar Muhammad Abdul Kadir, PhD Department of Biomedical Physics & Technology, University of Dhaka, Dhaka-1000, Bangladesh Zaid Bin Mahbub, PhD Department of Mathematics and Physics, North South University, Dhaka-1229, Bangladesh Khandakar R. Islam, MD Department of Orthodontics, Bangabandhu Sheikh Mujib Medical University, Dhaka-1000, Bangladesh
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (696 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/tawsifurrahman/tuberculosis-tb-chest-xray-dataset
๐ Additional information:
==================================
File count not found
Views: 171,000
Downloads: 29,200
๐ RELATED NOTEBOOKS:
==================================
1. Tuberculosis_classification_DenseNet121_GradCAM | Upvotes: 219
URL: https://www.kaggle.com/code/sanphats/tuberculosis-classification-densenet121-gradcam
2. Pneumonia Detections Using Deep Learning | Upvotes: 214
URL: https://www.kaggle.com/code/chanchal24/pneumonia-detections-using-deep-learning
3. tuberculosis_0.99 accuracy | Upvotes: 168
URL: https://www.kaggle.com/code/akshayr009/tuberculosis-0-99-accuracy
4. DA and DB - TB Chest X-ray Datasets | Upvotes: 4
URL: https://www.kaggle.com/datasets/vbookshelf/da-and-db-tb-chest-x-ray-datasets
5. Dataset (Covid-Bacterial-Viral-Normal-Emphysema) | Upvotes: 1
URL: https://www.kaggle.com/datasets/minhnhat232/dataset-covid-bacterial-viral-normal-emphysema
==================================
โญ๏ธ By: https://t.me/datasets1
โค5
Dataset Name: NBA Database
Basic Description: Daily Updated SQLite Database โ 64,000+ Games, 4800+ Players, and 30 Teams ๐
๐ FULL DATASET DESCRIPTION:
==================================
This dataset is updated daily and includes:
โฎ View the and โฎ Sponsor project:
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (731 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/wyattowalsh/basketball
๐ Additional information:
==================================
File count not found
Views: 353,000
Downloads: 46,700
๐ RELATED NOTEBOOKS:
==================================
1. Historic NBA Drafting, Game, and Player Analysis | Upvotes: 263
URL: https://www.kaggle.com/code/agilesifaka/historic-nba-drafting-game-and-player-analysis
2. NBA Stats (1947-present) | Upvotes: 195
URL: https://www.kaggle.com/datasets/sumitrodatta/nba-aba-baa-stats
3. Database Updater (Daily) | Upvotes: 58
URL: https://www.kaggle.com/code/wyattowalsh/database-updater-daily
4. Using SQL | Upvotes: 58
URL: https://www.kaggle.com/code/wyattowalsh/using-sql
5. NBA Games Box Score Since 1949 | Upvotes: 11
URL: https://www.kaggle.com/datasets/rafaelgreca/nba-games-box-score-since-1949
==================================
โญ๏ธ By: https://t.me/datasets1
Basic Description: Daily Updated SQLite Database โ 64,000+ Games, 4800+ Players, and 30 Teams ๐
๐ FULL DATASET DESCRIPTION:
==================================
This dataset is updated daily and includes:
โฎ View the and โฎ Sponsor project:
๐ฅ DATASET DOWNLOAD INFORMATION
==================================
๐ด Dataset Size: Download dataset as zip (731 MB)
๐ฐ Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/wyattowalsh/basketball
๐ Additional information:
==================================
File count not found
Views: 353,000
Downloads: 46,700
๐ RELATED NOTEBOOKS:
==================================
1. Historic NBA Drafting, Game, and Player Analysis | Upvotes: 263
URL: https://www.kaggle.com/code/agilesifaka/historic-nba-drafting-game-and-player-analysis
2. NBA Stats (1947-present) | Upvotes: 195
URL: https://www.kaggle.com/datasets/sumitrodatta/nba-aba-baa-stats
3. Database Updater (Daily) | Upvotes: 58
URL: https://www.kaggle.com/code/wyattowalsh/database-updater-daily
4. Using SQL | Upvotes: 58
URL: https://www.kaggle.com/code/wyattowalsh/using-sql
5. NBA Games Box Score Since 1949 | Upvotes: 11
URL: https://www.kaggle.com/datasets/rafaelgreca/nba-games-box-score-since-1949
==================================
โญ๏ธ By: https://t.me/datasets1
โค8