Kaggle Data Hub
29.2K subscribers
947 photos
15 videos
309 files
1.21K links
Your go-to hub for Kaggle datasets – explore, analyze, and leverage data for Machine Learning and Data Science projects.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
Dataset Name: A Large Scale Fish Dataset
Basic Description: A Large-Scale Dataset for Fish Segmentation and Classification

πŸ“– FULL DATASET DESCRIPTION:
==================================
A Large-Scale Dataset for Segmentation and Classification
Authors: O. Ulucan, D. Karakaya, M. Turkan Department of Electrical and Electronics Engineering, Izmir University of Economics, Izmir, Turkey Corresponding author: M. Turkan Contact Information: mehmet.turkan@ieu.edu.tr
General Introduction
This dataset contains 9 different seafood types collected from a supermarket in Izmir, Turkey for a university-industry collaboration project at Izmir University of Economics, and this work was published in ASYU 2020. The dataset includes gilt head bream, red sea bream, sea bass, red mullet, horse mackerel, black sea sprat, striped red mullet, trout, shrimp image samples.
If you use this dataset in your work, please consider to cite:
@inproceedings{ulucan2020large, title={A Large-Scale Dataset for Fish Segmentation and Classification}, author={Ulucan, Oguzhan and Karakaya, Diclehan and Turkan, Mehmet}, booktitle={2020 Innovations in Intelligent Systems and Applications Conference (ASYU)}, pages={1--5}, year={2020}, organization={IEEE} }

πŸ“₯ DATASET DOWNLOAD INFORMATION
==================================

πŸ”΄ Dataset Size: Download dataset as zip (3 GB)

πŸ”° Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/crowww/a-large-scale-fish-dataset

πŸ“Š Additional information:
==================================
Total files: 18,400
Views: 290,000
Downloads: 30,800

πŸ“š RELATED NOTEBOOKS:
==================================
1. Fish classifier & Grad-CAM viz (acc. 99,89%)🐟 | Upvotes: 397
URL: https://www.kaggle.com/code/databeru/fish-classifier-grad-cam-viz-acc-99-89

2. Fish Analysis 🐠🐠🐑🐑 ♓️ ♓️ | Upvotes: 308
URL: https://www.kaggle.com/code/fahadmehfoooz/fish-analysis

3. 🐟 Fish Image Species Classification | Upvotes: 215
URL: https://www.kaggle.com/code/gcdatkin/fish-image-species-classification

4. Fish Dataset | Upvotes: 52
URL: https://www.kaggle.com/datasets/markdaniellampa/fish-dataset

5. Tilapia Fresh and Non Fresh Image Dataset | Upvotes: 6
URL: https://www.kaggle.com/datasets/haripriyasanga/tilapia-fish-fresh-and-non-fresh-species

==================================
⭐️ By: https://t.me/datasets1
Dataset Name: CT KIDNEY DATASET: Normal-Cyst-Tumor and Stone
Basic Description: Dataset to detect auto Kidney Disease Analysis

πŸ“– FULL DATASET DESCRIPTION:
==================================
CT KIDNEY DATASET: Normal-Cyst-Tumor and Stone
The dataset was collected from PACS (Picture archiving and communication system) from different hospitals in Dhaka, Bangladesh where patients were already diagnosed with having a kidney tumor, cyst, normal or stone findings. Both the Coronal and Axial cuts were selected from both contrast and non-contrast studies with protocol for the whole abdomen and urogram. The Dicom study was then carefully selected, one diagnosis at a time, and from those we created a batch of Dicom images of the region of interest for each radiological finding. Following that, we excluded each patient's information and meta data from the Dicom images and converted the Dicom images to a lossless jpg image format. After the conversion, each image finding was again verified by a radiologist and a medical technologist to reconfirm the correctness of the data.
Our created dataset contains 12,446 unique data within it in which the cyst contains 3,709, normal 5,077, stone 1,377, and tumor 2,283
Kindly Cite if you are finding this helpful-
Islam MN, Hasan M, Hossain M, Alam M, Rabiul G, Uddin MZ, Soylu A. Vision transformer and explainable transfer learning models for auto detection of kidney cyst, stone and tumor from CT-radiography. Scientific Reports. 2022 Jul 6;12(1):1-4.
Thanks to Mehedi Hasan, Medical Technologist, who assisted to gather all the data from different hospitals.

πŸ“₯ DATASET DOWNLOAD INFORMATION
==================================

πŸ”΄ Dataset Size: Download dataset as zip (2 GB)

πŸ”° Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/nazmul0087/ct-kidney-dataset-normal-cyst-tumor-and-stone

πŸ“Š Additional information:
==================================
Total files: 12,400
Views: 114,000
Downloads: 24,500

πŸ“š RELATED NOTEBOOKS:
==================================
1. KIDNEY-diseases 0.999 accuracy | Upvotes: 132
URL: https://www.kaggle.com/code/akshayr009/kidney-diseases-0-999-accuracy

2. KidneyVision | Upvotes: 111
URL: https://www.kaggle.com/code/atifaliak/kidneyvision

3. Kidney Disease Classifier With 99% (CNN) | Upvotes: 109
URL: https://www.kaggle.com/code/ahmedbadr22/kidney-disease-classifier-with-99-cnn

4. Kidney Stone Images with Bounding Box Annotations | Upvotes: 69
URL: https://www.kaggle.com/datasets/safurahajiheidari/kidney-stone-images

5. Kidney Stone | Classification and Object Detection | Upvotes: 26
URL: https://www.kaggle.com/datasets/imtkaggleteam/kidney-stone-classification-and-object-detection

==================================
⭐️ By: https://t.me/datasets1
❀2πŸ”₯2
Dataset Name: COVID19 Tweets
Basic Description: Tweets with the hashtag #covid19

πŸ“– FULL DATASET DESCRIPTION:
==================================
These tweets are collected using Twitter API and a Python script. A query for this high-frequency hashtag (#covid19) is run on a daily basis for a certain time period, to collect a larger number of tweets samples.
The collection script can be found here: https://github.com/gabrielpreda/covid-19-tweets
The tweets have #covid19 hashtag. Collection started on 25/7/2020, with an initial 17k batch and will continue on a daily basis.
You can use this data to dive into the subjects that use this hashtag, look to the geographical distribution, evaluate sentiments, looks to trends.

πŸ“₯ DATASET DOWNLOAD INFORMATION
==================================

πŸ”΄ Dataset Size: Download dataset as zip (29 MB)

πŸ”° Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/gpreda/covid19-tweets

πŸ“Š Additional information:
==================================
File count not found
Views: 199,000
Downloads: 25,400

πŸ“š RELATED NOTEBOOKS:
==================================
1. 🦠COVID-19: Sentiment Analysis & Social Networks | Upvotes: 546
URL: https://www.kaggle.com/code/andradaolteanu/covid-19-sentiment-analysis-social-networks

2. Text-Representations | Upvotes: 424
URL: https://www.kaggle.com/code/nkitgupta/text-representations

3. Covid 19 tweet sentiment analysis | Upvotes: 246
URL: https://www.kaggle.com/code/alankritamishra/covid-19-tweet-sentiment-analysis

4. Black Friday Tweets | Upvotes: 18
URL: https://www.kaggle.com/datasets/mathurinache/black-friday-tweets

5. COVID-19 Tweets (Second Wave) | Upvotes: 9
URL: https://www.kaggle.com/datasets/himanshutripathi/covid19-tweets-second-wave

==================================
⭐️ By: https://t.me/datasets1
Dataset Name: Underwater Object Detection Dataset
Basic Description: Yolov5 PyTorch format underwater life dataset for object detection

πŸ“– FULL DATASET DESCRIPTION:
==================================
The dataset contains 7 classes of underwater creatures with provided bboxes locations for every animal. The dataset is already split into the train, validation, and test sets.
It includes 638 images.
The following pre-processing was applied to each image:

πŸ“₯ DATASET DOWNLOAD INFORMATION
==================================

πŸ”΄ Dataset Size: Download dataset as zip (70 MB)

πŸ”° Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/slavkoprytula/aquarium-data-cots

πŸ“Š Additional information:
==================================
File count not found
Views: 34,600
Downloads: 7,348

πŸ“š RELATED NOTEBOOKS:
==================================
1. Underwater_Object_Detection_with_YOLO_v8 | Upvotes: 103
URL: https://www.kaggle.com/code/quydau/underwater-object-detection-with-yolo-v8

2. Underwater Object Detection | Upvotes: 99
URL: https://www.kaggle.com/code/ahmedabdelkhaleq/underwater-object-detection

3. Underwater Object Detection with Faster R-CNN | Upvotes: 64
URL: https://www.kaggle.com/code/lowmist/underwater-object-detection-with-faster-r-cnn

4. Penguins vs Turtles | Upvotes: 34
URL: https://www.kaggle.com/datasets/abbymorgan/penguins-vs-turtles

5. Underwater Dataset | Upvotes: 11
URL: https://www.kaggle.com/datasets/akshatsng/underwater-dataset-for-8-classes-with-label

==================================
⭐️ By: https://t.me/datasets1
❀4
Dataset Name: Diabetes dataset
Basic Description: Diabetes_updated_Dataset

πŸ“– FULL DATASET DESCRIPTION:
==================================
There are 2 types of diabetes viz. insulin-dependent diabetes mellitus (IDDM)/Type-I diabetes and non-insulin-dependent diabetes mellitus (NIDDM)/Type-II diabetes. Type-I is a disorder of carbohydrate metabolism due to insufficient insulin secretion which could be hereditary or acquired. Type-II diabetes is a condition in which the sensitivity of body cells to insulin gets reduced.
The dataset contains information about Pima Indian women, and it is often used to build predictive models to determine whether a person has diabetes based on certain features or risk factors. The dataset includes the following attributes:
Pregnancies: Number of times the woman has been pregnant. Glucose: Plasma glucose concentration in an oral glucose tolerance test. BloodPressure: Diastolic blood pressure (mm Hg). SkinThickness: Triceps skinfold thickness (mm). Insulin: 2-Hour serum insulin (mu U/ml). BMI: Body mass index (weight in kg / (height in meters)^2). DiabetesPedigreeFunction: A function that scores the likelihood of diabetes based on family history. Age: Age in years. Outcome: The target variable; 0 for no diabetes, 1 for diabetes.

πŸ“₯ DATASET DOWNLOAD INFORMATION
==================================

πŸ”΄ Dataset Size: Download dataset as zip (9 kB)

πŸ”° Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/ashishkumarjayswal/diabetes-dataset

πŸ“Š Additional information:
==================================
File count not found
Views: 6,459
Downloads: 1,393

πŸ“š RELATED NOTEBOOKS:
==================================
1. Diabetes Dataset | Upvotes: 74
URL: https://www.kaggle.com/datasets/hasibur013/diabetes-dataset

2. India Diabetes Prediction | Upvotes: 19
URL: https://www.kaggle.com/code/ashishkumarjayswal/india-diabetes-prediction

3. Diabets Notebook | Upvotes: 14
URL: https://www.kaggle.com/code/cauelias/diabets-notebook

4. Diabetes Prediction | Upvotes: 9
URL: https://www.kaggle.com/code/harshitaaswani/diabetes-prediction

5. Diabetes pima-indians-diabetes-database | Upvotes: 5
URL: https://www.kaggle.com/datasets/imkrkannan/diabetes-pimaindiansdiabetesdatabase

==================================
⭐️ By: https://t.me/datasets1
Dataset Name: Air Quality Dataset
Basic Description: Hourly averaged responses from an array of 5 metal oxide chemical sensors

πŸ“– FULL DATASET DESCRIPTION:
==================================
This dataset contains the responses of a gas multisensor device deployed on the field in an Italian city. Hourly responses averages are recorded along with gas concentrations references from a certified analyzer. This dataset was taken from UCI Machine Learning Repository: https://archive.ics.uci.edu/ml/index.php
The dataset contains 9357 instances of hourly averaged responses from an array of 5 metal oxide chemical sensors embedded in an Air Quality Chemical Multisensor Device. The device was located on the field in a significantly polluted area, at road level,within an Italian city. Data were recorded from March 2004 to February 2005 (one year) representing the longest freely available recordings of on field deployed air quality chemical sensor devices responses. Ground Truth hourly averaged concentrations for CO, Non Metanic Hydrocarbons, Benzene, Total Nitrogen Oxides (NOx) and Nitrogen Dioxide (NO2) and were provided by a co-located reference certified analyzer. Evidences of cross-sensitivities as well as both concept and sensor drifts are present as described in De Vito et al., Sens. And Act. B, Vol. 129,2,2008 (citation required) eventually affecting sensors concentration estimation capabilities. Missing values are tagged with -200 value. This dataset can be used exclusively for research purposes. Commercial purposes are fully excluded.
0 Date (DD/MM/YYYY) 1 Time (HH.MM.SS) 2 True hourly averaged concentration CO in mg/m^3 (reference analyzer) 3 PT08.S1 (tin oxide) hourly averaged sensor response (nominally CO targeted) 4 True hourly averaged overall Non Metanic HydroCarbons concentration in microg/m^3 (reference analyzer) 5 True hourly averaged Benzene concentration in microg/m^3 (reference analyzer) 6 PT08.S2 (titania) hourly averaged sensor response (nominally NMHC targeted) 7 True hourly averaged NOx concentration in ppb (reference analyzer) 8 PT08.S3 (tungsten oxide) hourly averaged sensor response (nominally NOx targeted) 9 True hourly averaged NO2 concentration in microg/m^3 (reference analyzer) 10 PT08.S4 (tungsten oxide) hourly averaged sensor response (nominally NO2 targeted) 11 PT08.S5 (indium oxide) hourly averaged sensor response (nominally O3 targeted) 12 Temperature in °C 13 Relative Humidity (%) 14 AH Absolute Humidity

πŸ“₯ DATASET DOWNLOAD INFORMATION
==================================

πŸ”΄ Dataset Size: Download dataset as zip (254 kB)

πŸ”° Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/fedesoriano/air-quality-data-set

πŸ“Š Additional information:
==================================
File count not found
Views: 191,000
Downloads: 32,700

πŸ“š RELATED NOTEBOOKS:
==================================
1. How to approach a dataset (EDA)- Learn With Me | Upvotes: 66
URL: https://www.kaggle.com/code/prakharjadaun/how-to-approach-a-dataset-eda-learn-with-me

2. Air_Q_Dataset_Exploratory_Analysis | Upvotes: 58
URL: https://www.kaggle.com/code/xande42/air-q-dataset-exploratory-analysis

3. air quality dataset | Upvotes: 49
URL: https://www.kaggle.com/datasets/tawfikelmetwally/air-quality-dataset

4. EDA_LAB01_ANN_Example | Upvotes: 32
URL: https://www.kaggle.com/code/shahidzikria/eda-lab01-ann-example

5. UCI ML Air Quality Dataset | Upvotes: 17
URL: https://www.kaggle.com/datasets/nishantbhadauria/datasetucimlairquality

==================================
⭐️ By: https://t.me/datasets1
❀5
This channels is for Programmers, Coders, Software Engineers.

0️⃣ Python
1️⃣ Data Science
2️⃣ Machine Learning
3️⃣ Data Visualization
4️⃣ Artificial Intelligence
5️⃣ Data Analysis
6️⃣ Statistics
7️⃣ Deep Learning
8️⃣ programming Languages

βœ… https://t.me/addlist/8_rRW2scgfRhOTc0

βœ… https://t.me/Codeprogrammer
Please open Telegram to view this post
VIEW IN TELEGRAM
Dataset Name: World Strat
Basic Description: 10,000km² high-resolution+low-res satellite imagery covering the 🌎🌍🌏

πŸ“– FULL DATASET DESCRIPTION:
==================================
This Kaggle upload holds only the "core" subset of the data due to the upload size limitations.
Nearly 10,000 kmΒ² of free high-resolution and matched low-resolution satellite imagery of unique locations which ensure stratified representation of all types of land-use across the world: from agriculture to ice caps, from forests to multiple urbanization densities.

πŸ“₯ DATASET DOWNLOAD INFORMATION
==================================

πŸ”΄ Dataset Size: Download dataset as zip (53 GB)

πŸ”° Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/jucor1/worldstrat

πŸ“Š Additional information:
==================================
Total files: 217,000
Views: 9,819
Downloads: 3,343

πŸ“š RELATED NOTEBOOKS:
==================================
1. Dataset exploration | Upvotes: 44
URL: https://www.kaggle.com/code/ivanorsolic/dataset-exploration

2. Gaofen-2 satellite images - Five Billion Pixels | Upvotes: 9
URL: https://www.kaggle.com/datasets/aletbm/gaofen-satellite-images-five-billion-pixels

3. TheMiniFranceSuite | Upvotes: 9
URL: https://www.kaggle.com/datasets/javidtheimmortal/minifrance

4. WorldStrat_HR | Upvotes: 5
URL: https://www.kaggle.com/code/hseyinacemli/worldstrat-hr

5. Landshapes-4041 | Upvotes: 3
URL: https://www.kaggle.com/datasets/ueberf/sentinel-5k-truecolor

==================================
⭐️ By: https://t.me/datasets1
Dataset Name: Syrian-car-plates Dataset
Basic Description: Description not found

πŸ“– FULL DATASET DESCRIPTION:
==================================
This dataset contains 335 real-world images of Syrian car license plates collected from public sources and streets. It is intended for use in building and training Automatic License Plate Recognition (ALPR) or OCR systems, especially for Arabic-script plates.

πŸ“₯ DATASET DOWNLOAD INFORMATION
==================================

πŸ”΄ Dataset Size: Download dataset as zip (49 MB)

πŸ”° Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/esraaalsaeede/syrian-car-plates-dataset

πŸ“Š Additional information:
==================================
File count not found
Views: 59
Downloads: 6

πŸ“š RELATED NOTEBOOKS:
==================================
1. EGYPlate | Upvotes: 6
URL: https://www.kaggle.com/datasets/mohamedashrafkhalifa/car-plates-numbers

2. Car License Plate Detection Dataset | Upvotes: 3
URL: https://www.kaggle.com/datasets/unidpro/license-plate-detection-dataset

3. Germany License Plate Dataset - 177 827 Images | Upvotes: 2
URL: https://www.kaggle.com/datasets/unidpro/germany-license-plate-dataset

4. Images of Nepali License Plate | Upvotes: 1
URL: https://www.kaggle.com/datasets/kshitizgajurel042/images-of-nepali-license-plate

==================================
⭐️ By: https://t.me/datasets1
❀2
Dataset Name: Fashion Product Images and Text Dataset
Basic Description: Preprocessed Dataset for Efficient Multimodal Model Training

πŸ“– FULL DATASET DESCRIPTION:
==================================
This dataset is a curated collection of fashion product images paired with their titles and descriptions, designed for training and fine-tuning multimodal AI models. Originally derived from Param Aggraval's "Fashion Product Images Dataset," it has undergone extensive preprocessing to improve usability and efficiency.
Preprocessing steps include:
These optimizations have reduced the dataset size by 73%, making it lighter and faster to use without compromising data quality. This refined dataset is ideal for research and applications in multimodal AI, including tasks like product recommendation, image-text matching, and domain-specific fine-tuning.

πŸ“₯ DATASET DOWNLOAD INFORMATION
==================================

πŸ”΄ Dataset Size: Download dataset as zip (3 GB)

πŸ”° Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/nirmalsankalana/fashion-product-text-images-dataset

πŸ“Š Additional information:
==================================
Total files: 44,400
Views: 2,682
Downloads: 557

πŸ“š RELATED NOTEBOOKS:
==================================
1. Fashion Product Images Dataset | Upvotes: 753
URL: https://www.kaggle.com/datasets/paramaggarwal/fashion-product-images-dataset

2. Nordstrom & Myntra Clothes Image Data - GarmentIQ | Upvotes: 23
URL: https://www.kaggle.com/datasets/lygitdata/garmentiq-classification-set-nordstrom-and-myntra

3. Fashion products images dataset from farfetch | Upvotes: 20
URL: https://www.kaggle.com/datasets/crawlfeeds/images-extracted-from-fashion-website

4. numpy Weights for fashion product images dataset | Upvotes: 3
URL: https://www.kaggle.com/datasets/kalashj16/numpy-weights-for-fashion-product-images-dataset

5. Automated Refund Item Classification System | Upvotes: 2
URL: https://www.kaggle.com/code/zukhrakhongulomova/automated-refund-item-classification-system

==================================
⭐️ By: https://t.me/datasets1
❀6
Dataset Name: 🧬 Multi Cancer Dataset
Basic Description: 🧬 MultiCancer Dataset

πŸ“– FULL DATASET DESCRIPTION:
==================================
MultiCancerNet is a diverse and carefully curated image dataset designed for multi-class cancer classification and general pathology research. It consists of high-quality images gathered from various trusted sources, encompassing a wide range of cancer types, precancerous conditions, and healthy tissue samples across different organs and systems.
The dataset follows a standard PyTorch-style directory split, with clearly separated train/ and val/ folders for each class.
Example directory structure (showing class names):

πŸ“₯ DATASET DOWNLOAD INFORMATION
==================================

πŸ”΄ Dataset Size: Download dataset as zip (14 GB)

πŸ”° Direct dataset download link:
URL not found

πŸ“Š Additional information:
==================================
Total files: 199,000
Views: 494
Downloads: 34

πŸ“š RELATED NOTEBOOKS:
==================================
1. Cancer Instance Segmentation and Classification 1 | Upvotes: 45
URL: https://www.kaggle.com/datasets/andrewmvd/cancer-inst-segmentation-and-classification

2. Cancer Instance Segmentation and Classification 2 | Upvotes: 16
URL: https://www.kaggle.com/datasets/andrewmvd/cancer-instance-segmentation-and-classification-2

3. Skin Cancer Classification | Upvotes: 4
URL: https://www.kaggle.com/datasets/murtozalikhon/skin-cancer-classification

4. notebook421cb41f67 | Upvotes: 1
URL: https://www.kaggle.com/code/dipeshlohchab/notebook421cb41f67

5. Cancer Detection dataset | Upvotes: 0
URL: https://www.kaggle.com/datasets/mani11111111111/cancer-detection-dataset

==================================
⭐️ By: https://t.me/datasets1
❀2
Dataset Name: Cervical Cancer Behavior Risk Data
Basic Description: Cancer classification

πŸ“– FULL DATASET DESCRIPTION:
==================================
Cancer is a disease in which cells in the body grow out of control. Cancer is always named for the part of the body where it starts, even if it spreads to other body parts later. When cancer starts in the cervix, it is called cervical cancer. Cervical cancer is cancer that starts in the cells of the cervix. The cervix is the lower, narrow end of the uterus (womb). The cervix connects the uterus to the vagina (birth canal). Cervical cancer usually develops slowly over time. Before cancer appears in the cervix, the cells of the cervix go through changes known as dysplasia, in which abnormal cells begin to appear in the cervical tissue. Over time, if not destroyed or removed, the abnormal cells may become cancer cells and start to grow and spread more deeply into the cervix and to surrounding areas.Anyone with a cervix is at risk for cervical cancer. It occurs most often in people over age 30. Long-lasting infection with certain types of human papillomavirus (HPV) is the main cause of cervical cancer. HPV is a common virus that is passed from one person to another during sex. At least half of sexually active people will have HPV at some point in their lives, but few women will get cervical cancer.
Screening tests and the HPV vaccine can help prevent cervical cancer. When cervical cancer is found early, it is highly treatable and associated with long survival and good quality of life.
Task:- This dataset consists 18 attributes to classify the target label(ca_cervix (this is class attribute, 1=has cervical cancer, 0=no cervical cancer). This is a classification task.

πŸ“₯ DATASET DOWNLOAD INFORMATION
==================================

πŸ”΄ Dataset Size: Download dataset as zip (1 kB)

πŸ”° Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/senapatirajesh/cervical-cancer

πŸ“Š Additional information:
==================================
File count not found
Views: 3,847
Downloads: 567

πŸ“š RELATED NOTEBOOKS:
==================================
1. Cervical cancer prediction | Upvotes: 1
URL: https://www.kaggle.com/code/senapatirajesh/cervical-cancer-prediction

2. Colorectal Cancer Insights: Diagnosis & Trends | Upvotes: 1
URL: https://www.kaggle.com/datasets/danishbaariq/colorectal-cancer-insights-diagnosis-and-trends

3. Cervical Cancer dataset | Upvotes: 0
URL: https://www.kaggle.com/datasets/sambanankhumhango/cervical-cancer-dataset

4. cervical cancer risk factors | Upvotes: 0
URL: https://www.kaggle.com/datasets/mohammadhassanparvej/cervical-cancer-risk-factors

5. Cervical Cancer-dataset | Upvotes: 0
URL: https://www.kaggle.com/datasets/kevinnnm/cervical-cancer-dataset

==================================
⭐️ By: https://t.me/datasets1
❀5
Dataset Name: Keras Pretrained models
Basic Description: This dataset helps to use pretrained keras models in Kernels.

πŸ“– FULL DATASET DESCRIPTION:
==================================
Kaggle has more and more computer vision challenges. Although Kernel resources were increased recently we still can not train useful CNNs without GPU. The other main problem is that Kernels can't use network connection to download pretrained keras model weights. This dataset helps you to apply your favorite pretrained model in the Kaggle Kernel environment.
Happy data exploration and transfer learning!
Model (Top-1 Accuracy | Top -5 Accuracy)
For more information see https://keras.io/applications/

πŸ“₯ DATASET DOWNLOAD INFORMATION
==================================

πŸ”΄ Dataset Size: Download dataset as zip (989 MB)

πŸ”° Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/gaborfodor/keras-pretrained-models

πŸ“Š Additional information:
==================================
File count not found
Views: 112,000
Downloads: 29,000

πŸ“š RELATED NOTEBOOKS:
==================================
1. Brain Tumor Detection v1.0 || CNN, VGG-16 | Upvotes: 3,954
URL: https://www.kaggle.com/code/ruslankl/brain-tumor-detection-v1-0-cnn-vgg-16

2. Dog Breed - Pretrained keras models(LB 0.3) | Upvotes: 1,516
URL: https://www.kaggle.com/code/gaborfodor/dog-breed-pretrained-keras-models-lb-0-3

3. Brain Tumor MRI Classification | VGG16 | Upvotes: 1,387
URL: https://www.kaggle.com/code/loaiabdalslam/brain-tumor-mri-classification-vgg16

4. TF Keras pretrained model weights | Upvotes: 22
URL: https://www.kaggle.com/datasets/antoreepjana/tf-keras-pretrained-model-weights

5. segmentation-models 1.0.1 .whl files for TF+Keras | Upvotes: 4
URL: https://www.kaggle.com/datasets/saketpradhan/packages

==================================
⭐️ By: https://t.me/datasets1
❀3
Dataset Name: Malaria Detection
Basic Description: Dataset for Detecting Malaria from Microscopic Blood Smear Images

πŸ“– FULL DATASET DESCRIPTION:
==================================
The Malaria Detection dataset is designed for training and evaluating machine learning models to detect malaria from microscopic images of blood smears. The dataset consists of high-resolution images (224Γ—224 pixels) in JPG format, ensuring consistency and quality for effective model development.
Each of the folders β€” Train, Test, and Valid β€” contains images categorized into two classes:
Parasitized: Images of blood cells infected with malaria parasites.
Uninfected: Images of healthy blood cells without infection.
Train Folder: Contains 13,152 images used for training the machine learning model.
Helps the model learn to distinguish between Parasitized and Uninfected blood cells.
Test Folder: Contains 1,253 images used for evaluating the model’s performance after training.
Measures the model's ability to generalize and accurately classify unseen data into Parasitized and Uninfected classes.
Valid Folder: Contains 626 images used during the training process for validation.

πŸ“₯ DATASET DOWNLOAD INFORMATION
==================================

πŸ”΄ Dataset Size: Download dataset as zip (66 MB)

πŸ”° Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/shahriar26s/malaria-detection

πŸ“Š Additional information:
==================================
Total files: 15,000
Views: 4,639
Downloads: 596

πŸ“š RELATED NOTEBOOKS:
==================================
1. Malaria Detection Dataset | Upvotes: 33
URL: https://www.kaggle.com/datasets/orvile/p-vivax-malaria-infected-human-blood-smears

2. Cell Images Parasitized or Uninfected | Upvotes: 23
URL: https://www.kaggle.com/datasets/brsdincer/cell-images-parasitized-or-not

3. Malaria Detection Using Cnn | Upvotes: 13
URL: https://www.kaggle.com/code/shahriar26s/malaria-detection-using-cnn

4. Malaria Detection | ResNet18 | Upvotes: 7
URL: https://www.kaggle.com/code/simonecugliari/malaria-detection-resnet18

5. Malaria Detection 97% test accuracy | Upvotes: 5
URL: https://www.kaggle.com/code/ibrahimnibrahim/malaria-detection-97-test-accuracy

==================================
⭐️ By: https://t.me/datasets1
❀4
Dataset Name: Daily Temperature of Major Cities
Basic Description: Daily average temperature values recorded in major cities of the world

πŸ“– FULL DATASET DESCRIPTION:
==================================
Global warming is the ongoing rise of the average temperature of the Earth's climate system and has been demonstrated by direct temperature measurements and by measurements of various effects of the warming - Wikipedia
So a dataset on the temperature of major cities of the world will help analyze the same. Also weather information is helpful for a lot of data science tasks like sales forecasting, logistics etc.
Thanks to University of Dayton, the dataset is available as separate txt files for each city here. The data is available for research and non-commercial purposes only.. Please refer to this page for license.
Daily level average temperature values is present in city_temperature.csv file
University of Dayton for making this dataset available in the first place!
Photo credits: James Day on Unsplash
Some ideas are:

πŸ“₯ DATASET DOWNLOAD INFORMATION
==================================

πŸ”΄ Dataset Size: Download dataset as zip (14 MB)

πŸ”° Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/sudalairajkumar/daily-temperature-of-major-cities

πŸ“Š Additional information:
==================================
File count not found
Views: 262,000
Downloads: 43,300

πŸ“š RELATED NOTEBOOKS:
==================================
1. 〽️|3️⃣Ways to Deal with Time Series Forecasting | Upvotes: 296
URL: https://www.kaggle.com/code/mfaaris/3-ways-to-deal-with-time-series-forecasting

2. Studying India's AQI πŸ”Ž | Upvotes: 151
URL: https://www.kaggle.com/code/anshuls235/studying-india-s-aqi

3. Temperature prediction with TF dataset on CNN-LSTM | Upvotes: 104
URL: https://www.kaggle.com/code/gireeshs/temperature-prediction-with-tf-dataset-on-cnn-lstm

4. The Weather Dataset | Upvotes: 92
URL: https://www.kaggle.com/datasets/guillemservera/global-daily-climate-data

5. Global Rise in Temperatures in Each Country | Upvotes: 39
URL: https://www.kaggle.com/datasets/rishidamarla/global-rise-in-temperatures-in-each-country

==================================
⭐️ By: https://t.me/datasets1
❀2
Dataset Name: Flickr-Faces-HQ Dataset (FFHQ)
Basic Description: Dataset of human faces for generative adversarial networks (GAN)

πŸ“– FULL DATASET DESCRIPTION:
==================================
The dataset consists of 52,000 high-quality PNG images at 512Γ—512 resolution and contains considerable variation in terms of age, ethnicity and image background. It also has good coverage of accessories such as eyeglasses, sunglasses, hats, etc. The images were crawled from Flickr, thus inheriting all the biases of that website, and automatically aligned and cropped using dlib. Only images under permissive licenses were collected. Various automatic filters were used to prune the set, and finally Amazon Mechanical Turk was used to remove the occasional statues, paintings, or photos of photos.
For business inquiries, please contact researchinquiries@nvidia.com
For press and other inquiries, please contact Hector Marinez at hmarinez@nvidia.com

πŸ“₯ DATASET DOWNLOAD INFORMATION
==================================

πŸ”΄ Dataset Size: Download dataset as zip (21 GB)

πŸ”° Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/arnaud58/flickrfaceshq-dataset-ffhq

πŸ“Š Additional information:
==================================
Total files: 52,000
Views: 91,800
Downloads: 21,500

πŸ“š RELATED NOTEBOOKS:
==================================
1. Image-Captioner | Upvotes: 104
URL: https://www.kaggle.com/code/dbdmobile/image-captioner

2. Helen Eye Dataset | Upvotes: 90
URL: https://www.kaggle.com/datasets/kmader/helen-eye-dataset

3. StyleGan | Upvotes: 51
URL: https://www.kaggle.com/code/samadazimiabriz/stylegan

4. Image-Captioner | Upvotes: 48
URL: https://www.kaggle.com/code/nepjunecai63/image-captioner

5. Custom Face Recognition Image Dataset | Upvotes: 4
URL: https://www.kaggle.com/datasets/unidpro/face-recognition-image-dataset

==================================
⭐️ By: https://t.me/datasets1
❀7
Dataset Name: COVID-19 CT scans
Basic Description: 20 CT scans and expert segmentations of patients with COVID-19

πŸ“– FULL DATASET DESCRIPTION:
==================================
CT scans plays a supportive role in the diagnosis of COVID-19 and is a key procedure for determining the severity that the patient finds himself in. Models that can find evidence of COVID-19 and/or characterize its findings can play a crucial role in optimizing diagnosis and treatment, especially in areas with a shortage of expert radiologists. This dataset contains 20 CT scans of patients diagnosed with COVID-19 as well as segmentations of lungs and infections made by experts.

πŸ“₯ DATASET DOWNLOAD INFORMATION
==================================

πŸ”΄ Dataset Size: Download dataset as zip (1 GB)

πŸ”° Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/andrewmvd/covid19-ct-scans

πŸ“Š Additional information:
==================================
File count not found
Views: 211,000
Downloads: 26,700

πŸ“š RELATED NOTEBOOKS:
==================================
1. Covid-19 Detection from Lung X-rays | Upvotes: 587
URL: https://www.kaggle.com/code/eswarchandt/covid-19-detection-from-lung-x-rays

2. COVID-19 CT Scans: Getting Started | Upvotes: 430
URL: https://www.kaggle.com/code/andrewmvd/covid-19-ct-scans-getting-started

3. COVID-19 Lung CT Scan Segmentation | Upvotes: 241
URL: https://www.kaggle.com/code/akshat0007/covid-19-lung-ct-scan-segmentation

4. Large COVID-19 CT scan slice dataset | Upvotes: 88
URL: https://www.kaggle.com/datasets/maedemaftouni/large-covid19-ct-slice-dataset

5. MosMedData Chest CT Scans with COVID-19 | Upvotes: 65
URL: https://www.kaggle.com/datasets/mathurinache/mosmeddata-chest-ct-scans-with-covid19

==================================
⭐️ By: https://t.me/datasets1
❀4
Dataset Name: Bone Fracture Multi-Region X-ray Data
Basic Description: Bone Fracture Radiographic Data Across All Anatomical Regions

πŸ“– FULL DATASET DESCRIPTION:
==================================
This dataset comprises fractured and non-fractured X-ray images covering all anatomical body regions, including lower limb, upper limb, lumbar, hips, knees, etc. The dataset is categorized into train, test, and validation folders, each containing fractured and non-fractured radiographic images. Click this link https://www.kaggle.com/datasets/bmadushanirodrigo/fracture-multi-region-x-ray-data/data to access the dataset.
This dataset contains 10,580 radiographic images (X-ray) data.
Training Data Number of Images: 9246
Validation Data Number of Images: 828
Test Data Number of Images: 506

πŸ“₯ DATASET DOWNLOAD INFORMATION
==================================

πŸ”΄ Dataset Size: Download dataset as zip (505 MB)

πŸ”° Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/bmadushanirodrigo/fracture-multi-region-x-ray-data

πŸ“Š Additional information:
==================================
Total files: 10,600
Views: 35,500
Downloads: 9,953

πŸ“š RELATED NOTEBOOKS:
==================================
1. Bone Fracture Detection 97% Accuracy CNN | Upvotes: 97
URL: https://www.kaggle.com/code/prasadchaskar/bone-fracture-detection-97-accuracy-cnn

2. Bone Fracture Detection | 97% Accuracy | CNN | Upvotes: 52
URL: https://www.kaggle.com/code/nirmalgaud/bone-fracture-detection-97-accuracy-cnn

3. f1 > 100 Bone Fracture X-ray | TF CNN | Upvotes: 46
URL: https://www.kaggle.com/code/iasadpanwhar/f1-100-bone-fracture-x-ray-tf-cnn

4. Simple vs Comminuted Fractures X-ray Data | Upvotes: 19
URL: https://www.kaggle.com/datasets/orvile/simple-vs-comminuted-fractures-x-ray-data

5. X-Ray Dection | Upvotes: 18
URL: https://www.kaggle.com/datasets/umeradnaan/x-ray-dection

==================================
⭐️ By: https://t.me/datasets1
❀2
Dataset Name: Huggingface BERT
Basic Description: BERT models directly retrieved and updated from: https://huggingface.co/

πŸ“– FULL DATASET DESCRIPTION:
==================================
This dataset contains many popular BERT weights retrieved directly on Hugging Face's model repository, and hosted on Kaggle. It will be automatically updated every month to ensure that the latest version is available to the user. By making it a dataset, it is significantly faster to load the weights since you can directly attach a Kaggle dataset to the notebook rather than downloading the data every time. See the speed comparison notebook.
The banner was adapted from figures by Jimmy Lin (tweet; slide) released under CC BY 4.0. BERT has an Apache 2.0 license according to the model repository.
To use this dataset, simply attach it the your notebook and specify the path to the dataset. For example:
All the copyrights and IP relating to BERT belong to the original authors (Devlin et. al 2019) and Google. All copyrights relating to the transformers library belong to Hugging Face. The banner image was created thanks to Jimmy Lin so any modification of this figure should mention the original author and respect the conditions of the license; all copyrights related to the images belong to him.

πŸ“₯ DATASET DOWNLOAD INFORMATION
==================================

πŸ”΄ Dataset Size: Download dataset as zip (26 GB)

πŸ”° Direct dataset download link:
URL not found

πŸ“Š Additional information:
==================================
File count not found
Views: 42,000
Downloads: 2,572

πŸ“š RELATED NOTEBOOKS:
==================================
1. Starter Notebook: Ranked Predictions with BERT | Upvotes: 1,086
URL: https://www.kaggle.com/code/wlifferth/starter-notebook-ranked-predictions-with-bert

2. CommonLit Readability - EDA & RoBERTa TF baseline | Upvotes: 374
URL: https://www.kaggle.com/code/dimitreoliveira/commonlit-readability-eda-roberta-tf-baseline

3. πŸ“–Feedback- BaselineπŸ€— Sentence Classifier [0.226] | Upvotes: 351
URL: https://www.kaggle.com/code/julian3833/feedback-baseline-sentence-classifier-0-226

4. Huggingface BERT Variants | Upvotes: 83
URL: https://www.kaggle.com/datasets/sauravmaheshkar/huggingface-bert-variants

5. Pretrained BERT Models for PyTorch | Upvotes: 45
URL: https://www.kaggle.com/datasets/soulmachine/pretrained-bert-models-for-pytorch

==================================
⭐️ By: https://t.me/datasets1