Dataset Name: Malaria Detection
Basic Description: Dataset for Detecting Malaria from Microscopic Blood Smear Images
FULL DATASET DESCRIPTION:
==================================
The Malaria Detection dataset is designed for training and evaluating machine learning models that detect malaria from microscopic images of blood smears. The dataset consists of JPG images at a uniform 224×224 pixels, which keeps model inputs consistent.
Each of the folders (Train, Test, and Valid) contains images categorized into two classes:
Parasitized: Images of blood cells infected with malaria parasites.
Uninfected: Images of healthy blood cells without infection.
Train Folder: Contains 13,152 images used for training the machine learning model. These help the model learn to distinguish between Parasitized and Uninfected blood cells.
Test Folder: Contains 1,253 images used for evaluating the model's performance after training. These measure the model's ability to generalize and accurately classify unseen data into the Parasitized and Uninfected classes.
Valid Folder: Contains 626 images used during the training process for validation.
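A quick sanity check of the folder layout described above can be sketched as follows. This is only a minimal sketch: the root path `malaria-detection` is a hypothetical location for the unzipped archive and the `.jpg` extension is an assumption; the split and class folder names are taken from the description.

```python
from pathlib import Path

# Folder names as given in the dataset description
SPLITS = ("Train", "Test", "Valid")
CLASSES = ("Parasitized", "Uninfected")


def count_images(root, extension=".jpg"):
    """Return {split: {class: count}} for files under root/<split>/<class>."""
    root = Path(root)
    counts = {}
    for split in SPLITS:
        counts[split] = {}
        for cls in CLASSES:
            folder = root / split / cls
            if folder.is_dir():
                # Count only regular files with the expected extension
                n = sum(
                    1
                    for p in folder.iterdir()
                    if p.is_file() and p.suffix.lower() == extension
                )
            else:
                # A missing folder is reported as zero instead of raising
                n = 0
            counts[split][cls] = n
    return counts


if __name__ == "__main__":
    # "malaria-detection" is a hypothetical path to the extracted zip
    for split, per_class in count_images("malaria-detection").items():
        print(split, per_class, "total:", sum(per_class.values()))
```

If the archive matches the description, the per-split totals should come out near 13,152, 1,253, and 626.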
DATASET DOWNLOAD INFORMATION
==================================
Dataset Size: Download dataset as zip (66 MB)
Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/shahriar26s/malaria-detection
Additional information:
==================================
Total files: 15,000
Views: 4,639
Downloads: 596
RELATED NOTEBOOKS:
==================================
1. Malaria Detection Dataset | Upvotes: 33
URL: https://www.kaggle.com/datasets/orvile/p-vivax-malaria-infected-human-blood-smears
2. Cell Images Parasitized or Uninfected | Upvotes: 23
URL: https://www.kaggle.com/datasets/brsdincer/cell-images-parasitized-or-not
3. Malaria Detection Using Cnn | Upvotes: 13
URL: https://www.kaggle.com/code/shahriar26s/malaria-detection-using-cnn
4. Malaria Detection | ResNet18 | Upvotes: 7
URL: https://www.kaggle.com/code/simonecugliari/malaria-detection-resnet18
5. Malaria Detection 97% test accuracy | Upvotes: 5
URL: https://www.kaggle.com/code/ibrahimnibrahim/malaria-detection-97-test-accuracy
==================================
By: https://t.me/datasets1
Dataset Name: Daily Temperature of Major Cities
Basic Description: Daily average temperature values recorded in major cities of the world
FULL DATASET DESCRIPTION:
==================================
Global warming is the ongoing rise of the average temperature of the Earth's climate system and has been demonstrated by direct temperature measurements and by measurements of various effects of the warming. (Source: Wikipedia)
A dataset on the temperature of major cities of the world can help analyze this trend. Weather information is also helpful for many data science tasks, such as sales forecasting and logistics.
Thanks to the University of Dayton, the dataset is available as separate txt files for each city here. The data is available for research and non-commercial purposes only. Please refer to this page for the license.
Daily average temperature values are in the city_temperature.csv file.
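As a starting point for analysis, yearly mean temperatures per city could be computed along the following lines. This is a rough sketch under stated assumptions: the column names `City`, `Year`, and `AvgTemperature` and the `-99` missing-value sentinel are not given in this post and should be checked against the header of city_temperature.csv before use.

```python
import csv
from collections import defaultdict

# Assumption: missing readings are encoded as -99 (verify against the file)
MISSING = -99.0


def yearly_city_means(rows, city_col="City", year_col="Year",
                      temp_col="AvgTemperature"):
    """Average the daily values per (city, year), skipping missing readings.

    `rows` is any iterable of dicts, e.g. a csv.DictReader.
    The default column names are assumptions, not confirmed by the post.
    """
    sums = defaultdict(lambda: [0.0, 0])  # (city, year) -> [total, count]
    for row in rows:
        temp = float(row[temp_col])
        if temp == MISSING:
            continue
        key = (row[city_col], int(row[year_col]))
        sums[key][0] += temp
        sums[key][1] += 1
    return {key: total / n for key, (total, n) in sums.items()}


def read_rows(path):
    """Stream rows from the CSV file as dictionaries."""
    with open(path, newline="", encoding="utf-8") as f:
        yield from csv.DictReader(f)
```

A call such as `yearly_city_means(read_rows("city_temperature.csv"))` would then return a dictionary keyed by `(city, year)`.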
Thanks to the University of Dayton for making this dataset available in the first place!
Photo credits: James Day on Unsplash
Some ideas are:
DATASET DOWNLOAD INFORMATION
==================================
Dataset Size: Download dataset as zip (14 MB)
Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/sudalairajkumar/daily-temperature-of-major-cities
Additional information:
==================================
File count not found
Views: 262,000
Downloads: 43,300
RELATED NOTEBOOKS:
==================================
1. 3 Ways to Deal with Time Series Forecasting | Upvotes: 296
URL: https://www.kaggle.com/code/mfaaris/3-ways-to-deal-with-time-series-forecasting
2. Studying India's AQI | Upvotes: 151
URL: https://www.kaggle.com/code/anshuls235/studying-india-s-aqi
3. Temperature prediction with TF dataset on CNN-LSTM | Upvotes: 104
URL: https://www.kaggle.com/code/gireeshs/temperature-prediction-with-tf-dataset-on-cnn-lstm
4. The Weather Dataset | Upvotes: 92
URL: https://www.kaggle.com/datasets/guillemservera/global-daily-climate-data
5. Global Rise in Temperatures in Each Country | Upvotes: 39
URL: https://www.kaggle.com/datasets/rishidamarla/global-rise-in-temperatures-in-each-country
==================================
By: https://t.me/datasets1
Dataset Name: Flickr-Faces-HQ Dataset (FFHQ)
Basic Description: Dataset of human faces for generative adversarial networks (GAN)
FULL DATASET DESCRIPTION:
==================================
The dataset consists of 52,000 high-quality PNG images at 512×512 resolution and contains considerable variation in terms of age, ethnicity and image background. It also has good coverage of accessories such as eyeglasses, sunglasses, hats, etc. The images were crawled from Flickr, thus inheriting all the biases of that website, and automatically aligned and cropped using dlib. Only images under permissive licenses were collected. Various automatic filters were used to prune the set, and finally Amazon Mechanical Turk was used to remove the occasional statues, paintings, or photos of photos.
For business inquiries, please contact researchinquiries@nvidia.com
For press and other inquiries, please contact Hector Marinez at hmarinez@nvidia.com
DATASET DOWNLOAD INFORMATION
==================================
Dataset Size: Download dataset as zip (21 GB)
Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/arnaud58/flickrfaceshq-dataset-ffhq
Additional information:
==================================
Total files: 52,000
Views: 91,800
Downloads: 21,500
RELATED NOTEBOOKS:
==================================
1. Image-Captioner | Upvotes: 104
URL: https://www.kaggle.com/code/dbdmobile/image-captioner
2. Helen Eye Dataset | Upvotes: 90
URL: https://www.kaggle.com/datasets/kmader/helen-eye-dataset
3. StyleGan | Upvotes: 51
URL: https://www.kaggle.com/code/samadazimiabriz/stylegan
4. Image-Captioner | Upvotes: 48
URL: https://www.kaggle.com/code/nepjunecai63/image-captioner
5. Custom Face Recognition Image Dataset | Upvotes: 4
URL: https://www.kaggle.com/datasets/unidpro/face-recognition-image-dataset
==================================
By: https://t.me/datasets1
Dataset Name: COVID-19 CT scans
Basic Description: 20 CT scans and expert segmentations of patients with COVID-19
FULL DATASET DESCRIPTION:
==================================
CT scans play a supportive role in the diagnosis of COVID-19 and are a key procedure for determining the severity of a patient's condition. Models that can find evidence of COVID-19 and/or characterize its findings can play a crucial role in optimizing diagnosis and treatment, especially in areas with a shortage of expert radiologists. This dataset contains 20 CT scans of patients diagnosed with COVID-19 as well as segmentations of lungs and infections made by experts.
DATASET DOWNLOAD INFORMATION
==================================
Dataset Size: Download dataset as zip (1 GB)
Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/andrewmvd/covid19-ct-scans
Additional information:
==================================
File count not found
Views: 211,000
Downloads: 26,700
RELATED NOTEBOOKS:
==================================
1. Covid-19 Detection from Lung X-rays | Upvotes: 587
URL: https://www.kaggle.com/code/eswarchandt/covid-19-detection-from-lung-x-rays
2. COVID-19 CT Scans: Getting Started | Upvotes: 430
URL: https://www.kaggle.com/code/andrewmvd/covid-19-ct-scans-getting-started
3. COVID-19 Lung CT Scan Segmentation | Upvotes: 241
URL: https://www.kaggle.com/code/akshat0007/covid-19-lung-ct-scan-segmentation
4. Large COVID-19 CT scan slice dataset | Upvotes: 88
URL: https://www.kaggle.com/datasets/maedemaftouni/large-covid19-ct-slice-dataset
5. MosMedData Chest CT Scans with COVID-19 | Upvotes: 65
URL: https://www.kaggle.com/datasets/mathurinache/mosmeddata-chest-ct-scans-with-covid19
==================================
By: https://t.me/datasets1
Dataset Name: Bone Fracture Multi-Region X-ray Data
Basic Description: Bone Fracture Radiographic Data Across All Anatomical Regions
FULL DATASET DESCRIPTION:
==================================
This dataset comprises fractured and non-fractured X-ray images covering all anatomical body regions, including lower limb, upper limb, lumbar, hips, knees, etc. The dataset is categorized into train, test, and validation folders, each containing fractured and non-fractured radiographic images. Click this link https://www.kaggle.com/datasets/bmadushanirodrigo/fracture-multi-region-x-ray-data/data to access the dataset.
This dataset contains 10,580 radiographic (X-ray) images.
Training Data Number of Images: 9246
Validation Data Number of Images: 828
Test Data Number of Images: 506
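The split sizes listed above can be checked against the stated total, and turned into proportions, with a few lines of arithmetic. The dictionary keys below are illustrative labels; the counts are the ones given in the post.

```python
# Image counts per split, as stated in the dataset description
SPLIT_COUNTS = {"train": 9246, "validation": 828, "test": 506}


def split_shares(counts):
    """Return each split's fraction of the total number of images."""
    total = sum(counts.values())
    return {name: n / total for name, n in counts.items()}


total = sum(SPLIT_COUNTS.values())
shares = split_shares(SPLIT_COUNTS)

for name, share in shares.items():
    print(f"{name}: {SPLIT_COUNTS[name]} images ({share:.1%})")
print("total:", total)  # 10580, matching the stated dataset size
```

The three counts add up to exactly 10,580, so the split is roughly 87% training, 8% validation, and 5% test.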
DATASET DOWNLOAD INFORMATION
==================================
Dataset Size: Download dataset as zip (505 MB)
Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/bmadushanirodrigo/fracture-multi-region-x-ray-data
Additional information:
==================================
Total files: 10,600
Views: 35,500
Downloads: 9,953
RELATED NOTEBOOKS:
==================================
1. Bone Fracture Detection 97% Accuracy CNN | Upvotes: 97
URL: https://www.kaggle.com/code/prasadchaskar/bone-fracture-detection-97-accuracy-cnn
2. Bone Fracture Detection | 97% Accuracy | CNN | Upvotes: 52
URL: https://www.kaggle.com/code/nirmalgaud/bone-fracture-detection-97-accuracy-cnn
3. f1 > 100 Bone Fracture X-ray | TF CNN | Upvotes: 46
URL: https://www.kaggle.com/code/iasadpanwhar/f1-100-bone-fracture-x-ray-tf-cnn
4. Simple vs Comminuted Fractures X-ray Data | Upvotes: 19
URL: https://www.kaggle.com/datasets/orvile/simple-vs-comminuted-fractures-x-ray-data
5. X-Ray Dection | Upvotes: 18
URL: https://www.kaggle.com/datasets/umeradnaan/x-ray-dection
==================================
By: https://t.me/datasets1
Dataset Name: Huggingface BERT
Basic Description: BERT models directly retrieved and updated from: https://huggingface.co/
FULL DATASET DESCRIPTION:
==================================
This dataset contains many popular BERT weights retrieved directly from Hugging Face's model repository and hosted on Kaggle. It will be automatically updated every month to ensure that the latest version is available to the user. Because the weights are packaged as a dataset, loading them is significantly faster: you can attach a Kaggle dataset directly to the notebook rather than downloading the data every time. See the speed comparison notebook.
The banner was adapted from figures by Jimmy Lin (tweet; slide) released under CC BY 4.0. BERT has an Apache 2.0 license according to the model repository.
To use this dataset, simply attach it to your notebook and specify the path to the dataset. For example:
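A minimal sketch of such usage, which is not the author's original example: it assumes that Kaggle mounts attached datasets under `/kaggle/input/<dataset-slug>/` and that the `transformers` library is installed in the notebook. The slug `huggingface-bert` and the folder name `bert-base-uncased` are hypothetical placeholders; check the attached dataset's file listing for the real names.

```python
from pathlib import Path

# Assumption: attached Kaggle datasets appear under this directory
KAGGLE_INPUT = Path("/kaggle/input")


def local_model_dir(dataset_slug, model_name, root=KAGGLE_INPUT):
    """Build the path to one model folder inside an attached dataset."""
    return Path(root) / dataset_slug / model_name


def load_bert(model_dir):
    """Load tokenizer and model from a local folder instead of the Hub."""
    model_dir = Path(model_dir)
    if not model_dir.is_dir():
        raise FileNotFoundError(f"No model folder at {model_dir}")
    # Imported lazily so the path helper works without transformers
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(str(model_dir))
    model = AutoModel.from_pretrained(str(model_dir))
    return tokenizer, model


# Hypothetical slug and folder name, for illustration only:
# tokenizer, model = load_bert(
#     local_model_dir("huggingface-bert", "bert-base-uncased")
# )
```

Passing a local directory to `from_pretrained` avoids any network download, which is the speed benefit the description refers to.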
All the copyrights and IP relating to BERT belong to the original authors (Devlin et al., 2019) and Google. All copyrights relating to the transformers library belong to Hugging Face. The banner image was adapted from figures by Jimmy Lin, so any modification of it should mention the original author and respect the conditions of the license; all copyrights related to the images belong to him.
DATASET DOWNLOAD INFORMATION
==================================
Dataset Size: Download dataset as zip (26 GB)
Direct dataset download link:
URL not found
Additional information:
==================================
File count not found
Views: 42,000
Downloads: 2,572
RELATED NOTEBOOKS:
==================================
1. Starter Notebook: Ranked Predictions with BERT | Upvotes: 1,086
URL: https://www.kaggle.com/code/wlifferth/starter-notebook-ranked-predictions-with-bert
2. CommonLit Readability - EDA & RoBERTa TF baseline | Upvotes: 374
URL: https://www.kaggle.com/code/dimitreoliveira/commonlit-readability-eda-roberta-tf-baseline
3. Feedback - Baseline Sentence Classifier [0.226] | Upvotes: 351
URL: https://www.kaggle.com/code/julian3833/feedback-baseline-sentence-classifier-0-226
4. Huggingface BERT Variants | Upvotes: 83
URL: https://www.kaggle.com/datasets/sauravmaheshkar/huggingface-bert-variants
5. Pretrained BERT Models for PyTorch | Upvotes: 45
URL: https://www.kaggle.com/datasets/soulmachine/pretrained-bert-models-for-pytorch
==================================
By: https://t.me/datasets1
Dataset Name: Book Recommendation Dataset
Basic Description: Build state-of-the-art models for book recommendation system
FULL DATASET DESCRIPTION:
==================================
During the last few decades, with the rise of YouTube, Amazon, Netflix and many other such web services, recommender systems have become an increasingly large part of our lives. From e-commerce (suggesting articles that could interest buyers) to online advertising (suggesting content that matches users' preferences), recommender systems are today unavoidable in our daily online journeys. In a very general way, recommender systems are algorithms aimed at suggesting relevant items to users (items being movies to watch, text to read, products to buy or anything else, depending on the industry).
Recommender systems are critical in some industries, as they can generate a huge amount of income when they are efficient or be a way to stand out significantly from competitors. As proof of their importance, a few years ago Netflix organised a challenge (the "Netflix prize") where the goal was to produce a recommender system that performed better than its own algorithm, with a prize of 1 million dollars to win.
Image: Stuttgart City Library | Stuttgart, Germany, PHOTO: DIETER WEINELT, FLICKR
The Book-Crossing dataset comprises 3 files.
DATASET DOWNLOAD INFORMATION
==================================
Dataset Size: Download dataset as zip (26 MB)
Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/arashnic/book-recommendation-dataset
Additional information:
==================================
File count not found
Views: 374,000
Downloads: 103,000
RELATED NOTEBOOKS:
==================================
1. Book Recommendation System | Upvotes: 354
URL: https://www.kaggle.com/code/fahadmehfoooz/book-recommendation-system
2. BOOK RECOMMENDER | Upvotes: 257
URL: https://www.kaggle.com/code/hilalmleykeyuksel/book-recommender
3. book_recommender_KNN | Upvotes: 151
URL: https://www.kaggle.com/code/danishammar/book-recommender-knn
4. Amazon Products Sold on ModCloth | Upvotes: 19
URL: https://www.kaggle.com/datasets/arashnic/marketing-bias-dataset
5. Goodbooks 10k Updated | Upvotes: 9
URL: https://www.kaggle.com/datasets/alexanderfrosati/goodbooks-10k-updated
==================================
By: https://t.me/datasets1
Forwarded from Machine Learning with Python
https://t.me/DataScienceN.
We have created a channel to guide students towards their educational paths correctly
Join our channel
Telegram
Data Science Jupyter Notebooks
Explore the world of Data Science through Jupyter Notebooks: insights, tutorials, and tools to boost your data journey. Code, analyze, and visualize smarter with every post.
Dataset Name: Wine Reviews
Basic Description: 130k wine reviews with variety, location, winery, price, and description
FULL DATASET DESCRIPTION:
==================================
After watching Somm (a documentary on master sommeliers) I wondered how I could create a predictive model to identify wines through blind tasting like a master sommelier would. The first step in this journey was gathering some data to train a model. I plan to use deep learning to predict the wine variety using words in the description/review. The model still won't be able to taste the wine, but theoretically it could identify the wine based on a description that a sommelier could give. If anyone has any ideas on how to accomplish this, please post them!
This dataset contains three files:
winemag-data-130k-v2.csv contains 10 columns and 130k rows of wine reviews.
winemag-data_first150k.csv contains 10 columns and 150k rows of wine reviews.
winemag-data-130k-v2.json contains 6919 nodes of wine reviews.
Click on the data tab to see individual file descriptions, column-level metadata and summary statistics.
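A first look at the data for the variety-prediction idea could start by counting how often each variety appears. The sketch below makes assumptions: the column name `variety` is inferred from the dataset's basic description rather than confirmed against the CSV header, and UTF-8 encoding is assumed.

```python
import csv
from collections import Counter


def top_varieties(rows, n=5, column="variety"):
    """Return the n most common values of `column` as (value, count) pairs.

    `rows` is any iterable of dicts, e.g. a csv.DictReader.
    Rows where the column is empty or absent are skipped.
    """
    counts = Counter(row[column] for row in rows if row.get(column))
    return counts.most_common(n)


def read_reviews(path):
    """Stream review rows from one of the CSV files as dictionaries."""
    with open(path, newline="", encoding="utf-8") as f:
        yield from csv.DictReader(f)
```

For example, `top_varieties(read_reviews("winemag-data-130k-v2.csv"))` would list the five most reviewed varieties, which gives a sense of how imbalanced the classes are before training a classifier.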
DATASET DOWNLOAD INFORMATION
==================================
Dataset Size: Download dataset as zip (53 MB)
Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/zynicide/wine-reviews
Additional information:
==================================
File count not found
Downloads: 331,000
RELATED NOTEBOOKS:
==================================
1. Exercise: Creating, Reading and Writing | Upvotes: 454,421
URL: https://www.kaggle.com/code/residentmario/exercise-creating-reading-and-writing
2. Exercise: Indexing, Selecting & Assigning | Upvotes: 320,767
URL: https://www.kaggle.com/code/residentmario/exercise-indexing-selecting-assigning
3. Exercise: Summary Functions and Maps | Upvotes: 270,328
URL: https://www.kaggle.com/code/residentmario/exercise-summary-functions-and-maps
4. Spanish Wine Quality Dataset | Upvotes: 142
URL: https://www.kaggle.com/datasets/fedesoriano/spanish-wine-quality-dataset
5. wine quality selection | Upvotes: 33
URL: https://www.kaggle.com/datasets/maitree/wine-quality-selection
==================================
By: https://t.me/datasets1
Dataset Name: Drowsiness Detection Dataset
Basic Description: UnityEyes - Opened/Closed Eyes - Sleepy Driver Detection
FULL DATASET DESCRIPTION:
==================================
Welcome to the UnityEyes Drowsiness Detection Dataset! This comprehensive dataset is designed to aid researchers and developers in the critical task of drowsiness detection, specifically focusing on identifying sleepy drivers based on eye behavior. The dataset was generated with UnityEyes, a state-of-the-art synthetic eye-image simulator, ensuring high-quality data. It was labelled using an arbitrary threshold of openness=20 (reference: https://github.com/SNTSVV/HUDD-Toolset).
The Drowsiness Detection Dataset comprises a diverse collection of eye movement recordings from subjects of varying demographics, captured under controlled driving scenarios. The data includes sequences of eye images, meticulously labeled to indicate whether the eyes are open or closed, serving as ground truth for sleepy driver detection.
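The threshold-based labelling described above can be sketched as follows. This is a minimal illustration, assuming that openness is a single number per image and that values below the threshold count as closed; the function name label_eye, the comparison direction, and the handling of the boundary value are assumptions, not taken from the HUDD toolset.

```python
from collections import Counter

OPENNESS_THRESHOLD = 20.0  # threshold value quoted in the dataset description


def label_eye(openness, threshold=OPENNESS_THRESHOLD):
    """Label one eye image as 'open' or 'closed' from its openness value.

    Assumption: values strictly below the threshold are treated as closed.
    """
    return "closed" if openness < threshold else "open"


def label_counts(openness_values, threshold=OPENNESS_THRESHOLD):
    """Count how many samples fall into each label."""
    return Counter(label_eye(v, threshold) for v in openness_values)


# Hypothetical openness values, for illustration only.
print(label_counts([5.0, 19.9, 20.0, 42.5]))
```

Because the threshold is described as arbitrary, it may be worth checking how sensitive the class balance is to nearby values before training on these labels.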
DATASET DOWNLOAD INFORMATION
==================================
Dataset Size: Download dataset as zip (552 MB)
Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/hazemfahmy/openned-closed-eyes
Additional information:
==================================
Total files: 88,500
Views: 20,000
Downloads: 2,673
RELATED NOTEBOOKS:
==================================
1. drowsiness Detection Using YOLOv8 | Upvotes: 113
URL: https://www.kaggle.com/code/gauravsrivastav2507/drowsiness-detection-using-yolov8
2. ResNet50 val_accuracy:0.946 | Upvotes: 26
URL: https://www.kaggle.com/code/shivamsingh17072001/resnet50-val-accuracy-0-946
3. Drowsy Driver Detection - Omer Mustafa - Hitesh | Upvotes: 24
URL: https://www.kaggle.com/code/theomermustafa/drowsy-driver-detection-omer-mustafa-hitesh
4. MRL Eye Dataset | Upvotes: 12
URL: https://www.kaggle.com/datasets/akashshingha850/mrl-eye-dataset
5. Eye Dataset (open/close) for drowsiness prediction | Upvotes: 9
URL: https://www.kaggle.com/datasets/dhirdevansh/eye-dataset-openclose-for-drowsiness-prediction
==================================
By: https://t.me/datasets1
Dataset Name: Garbage Dataset
Basic Description: A Comprehensive Image Dataset for Garbage Classification and Recycling
FULL DATASET DESCRIPTION:
==================================
This dataset contains images of garbage items categorized into 10 classes, designed for machine learning and computer vision projects focusing on recycling and waste management. It is ideal for building classification or object detection models or developing AI-powered solutions for sustainable waste disposal.
Dataset Summary
The dataset features 10 distinct classes of garbage with a total of 19,762 images.
DATASET DOWNLOAD INFORMATION
==================================
Dataset Size: Download dataset as zip (780 MB)
Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/sumn2u/garbage-classification-v2
Additional information:
==================================
Total files: 19,800
Views: 53,200
Downloads: 12,100
RELATED NOTEBOOKS:
==================================
1. Garbage Classification (ResNet) | Upvotes: 57
URL: https://www.kaggle.com/code/sumn2u/garbage-classification-resnet
2. garbage-classification: DenseNet201&ResNet101V2 | Upvotes: 49
URL: https://www.kaggle.com/code/ztrollk/garbage-classification-densenet201-resnet101v2
3. Garbage Classification (Transfer Learning) | Upvotes: 39
URL: https://www.kaggle.com/code/sumn2u/garbage-classification-transfer-learning
4. Waste Materials classification Data | Upvotes: 5
URL: https://www.kaggle.com/datasets/isaacritharson/metal-glassgarbage-classification-data
5. Garbage Image classification | Upvotes: 1
URL: https://www.kaggle.com/datasets/isratjahan123/garbage-image-classification
==================================
By: https://t.me/datasets1
Dataset Name: Breast Cancer
Basic Description: Breast Tumor Mammography Dataset for Computer Vision
FULL DATASET DESCRIPTION:
==================================
This dataset contains 3,383 mammogram images focused on breast tumors, annotated in a folder structure. The dataset was exported from Roboflow, a platform for computer vision projects. It is ideal for building and testing deep-learning models aimed at detecting breast tumors in mammograms.
Preprocessing applied: auto-orientation of pixel data (EXIF-orientation stripping) and resizing to 640x640 pixels.
This dataset can be used for various computer vision tasks.
DATASET DOWNLOAD INFORMATION
==================================
Dataset Size: Download dataset as zip (91 MB)
Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/hayder17/breast-cancer-detection
Additional information:
==================================
File count not found
Views: 18,700
Downloads: 3,436
RELATED NOTEBOOKS:
==================================
1. RSNA | CNN Breast Cancer | Upvotes: 88
URL: https://www.kaggle.com/code/gallo33henrique/rsna-cnn-breast-cancer
2. Merge BC DataSet | Upvotes: 26
URL: https://www.kaggle.com/code/hayder17/merge-bc-dataset
3. Breast Segmentation | LLM MedGemma | Upvotes: 22
URL: https://www.kaggle.com/code/gallo33henrique/breast-segmentation-llm-medgemma
4. RNSA-Mammography-512px-8bit | Upvotes: 3
URL: https://www.kaggle.com/datasets/deltaechov/rnsamamographt512px8bit
5. Breast MRI Tumor Classification Dataset | Upvotes: 0
URL: https://www.kaggle.com/datasets/abenjelloun/breast-mri-tumor-classification-dataset
==================================
By: https://t.me/datasets1
Dataset Name: Malaria: Plasmodium Vivax Species
Basic Description: The Malaria Parasite Image Database for Image Processing and Analysis
FULL DATASET DESCRIPTION:
==================================
Malaria is caused by protozoan parasites of the genus Plasmodium that are transmitted through the bites of infected female Anopheles mosquitoes and that infect the red blood cells. Most deaths occur among children in Africa, where a child dies almost every minute from malaria, and where malaria is a leading cause of childhood neuro-disability. Malaria remains a major burden on global health, with roughly 200 million cases worldwide and more than 400,000 deaths per year. Besides biomedical research and political efforts, modern information technology is playing a key role in many attempts at fighting the disease. One of the barriers to a successful mortality reduction has been inadequate malaria diagnosis in particular. To improve diagnosis, image analysis software and machine learning methods have been used to quantify parasitemia in microscopic blood slides.
There are 5 Plasmodium species that cause malaria in humans: Plasmodium falciparum, Plasmodium vivax, Plasmodium malariae, Plasmodium ovale, and Plasmodium knowlesi. The 2 most common species are P. falciparum and P. vivax. P. falciparum causes the most severe form and is responsible for most malaria-related deaths globally. P. knowlesi is rarely found.
DATASET DOWNLOAD INFORMATION
==================================
Dataset Size: Download dataset as zip (38 MB)
Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/saife245/malaria-plasmodium-vivax-specie
Additional information:
==================================
File count not found
Views: 8,932
Downloads: 460
RELATED NOTEBOOKS:
==================================
1. Malaria Bounding Boxes | Upvotes: 132
URL: https://www.kaggle.com/datasets/kmader/malaria-bounding-boxes
2. Malaria Detection Dataset | Upvotes: 33
URL: https://www.kaggle.com/datasets/orvile/p-vivax-malaria-infected-human-blood-smears
3. Images preprocessing | Upvotes: 16
URL: https://www.kaggle.com/code/mohamedsalemmohamed/images-preprocessing
4. MP-IDB-YOLO: YOLO-Formatted MP-IDB Malaria Dataset | Upvotes: 4
URL: https://www.kaggle.com/datasets/rayhanadi/yolo-formatted-mp-idb-malaria-dataset
5. Starter: Malaria: Plasmodium Vivax 66f0ab62-b | Upvotes: 3
URL: https://www.kaggle.com/code/kerneler/starter-malaria-plasmodium-vivax-66f0ab62-b
==================================
By: https://t.me/datasets1
Dataset Name: Used Cars Dataset
Basic Description: Vehicles listings from Craigslist.org
FULL DATASET DESCRIPTION:
==================================
Craigslist is the world's largest collection of used vehicles for sale, yet it's very difficult to collect all of them in the same place. I built a scraper for a school project and expanded upon it later to create this dataset which includes every used vehicle entry within the United States on Craigslist.
This data is scraped every few months. It contains almost all of the relevant information that Craigslist provides on car sales, including columns such as price, condition, manufacturer, latitude/longitude, and 18 other categories. For ML projects, consider feature engineering on location columns such as long/lat. For previous listings, check older versions of the dataset.
See https://github.com/AustinReese/UsedVehicleSearch
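As one example of feature engineering on the location columns, the sketch below computes the great-circle (haversine) distance from each listing to a reference point. It is a minimal sketch: the column names lat and long, the helper names, and the reference coordinates are assumptions for illustration, and a spherical Earth of radius 6371 km is assumed.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius; spherical approximation


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def add_distance_feature(listings, ref_lat, ref_lon):
    """Return copies of the listings with a 'dist_to_ref_km' field added.

    Listings with a missing coordinate get None instead of a distance.
    """
    enriched = []
    for row in listings:
        lat, lon = row.get("lat"), row.get("long")
        if lat is None or lon is None:
            dist = None
        else:
            dist = haversine_km(float(lat), float(lon), ref_lat, ref_lon)
        enriched.append({**row, "dist_to_ref_km": dist})
    return enriched


# Hypothetical listings and reference point, for illustration only.
listings = [
    {"price": 8500, "lat": 40.0, "long": -75.0},
    {"price": 12000, "lat": None, "long": -80.0},
]
for row in add_distance_feature(listings, ref_lat=40.0, ref_lon=-74.0):
    print(row)
```

A distance to the nearest large city or to a regional centre is one option; which reference points are useful depends on the modelling task.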
DATASET DOWNLOAD INFORMATION
==================================
Dataset Size: Download dataset as zip (275 MB)
Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/austinreese/craigslist-carstrucks-data
Additional information:
==================================
File count not found
Views: 721,000
Downloads: 103,000
RELATED NOTEBOOKS:
==================================
1. Automatic Number Plate Recognition | Upvotes: 2,295
URL: https://www.kaggle.com/code/aslanahmedov/automatic-number-plate-recognition
2. Automatic Number Plate Recognition | Upvotes: 419
URL: https://www.kaggle.com/code/mohamedbhy/automatic-number-plate-recognition
3. Used Cars Price Prediction by 15 models | Upvotes: 355
URL: https://www.kaggle.com/code/vbmokin/used-cars-price-prediction-by-15-models
4. USA Housing Listings | Upvotes: 75
URL: https://www.kaggle.com/datasets/austinreese/usa-housing-listings
5. 1.2 Million Used Car Listings | Upvotes: 61
URL: https://www.kaggle.com/datasets/jpayne/852k-used-car-listings
==================================
By: https://t.me/datasets1
Dataset Name: Animals-10
Basic Description: Animal pictures of 10 different categories taken from Google Images
FULL DATASET DESCRIPTION:
==================================
Hello everyone!
This is the dataset I have used for my matriculation thesis.
It contains about 28K medium quality animal images belonging to 10 categories: dog, cat, horse, spider, butterfly, chicken, sheep, cow, squirrel, elephant.
I have used it to test different image recognition networks: from homemade CNNs (~80% accuracy) to Google Inception (98%). It could simulate a smart gallery for a researcher (like a biologist).
All the images have been collected from Google Images and have been checked by a human. There is some erroneous data to simulate real conditions (e.g. images taken by users of your app).
The main directory is divided into folders, one for each category. Image count for each category varies from 2K to 5K.
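Because the main directory holds one folder per category, the per-class image counts can be checked with a short script. The sketch below is a minimal example using only the Python standard library; the function name count_images_per_class and the set of accepted file extensions are assumptions, and the root path is whatever folder the zip was extracted to.

```python
from pathlib import Path

# Assumed set of image extensions; adjust after inspecting the extracted files.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def count_images_per_class(root):
    """Map each immediate subfolder name to the number of image files it holds."""
    counts = {}
    for class_dir in sorted(Path(root).iterdir()):
        if not class_dir.is_dir():
            continue
        counts[class_dir.name] = sum(
            1
            for path in class_dir.iterdir()
            if path.is_file() and path.suffix.lower() in IMAGE_SUFFIXES
        )
    return counts


if __name__ == "__main__":
    import sys

    # Usage: python count_images.py <path-to-extracted-dataset>
    if len(sys.argv) > 1:
        for name, n in count_images_per_class(sys.argv[1]).items():
            print(f"{name}: {n}")
```

Since the class sizes differ by more than a factor of two, these counts can inform a stratified split or class weighting.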
DATASET DOWNLOAD INFORMATION
==================================
Dataset Size: Download dataset as zip (614 MB)
Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/alessiocorrado99/animals10
Additional information:
==================================
Total files: 26,200
Views: 424,000
Downloads: 94,600
RELATED NOTEBOOKS:
==================================
1. Animal Image Classification using EfficientNetB7 | Upvotes: 645
URL: https://www.kaggle.com/code/vencerlanz09/animal-image-classification-using-efficientnetb7
2. animal classification | Upvotes: 543
URL: https://www.kaggle.com/code/min4tozaki/animal-classification
3. Grad-CAM: What do CNNs see ? | Upvotes: 229
URL: https://www.kaggle.com/code/quadeer15sh/grad-cam-what-do-cnns-see
4. Animals Detection Images Dataset | Upvotes: 179
URL: https://www.kaggle.com/datasets/antoreepjana/animals-detection-images-dataset
5. Animals-10 | Upvotes: 17
URL: https://www.kaggle.com/datasets/viratkothari/animal10
==================================
By: https://t.me/datasets1
Dataset Name: Malaria Bounding Boxes
Basic Description: P. vivax (malaria) infected human blood smears
FULL DATASET DESCRIPTION:
==================================
Malaria is a disease caused by Plasmodium parasites that remains a major threat in global health, affecting 200 million people and causing 400,000 deaths a year. The main species of malaria that affect humans are Plasmodium falciparum and Plasmodium vivax.
For malaria as well as other microbial infections, manual inspection of thick and thin blood smears by trained microscopists remains the gold standard for parasite detection and stage determination because of its low reagent and instrument cost and high flexibility. Despite manual inspection being extremely low throughput and susceptible to human bias, automatic counting software remains largely unused because of the wide range of variations in brightfield microscopy images. However, a robust automatic counting and cell classification solution would provide enormous benefits due to faster and more accurate quantitative results without human variability; researchers and medical professionals could better characterize stage-specific drug targets and better quantify patient reactions to drugs.
Previous attempts to automate the process of identifying and quantifying malaria have not gained major traction, partly due to the difficulty of replication, comparison, and extension. Authors also rarely make their image sets available, which precludes replication of results and assessment of potential improvements. The lack of a standard set of images and of a standard set of metrics for reporting results has impeded the field.
Images are in .png or .jpg format. There are 3 sets of images consisting of 1364 images (~80,000 cells) with different researchers having prepared each one: from Brazil (Stefanie Lopes), from Southeast Asia (Benoit Malleret), and time course (Gabriel Rangel). Blood smears were stained with Giemsa reagent.
DATASET DOWNLOAD INFORMATION
==================================
Dataset Size: Download dataset as zip (5 GB)
Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/kmader/malaria-bounding-boxes
Additional information:
==================================
File count not found
Views: 54,400
Downloads: 4,657
RELATED NOTEBOOKS:
==================================
1. Malaria | YoloV5 | FasterRCNN | Upvotes: 114
URL: https://www.kaggle.com/code/polomarco/malaria-yolov5-fasterrcnn
2. Malaria Bounding Box | Upvotes: 93
URL: https://www.kaggle.com/code/vishnu123/malaria-bounding-box
3. Malaria Preview | Upvotes: 32
URL: https://www.kaggle.com/code/kmader/malaria-preview
4. Malaria Cell Images(Shuffled and Split) | Upvotes: 6
URL: https://www.kaggle.com/datasets/sagnikmazumder37/malaria-cell-imagesshuffled-and-split
5. P.Vivax malaria image dataset | Upvotes: 4
URL: https://www.kaggle.com/datasets/jxxn03x/p-vivax-malaria-image-dataset
==================================
By: https://t.me/datasets1
Basic Description: P. vivax (malaria) infected human blood smears
π FULL DATASET DESCRIPTION:
==================================
Malaria is a disease caused by Plasmodium parasites that remains a major threat in global health, affecting 200 million people and causing 400,000 deaths a year. The main species of malaria that affect humans are Plasmodium falciparum and Plasmodium vivax.
For malaria as well as other microbial infections, manual inspection of thick and thin blood smears by trained microscopists remains the gold standard for parasite detection and stage determination because of its low reagent and instrument cost and high flexibility. Despite manual inspection being extremely low throughput and susceptible to human bias, automatic counting software remains largely unused because of the wide range of variations in brightfield microscopy images. However, a robust automatic counting and cell classification solution would provide enormous benefits due to faster and more accurate quantitative results without human variability; researchers and medical professionals could better characterize stage-specific drug targets and better quantify patient reactions to drugs.
Previous attempts to automate the identification and quantification of malaria have not gained major traction, partly because they are difficult to replicate, compare, and extend. Authors also rarely make their image sets available, which precludes replication of results and assessment of potential improvements. The lack of a standard set of images and of a standard set of metrics for reporting results has impeded the field.
Images are in .png or .jpg format. There are 3 sets of images, totaling 1364 images (~80,000 cells), each prepared by a different researcher: one from Brazil (Stefanie Lopes), one from Southeast Asia (Benoit Malleret), and a time course (Gabriel Rangel). Blood smears were stained with Giemsa reagent.
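As a rough illustration of working with the image sets described above, the following sketch counts the .png and .jpg files under an extracted copy of the dataset. The function name count_images and the folder name in the usage comment are assumptions made for illustration; the folder layout inside the archive is not described here.

```python
from collections import Counter
from pathlib import Path

# The description only mentions these two formats.
IMAGE_SUFFIXES = {".png", ".jpg"}


def count_images(root):
    """Count image files below `root`, grouped by lowercased file suffix."""
    counts = Counter()
    for path in Path(root).rglob("*"):
        suffix = path.suffix.lower()
        if path.is_file() and suffix in IMAGE_SUFFIXES:
            counts[suffix] += 1
    return counts


# Usage (hypothetical folder name, adjust to wherever the zip was extracted):
#   counts = count_images("malaria-bounding-boxes")
#   print(counts, sum(counts.values()))
```

If the total differs from the 1364 images stated in the description, the archive may contain extra files or a different folder structure than assumed.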
DATASET DOWNLOAD INFORMATION
==================================
Dataset Size: 5 GB (zip)
Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/kmader/malaria-bounding-boxes
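The direct link above follows the same path layout as the other download links in this listing (api/v1/datasets/download/<owner>/<dataset>). A minimal sketch, assuming that layout holds, splits such a link into its owner and dataset name and rebuilds the dataset page address in the form used by the dataset URLs elsewhere in this listing. The function names are hypothetical, and nothing is downloaded.

```python
from typing import Tuple
from urllib.parse import urlparse


def parse_download_url(url: str) -> Tuple[str, str]:
    """Return (owner, dataset) from a direct dataset download link."""
    parts = urlparse(url).path.strip("/").split("/")
    # Assumed layout: api/v1/datasets/download/<owner>/<dataset>
    if len(parts) != 6 or parts[:4] != ["api", "v1", "datasets", "download"]:
        raise ValueError(f"not a direct dataset download link: {url}")
    return parts[4], parts[5]


def dataset_page_url(url: str) -> str:
    """Build the dataset page address for a direct download link."""
    owner, dataset = parse_download_url(url)
    return f"https://www.kaggle.com/datasets/{owner}/{dataset}"
```

Keeping the parsing separate from any download step makes it easy to check a link before fetching a multi-gigabyte archive.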
==================================
By: https://t.me/datasets1
Dataset Name: COVID-19 Dataset
Basic Description: Number of Confirmed, Death and Recovered cases every day across the globe
DATASET DOWNLOAD INFORMATION
==================================
Dataset Size: 20 MB (zip)
Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/imdevskp/corona-virus-report
Additional information:
==================================
Downloads: 387,000
RELATED NOTEBOOKS:
==================================
1. COVID-19 - Analysis, Visualization & Comparisons | Upvotes: 7,330
URL: https://www.kaggle.com/code/imdevskp/covid-19-analysis-visualization-comparisons
2. Novel Corona Virus 2019 Dataset | Upvotes: 6,275
URL: https://www.kaggle.com/datasets/sudalairajkumar/novel-corona-virus-2019-dataset
3. COVID-19: Digging a Bit Deeper | Upvotes: 883
URL: https://www.kaggle.com/code/abhinand05/covid-19-digging-a-bit-deeper
4. COVID-19 - Temperature, Air Travel & Transmission | Upvotes: 489
URL: https://www.kaggle.com/code/sixteenpython/covid-19-temperature-air-travel-transmission
5. COVID-19 Coronavirus Dataset | Upvotes: 47
URL: https://www.kaggle.com/datasets/vignesh1694/covid19-coronavirus
==================================
By: https://t.me/datasets1
Dataset Name: FIFA23 OFFICIAL DATASET
Basic Description: From FIFA17 to FIFA23 statistics for each football player
FULL DATASET DESCRIPTION:
==================================
The dataset contains more than 17,000 unique players and more than 60 columns, covering general information and all the KPIs the famous video game offers. As the esports scene keeps growing, especially around FIFA, I thought it could be useful for the community (Kagglers and/or gamers).
DATASET DOWNLOAD INFORMATION
==================================
Dataset Size: 14 MB (zip)
Direct dataset download link:
https://www.kaggle.com/api/v1/datasets/download/bryanb/fifa-player-stats-database
Additional information:
==================================
Views: 107,000
Downloads: 66,500
RELATED NOTEBOOKS:
==================================
1. FIFA 22 complete player dataset | Upvotes: 419
URL: https://www.kaggle.com/datasets/stefanoleone992/fifa-22-complete-player-dataset
2. FIFA 21 complete player dataset | Upvotes: 197
URL: https://www.kaggle.com/datasets/stefanoleone992/fifa-21-complete-player-dataset
3. FIFA 22 Players Dataset | Upvotes: 42
URL: https://www.kaggle.com/datasets/minhnguyen147/fifa-22-players-dataset
4. FIFA 21 Player Comparison using Radar Chart | Upvotes: 33
URL: https://www.kaggle.com/code/abhijithchandradas/fifa-21-player-comparison-using-radar-chart
5. Using FIFA to Predict FPL score | Upvotes: 20
URL: https://www.kaggle.com/code/scientistdat/using-fifa-to-predict-fpl-score
==================================
By: https://t.me/datasets1