AI, Python, Cognitive Neuroscience
3.88K subscribers
1.09K photos
47 videos
78 files
893 links
Download Telegram
It is time we shared the dataset with everyone. This is a collection of text from Tamil news articles. Has around 7 millions lines of text, all cleaned up, ready to used for language modelling task, in case anyone want to try. You can use the code from git repo below to get started.

Dataset:
https://lnkd.in/fzg3xyM]
Code:
https://lnkd.in/fezt4M8 #datasets

โœด๏ธ @AI_Python_EN
To much spelling error in your dataset?


Peter Norvig (Research Director at Google, previously director of search quality) revolutionize search engine quality by giving power to reduce spelling error (by splits, deletes, transposes, replaces, and inserts). You can see the comprehensive guide (with python code) at his website https://lnkd.in/fEb3v2a

#python #datasets #codes #statistician

โœด๏ธ @AI_Python_EN
Not everyone knows but my #book has its Github repository where all #Python code used to build illustrations is gathered.

So, while reading the book, you can actually run the described #algorithms, play with hyperparameters and #datasets, and generate your versions of illustrations.

https://github.com/aburkov/theMLbook

โœด๏ธ @AI_Python_EN
image_2019-04-16_16-17-24.png
710.5 KB
Transition guide from Excelโ€™s analyst to Python Programming for Data Analysis

1. From Excel to Pandas https://lnkd.in/fnU5apw
2. Communication & Data Storytelling https://lnkd.in/eqf5gUV
3. Data Manipulation with Python https://lnkd.in/g4DFNpJ
4. Data Visualization with Python (Matplotlib/Seaborn): https://lnkd.in/g_3fx_6
5. Advanced Pandas https://lnkd.in/fZWGp9B
6. Tricks on Pandas by Real Python https://lnkd.in/fXc9XSp
7. Becoming Efficient with Pandas https://lnkd.in/f64hU-Y
8. Pandas Advances Tips https://lnkd.in/fGyBc4c
9. Jupyter Notebook (Beginner) https://lnkd.in/fTFinFi
10. Jupyter Notebook (Advanced) https://lnkd.in/fFufePv

#datavisualization #python #programming #pydata #datasets #pandas #datasets

โœด๏ธ @AI_Python_EN
Tools like #PyTorch, fast.ai, and open source #datasets are making deep learning faster and more accessible. Learn how one #ML hobbyist used these resources to train a convolutional neural network that can classify gastrointestinal images.

๐ŸŒŽ Link

โœด๏ธ @AI_Python
25 Excellent #MachineLearning Open #Datasets

๐ŸŒŽ more learn


โœด๏ธ @AI_Python_EN
#Statistics such as correlation, mean and standard deviation (variance) create strong visual images and meaning. Two different #datasets with the same correlation would sort of look the same. Right?

Not so much.

Each of these very different-looking graphs are plotting datasets with the same correlation, mean and SD. This is why plotting data is so important though oddly so rarely (in my expereince) done.

https://bit.ly/2oZ29MP

โœด๏ธ @AI_Python_EN
Another lovely development in #Healthcare #DeepLearning

Building a Benchmark Dataset and Classifiers for Sentence-Level Findings in AP Chest X-rays.

#datasets
Arxiv: https://lnkd.in/dxx5iCY

โœด๏ธ @AI_Python_EN
Google announced the updated YouTube-8M dataset

Updated set now includes a subset with verified 5-s segment level labels, along with the 3rd Large-Scale Video Understanding Challenge and Workshop at #ICCV19.

Link: https://lnkd.in/f_6Jb7Y

#DL #datasets

โœด๏ธ @AI_Python_EN
Public datasets: weather and climate Google Cloudโ€™s Public Datasets Program :
https://lnkd.in/edhe7wj

#ArtificialIntelligence #Datasets

โœด @AI_Python_EN