IamPython – Telegram

IamPython

290 subscribers

148 photos

13 videos

8 files

195 links

This is Python based telegram group for web developers, Artificial intelligence, webscraping, Datascience, Data analysis, Ethical Hacking and more. You will learn lot insights and useful information

Download Telegram

About

Blog

Apps

Platform

290 subscribers

IBM Cloud
◦ Cloud Functions - 5 million executions per month
◦ Object Storage - 25GB per month
◦ Cloudant database - 1 GB of data storage
◦ Db2 database - 100MB of data storage
◦ API Connect - 50,000 API calls per month
◦ Availability Monitoring - 3 million data points per month
◦ Log Analysis - 500MB of daily log

345 views18:39

Oracle Cloud
◦ Compute - 2 VM.Standard.E2.1.Micro 1GB RAM
◦ Block Volume - 2 volumes, 100 GB total (used for compute)
◦ Object Storage - 10 GB
◦ Load balancer - 1 instance with 10 Mbps
◦ Databases - 2 DBs, 20 GB each
◦ Monitoring - 500 million ingestion datapoints, 1 billion retrieval datapoints
◦ Bandwidth - 10TB egress per month
◦ Notifications - 1 million delivery options per month, 1000 emails sent per month

357 views18:40

PyTorch 1.8 Release with native AMD support!

281 views13:40

- 611 datasets you can download in one line of python
- 467 languages covered, 99 with at least 10 datasets
- efficient pre-processing to free you from memory constraints

https://github.com/huggingface/datasets

GitHub - huggingface/datasets: 🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data…

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools - huggingface/datasets

288 views15:52

Q: Shall we remove the duplicate records (i.e. records with exactly the same features) from the dataset before training an ML model?

A: It depends. If the duplicated records belong to a single instance/event (e.g. when one instance is captured twice), they should be removed. For example, by looking at the customer_IDs, we may notice some of the customers are duplicated in our data. In this case, we should deduplicate. Otherwise, the ML model cannot estimate the prior probability distribution correctly.

On the other hand, if the records with the same features belong to different instances/events, we should keep them. For example, if two customers have the same age, sex, balance, and etc, their data should be used to train the model.

To have a better understanding, consider a Naive Bayes model for a classification problem. By removing the samples with the same features, the model misestimates the prior probabilities that eventually affects the output.

Intuitively, the model needs to know the frequency/distribution of those duplicated records.

306 views18:08

Automated workplace #ergonomics assessment using motion capture to remove risk factors that lead to musculoskeletal injuries (#MSD) and to help human performance and #productivity. Ergo simulation software by Nawo Solution & Pierre FOUBERT Wilo Group

292 views18:19

This media is not supported in your browser

VIEW IN TELEGRAM

300 views18:19

382 views07:13

Stats for Datascience and Machine Learning

379 views07:13

Actually, here are some sites where I have found some of the highest quality, free machine learning educational content:

🔹GitHub
🔹Kaggle
🔹Coursera
🔹YouTube
🔹Papers with Code
🔹fast.ai
🔹PyImageSearch
🔹Machine Learning Mastery
🔹Wikipedia

391 views16:40

276 views18:14

In the last 10 years, AI-related PhDs have gone from 14.2% of the total of CS PhDs granted in the U.S. to around 23% as of 2019, according to the CRA survey. At the same time, other previously popular CS PhDs have declined in popularity, including networking, software engineering, and programming

282 views18:15

GenoML: Automated Machine Learning for Genomics
pdf: arxiv.org/pdf/2103.03221…
abs: arxiv.org/abs/2103.03221
project page: genoml.com

285 views08:30

How to Automate Exploratory Data Analysis (EDA) ? - Part 1 https://youtu.be/tMquUTJ6yXU

You should know when you want to expedite data analysis 🧐 I strongly recommend you to use in your real world problems. This module will help you a lot

Automate Exploratory Data Analysis (EDA) #Part 1

EDA is performed to visualize what data is telling us before implementing any formal modelling or creating a hypothesis testing model. There are some analysi...

307 views15:37

This website will help you learn probability and statistics, the most important topics in math for machine learning!

seeing-theory.brown.edu

Don’t forget to add in bookmarks 🔖

299 views06:50

389 views07:01

Channel name was changed to «Python Developers / Machine Learning / DataScience / AI»

11:31

Deep learning activation functions made cool and cute

280 views02:01

Learning path to mastering data engineering:

🔸 SQL
🔸 Git
🔸 Bash
🔸 PostgreSQL
🔸 Java, Scala
🔸 Python
🔸 Docker
🔸 AWS
🔸 Airflow
🔸 Kafka
🔸 Spark
🔸 Kubernetes

298 views02:02