AI, Python, Cognitive Neuroscience
3.83K subscribers
1.09K photos
46 videos
78 files
891 links
Download Telegram
#imbalancedData
What is it?
Ans-> Suppose, you are having a Classification problem with 2M records. The Output variable is having 2 categories (Yes- 500, No- 1.99M or more).

This is the imbalanced data, as one category is far less than the other category in the Output variable.

Examples-> Credit Card fraud, Cancer Detection(or any other disease that is severe), and many more.

How to deal with it?
1) Undersampling
2) Oversampling

#datascience #dataanalysis #learning

✴️ @AI_Python_EN
Great Statistical software for Beginners.

Here is the Gretl Tutorial by Simone Gasperin

1)Simple Linear Regression
https://lnkd.in/ecfsV9c

2)Coding Dummy Variables
https://lnkd.in/ef7Yd7f

3)Forecasting New Observations
https://lnkd.in/eNKbxbU

4)Forecasting a Large Number of Observations
https://lnkd.in/eHmibGs

5)Logistic Regression
https://lnkd.in/eRfhQ87

6)Forecasting and Confusion Matrix
https://lnkd.in/eaqrFJr

7)Modeling and Forecasting Time Series Data
https://lnkd.in/e6fqKpF

8)Comparing Time Series Trend Models
https://lnkd.in/eKjEUAE

#datascience #machinelearning #statistics #dataanalytics #dataanalysis

✴️ @AI_Python_EN