#imbalancedData
What is it?
Ans-> Suppose, you are having a Classification problem with 2M records. The Output variable is having 2 categories (Yes- 500, No- 1.99M or more).
This is the imbalanced data, as one category is far less than the other category in the Output variable.
Examples-> Credit Card fraud, Cancer Detection(or any other disease that is severe), and many more.
How to deal with it?
1) Undersampling
2) Oversampling
#datascience #dataanalysis #learning
✴️ @AI_Python_EN
What is it?
Ans-> Suppose, you are having a Classification problem with 2M records. The Output variable is having 2 categories (Yes- 500, No- 1.99M or more).
This is the imbalanced data, as one category is far less than the other category in the Output variable.
Examples-> Credit Card fraud, Cancer Detection(or any other disease that is severe), and many more.
How to deal with it?
1) Undersampling
2) Oversampling
#datascience #dataanalysis #learning
✴️ @AI_Python_EN
Great Statistical software for Beginners.
Here is the Gretl Tutorial by Simone Gasperin
1)Simple Linear Regression
https://lnkd.in/ecfsV9c
2)Coding Dummy Variables
https://lnkd.in/ef7Yd7f
3)Forecasting New Observations
https://lnkd.in/eNKbxbU
4)Forecasting a Large Number of Observations
https://lnkd.in/eHmibGs
5)Logistic Regression
https://lnkd.in/eRfhQ87
6)Forecasting and Confusion Matrix
https://lnkd.in/eaqrFJr
7)Modeling and Forecasting Time Series Data
https://lnkd.in/e6fqKpF
8)Comparing Time Series Trend Models
https://lnkd.in/eKjEUAE
#datascience #machinelearning #statistics #dataanalytics #dataanalysis
✴️ @AI_Python_EN
Here is the Gretl Tutorial by Simone Gasperin
1)Simple Linear Regression
https://lnkd.in/ecfsV9c
2)Coding Dummy Variables
https://lnkd.in/ef7Yd7f
3)Forecasting New Observations
https://lnkd.in/eNKbxbU
4)Forecasting a Large Number of Observations
https://lnkd.in/eHmibGs
5)Logistic Regression
https://lnkd.in/eRfhQ87
6)Forecasting and Confusion Matrix
https://lnkd.in/eaqrFJr
7)Modeling and Forecasting Time Series Data
https://lnkd.in/e6fqKpF
8)Comparing Time Series Trend Models
https://lnkd.in/eKjEUAE
#datascience #machinelearning #statistics #dataanalytics #dataanalysis
✴️ @AI_Python_EN