#imbalancedData
What is it?
Ans-> Suppose, you are having a Classification problem with 2M records. The Output variable is having 2 categories (Yes- 500, No- 1.99M or more).
This is the imbalanced data, as one category is far less than the other category in the Output variable.
Examples-> Credit Card fraud, Cancer Detection(or any other disease that is severe), and many more.
How to deal with it?
1) Undersampling
2) Oversampling
#datascience #dataanalysis #learning
✴️ @AI_Python_EN
What is it?
Ans-> Suppose, you are having a Classification problem with 2M records. The Output variable is having 2 categories (Yes- 500, No- 1.99M or more).
This is the imbalanced data, as one category is far less than the other category in the Output variable.
Examples-> Credit Card fraud, Cancer Detection(or any other disease that is severe), and many more.
How to deal with it?
1) Undersampling
2) Oversampling
#datascience #dataanalysis #learning
✴️ @AI_Python_EN