AI, Python, Cognitive Neuroscience
#imbalancedData
What is it?
Ans-> Suppose you have a classification problem with 2M records, and the output variable has 2 categories (Yes: 500, No: the remaining ~1.9995M).

This is imbalanced data: one category of the output variable has far fewer records than the other.

Examples-> Credit card fraud, cancer detection (or detection of any other severe disease), and many more.

How to deal with it?
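1) Undersampling (drop records from the majority category)
2) Oversampling (duplicate or synthesize records for the minority category)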
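
A minimal sketch of both ideas, using only the Python standard library. The toy data (5 "Yes" vs 19,995 "No", the same 1:3999 ratio as the example above) and the helper names `group_by_class`, `random_undersample`, `random_oversample` and `get_label` are assumptions made for illustration; this shows plain random resampling, not synthetic methods such as SMOTE.

```python
import random
from collections import Counter


def group_by_class(rows, label_of):
    """Split rows into lists keyed by their class label."""
    groups = {}
    for row in rows:
        groups.setdefault(label_of(row), []).append(row)
    return groups


def random_undersample(rows, label_of, seed=0):
    """Randomly drop rows so every class has as many rows as the smallest class."""
    rng = random.Random(seed)
    groups = group_by_class(rows, label_of)
    target = min(len(g) for g in groups.values())
    out = []
    for g in groups.values():
        out.extend(rng.sample(g, target))  # sample without replacement
    rng.shuffle(out)
    return out


def random_oversample(rows, label_of, seed=0):
    """Randomly duplicate rows so every class has as many rows as the largest class."""
    rng = random.Random(seed)
    groups = group_by_class(rows, label_of)
    target = max(len(g) for g in groups.values())
    out = []
    for g in groups.values():
        out.extend(g)  # keep every original row
        out.extend(rng.choices(g, k=target - len(g)))  # add duplicates, drawn with replacement
    rng.shuffle(out)
    return out


def get_label(row):
    """Each toy row is (record_id, label); return the label."""
    return row[1]


# Toy data (assumed): 5 "Yes" and 19,995 "No" records.
data = [(i, "Yes") for i in range(5)] + [(i, "No") for i in range(5, 20_000)]

under = random_undersample(data, get_label)
over = random_oversample(data, get_label)

print(Counter(get_label(r) for r in data))   # original class counts
print(Counter(get_label(r) for r in under))  # 5 rows per class
print(Counter(get_label(r) for r in over))   # 19,995 rows per class
```

Undersampling throws away most of the majority records, while oversampling repeats the same few minority records many times, so resampling is usually applied to the training split only and never to the test data.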

#datascience #dataanalysis #learning

✴️ @AI_Python_EN