The problem
with imbalanced data arises because learning algorithms tend
to overlook less frequent classes (minority classes) and only
pay attention to the most frequent ones (majority classes). As
a result, the classifier obtained will not be able to correctly
classify data instances corresponding to poorly represented
classes.