After raw data is collected data is preprocessed. We do preprocessing
to make data suitable for data mining. It may include
cleaning of data, transformation of variables, selection of
features and data balancing. When data is collected it may have
some incomplete records, inconsistent values. Pre-processing
enable us to get data in a form on which data mining and
analysis can be done efficiently