Data are transformed or consolidated into forms appropriate for mining
Data transformation can involve the following:
Smoothing
Removes noise from the data
Techniques include binning, clustering and regression
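As a rough illustration of smoothing by binning, the sketch below sorts the values, splits them into equal-frequency bins, and replaces each value with the mean of its bin. The function name `smooth_by_bin_means`, the bin size, and the sample prices are assumptions made up for this example, and bin means are only one of several possible binning variants.

```python
def smooth_by_bin_means(values, bin_size):
    """Sort the values, split them into bins of `bin_size` items,
    and replace every value with the mean of its bin."""
    ordered = sorted(values)
    smoothed = []
    for start in range(0, len(ordered), bin_size):
        bin_values = ordered[start:start + bin_size]
        mean = sum(bin_values) / len(bin_values)
        # Every member of the bin takes the bin mean as its smoothed value.
        smoothed.extend([mean] * len(bin_values))
    return smoothed


# Made-up sample data, used only to show the effect.
prices = [4, 8, 15, 21, 21, 24, 25, 28, 34]
print(smooth_by_bin_means(prices, 3))
# [9.0, 9.0, 9.0, 22.0, 22.0, 22.0, 29.0, 29.0, 29.0]
```

Larger bins smooth more aggressively, at the cost of discarding more of the original detail.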
Aggregation
Summary or aggregation operations are applied to the data
Typically used in constructing a data cube, e.g. daily sales aggregated into monthly and annual totals
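One way to picture aggregation is as a roll-up along the time dimension. The sketch below sums daily sales into monthly and annual totals; the `roll_up` helper, the `YYYY-MM-DD` date strings, and the sales figures are all hypothetical, chosen only to keep the example self-contained.

```python
from collections import defaultdict

# Made-up daily sales records: (date as 'YYYY-MM-DD', amount).
daily_sales = [
    ("2023-01-05", 120.0),
    ("2023-01-20", 80.0),
    ("2023-02-11", 200.0),
    ("2024-01-03", 50.0),
]


def roll_up(records, key_length):
    """Sum amounts grouped by the first `key_length` characters of the date:
    7 gives 'YYYY-MM' (monthly), 4 gives 'YYYY' (annual)."""
    totals = defaultdict(float)
    for date, amount in records:
        totals[date[:key_length]] += amount
    return dict(totals)


monthly = roll_up(daily_sales, 7)
annual = roll_up(daily_sales, 4)
print(monthly)  # {'2023-01': 200.0, '2023-02': 200.0, '2024-01': 50.0}
print(annual)   # {'2023': 400.0, '2024': 50.0}
```

Each coarser level here corresponds to one granularity at which a data cube could be queried.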
Generalization
Replaces low-level (primitive) data with higher-level concepts through the use of concept hierarchies
Ex: numeric age values generalized to young, middle-aged, or senior
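The age example can be sketched as a one-level concept hierarchy that maps each numeric age to a label. The cutoffs used here (35 and 60) and the function name `generalize_age` are assumptions for illustration; the notes do not specify where the boundaries lie.

```python
def generalize_age(age):
    """Map a numeric age to a higher-level concept.
    The cutoffs 35 and 60 are assumed for this example only."""
    if age < 0:
        raise ValueError("age must be non-negative")
    if age < 35:
        return "young"
    if age < 60:
        return "middle-aged"
    return "senior"


# Made-up ages, used only to show the mapping.
ages = [23, 41, 67, 35]
labels = [generalize_age(a) for a in ages]
print(labels)  # ['young', 'middle-aged', 'senior', 'middle-aged']
```

After generalization, mining operates on three distinct values instead of many individual ages, which tends to make patterns easier to find at the price of lost detail.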