Therefore,
some preprocessing is needed to remove noisy data from
the Web log files before pattern discovery techniques are
applied to find relationships within the log data. Apart
from the volume of the data and its low
quality, the data is not completely structured. It is in a
semi-structured format and therefore needs considerable
preprocessing and parsing before the required information
can actually be extracted.
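
The parsing and noise-reduction step described above can be illustrated with a minimal sketch. It is not the method of this paper. It assumes that the log follows the NCSA Common Log Format, and that "noise" means requests for embedded resources, non-GET requests, and unsuccessful responses. The names LOG_PATTERN, NOISE_SUFFIXES, parse_line, is_noise, and preprocess are hypothetical and introduced only for illustration.

```python
import re
from typing import Iterable, List, Optional

# Assumption: entries follow the NCSA Common Log Format, e.g.
# host ident user [timestamp] "METHOD url PROTOCOL" status size
LOG_PATTERN = re.compile(
    r'(?P<host>\S+) \S+ (?P<user>\S+) \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<url>\S+)(?: (?P<protocol>[^"]*))?" '
    r'(?P<status>\d{3}) (?P<size>\d+|-)'
)

# Assumption: requests for these embedded resources count as noise.
NOISE_SUFFIXES = (".gif", ".jpg", ".jpeg", ".png", ".css", ".js", ".ico")


def parse_line(line: str) -> Optional[dict]:
    """Turn one semi-structured log line into a dict of named fields.

    Returns None when the line does not match the assumed format.
    """
    match = LOG_PATTERN.match(line)
    return match.groupdict() if match else None


def is_noise(entry: dict) -> bool:
    """Flag entries that are unlikely to reflect a deliberate page view."""
    path = entry["url"].split("?", 1)[0].lower()  # ignore the query string
    return (
        path.endswith(NOISE_SUFFIXES)
        or entry["method"] != "GET"
        or not entry["status"].startswith("2")
    )


def preprocess(lines: Iterable[str]) -> List[dict]:
    """Parse every line, then drop malformed and noisy entries."""
    cleaned = []
    for line in lines:
        entry = parse_line(line.strip())
        if entry is not None and not is_noise(entry):
            cleaned.append(entry)
    return cleaned
```

Calling preprocess on the lines of a log file returns only the structured entries that survive the filters. These entries could then be passed to a pattern discovery step. The filter criteria are a design choice and would need to be adapted to the actual log format and site.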