Data preprocessing is one of the important and prerequisite phases in WUM. This paper presents a brief introduction to WUM, apart from the data mining technologies and also the implementation of the preprocessing of web log files in NASA’s web server. This study focuses on methods that can be used for the task of session identification from web log file. The work in this study also produces statistical information of user session.
After preprocessing completed, the result will be used for mining user access pattern, The future work involves various data transformation tasks that are likely to influence the quality of the discovered patterns resulting from the mining techniques like Association, Clustering, and Classification can be applied only on to the group of sessions according to assumptions of users’ intentions.