It is defined as a process to transform the source data into a suitable form, which can be used for
querying and data analysis. Transformation may include cleaning of the data and perform lookup
operations to create lookup tables as well as transformation of data from one data type to the
other. For instance, in this application the only manual transformation required is the
transformation of the Excel files into XML files. Since the data is represented in the semistructured
format in XML files, no transformation for any attributes is required