This project researched how to retrieve useful information from files containing structured
data in order to produce consolidated reports on campus crimes. The work focused mainly on
implementing business logic in stored procedures that use XQuery to query XML documents.
Further research went into implementing the Apriori algorithm [2] and integrating it with
the results returned from the stored procedures. The role of the Standard Data Source
Template (SDST) also had to be understood well: results had to be expressed in this template
form before being passed to the Apriori algorithm, because the algorithm requires the input data
to be supplied in a transaction format. Another important area of research was making data
retrieval fast and efficient, given the large volumes of data spread across multiple files.
To that end, the XQuery code was written following good coding practices and the stored
procedures were fine-tuned. The web application's user interface had to be responsive so
that it renders on any device, such as a laptop, tablet, or mobile phone, which involved
understanding the jQuery wrapper used to implement it. The ASP.NET charting controls
documentation had to be studied in order to configure how data is displayed on charts. In
addition, the CSC 177 courseware helped me understand the Apriori algorithm as well as
important concepts in data warehousing and data mining.
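To make the transaction-format requirement concrete, the following is a minimal Python sketch of Apriori frequent-itemset mining. It is not the project's implementation, which works on results returned from stored procedures. The function name `apriori`, the `min_support` parameter, and the sample records are hypothetical and invented purely for illustration; each transaction is simply the set of attribute values describing one incident.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support_count} for every itemset that occurs in at
    least `min_support` transactions (an absolute count, not a ratio)."""
    transactions = [frozenset(t) for t in transactions]

    # Level 1: count how many transactions contain each single item.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    current = {s: c for s, c in counts.items() if c >= min_support}
    frequent = dict(current)

    k = 2
    while current:
        candidates = set()
        # Join step: merge pairs of frequent (k-1)-itemsets into k-itemsets.
        for a, b in combinations(list(current), 2):
            union = a | b
            if len(union) != k:
                continue
            # Prune step: every (k-1)-subset of a candidate must be frequent.
            if all(frozenset(sub) in current
                   for sub in combinations(union, k - 1)):
                candidates.add(union)
        # Count step: support of each surviving candidate.
        counts = {c: sum(1 for t in transactions if c <= t)
                  for c in candidates}
        current = {s: c for s, c in counts.items() if c >= min_support}
        frequent.update(current)
        k += 1
    return frequent


# Hypothetical toy data: one transaction per incident, not real crime records.
transactions = [
    ["burglary", "night", "dormitory"],
    ["burglary", "night", "parking_lot"],
    ["theft", "day", "library"],
    ["burglary", "night", "dormitory"],
    ["theft", "night", "dormitory"],
]
frequent = apriori(transactions, min_support=3)
```

With this toy input, the pair `{burglary, night}` is frequent (it appears in three transactions), while `{burglary, dormitory}` is not, so the three-item candidate built from them is pruned without being counted. This is the reason the input must arrive as a list of item sets rather than as aggregated report rows.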
The report is organized as follows. Chapter 1 introduces the project. Chapter 2 gives
background on the basic concepts of data warehousing and data mining, such as the ETL
process and the Apriori algorithm, and on the technology used to develop the application.
Chapter 3 discusses the design of the web-based data analysis tool. Chapter 4 covers the
implementation: the web application architecture, the creation of the database schema, the
development of stored procedures using XQuery, and the implementation of the Apriori
algorithm so that it accepts input data in the SDST format produced by the stored
procedures. Chapter 5 describes the lessons learned, the conclusions reached, and future
enhancements to the application.