5.1. Data preparation
The dataset used for this study was obtained from the Mathematics Department in the College of Sciences. It
included the records of mathematics graduate students from 2008 to 2014. The Dean of Admissions and Registration
granted the study official approval to copy these records. The main problems with the dataset were that the data
were stored in a format that was difficult to interpret, were spread across multiple files, and were in Arabic. In
addition, the data included many irrelevant courses and unnecessary details such as the graduation year.
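
The clean-up implied by these problems (merging several files, translating Arabic headers, and discarding irrelevant courses and fields) could be sketched as follows. This is only an illustrative sketch, not the procedure used in the study: it assumes the records can be exported as CSV, and the column names, the header mapping, and the course codes are all hypothetical.

```python
import csv
import io

# Hypothetical mapping from Arabic column headers to English names.
HEADER_MAP = {
    "رقم الطالب": "student_id",
    "المقرر": "course",
    "الدرجة": "grade",
    "سنة التخرج": "graduation_year",
}

# Hypothetical choices: fields to discard and course codes to keep.
DROP_COLUMNS = {"graduation_year"}
RELEVANT_COURSES = {"MATH101", "MATH202"}


def load_records(files):
    """Merge rows from several CSV file objects into one cleaned list."""
    records = []
    for f in files:
        for row in csv.DictReader(f):
            # Translate known headers; leave unknown ones unchanged.
            row = {HEADER_MAP.get(k, k): v for k, v in row.items()}
            if row.get("course") not in RELEVANT_COURSES:
                continue  # skip irrelevant courses
            records.append(
                {k: v for k, v in row.items() if k not in DROP_COLUMNS}
            )
    return records


# Two small in-memory "files" standing in for the original exports.
header = ",".join(HEADER_MAP)
file_a = io.StringIO(header + "\n1,MATH101,A,2012\n1,ARAB100,B,2012\n")
file_b = io.StringIO(header + "\n2,MATH202,C,2014\n")

cleaned = load_records([file_a, file_b])
for record in cleaned:
    print(record)
```

In this sketch the unrelated course (the made-up code ARAB100) and the graduation year are dropped, leaving one consolidated list of records with English field names.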