We also study which classification algorithms perform
the best on classifying UML CDs. To do that, we calculate
and compare their classification performance based on the
two measures specificity and sensitivity. Amongst these two
measures, specificity is in our case considered more important
than sensitivity. Logistic Regression is found to produce
the highest correct predication rate, at 91.4%, on identifying
non-UML CDs.