For empirical software studies, data collection is problematic, in part because, as noted above, software measures are not well-defined. Kitchenham et al. [22] discuss many of the problems with data collection; they suggest several standards for defining and using software measures. From the perspectives of design and data collection, the non-standard nature of software measures makes it difficult for researchers to replicate studies or to