To evaluate our approach, we use top-k accuracy and Mean Reciprocal Rank (MRR). These metrics are commonly used in recommendation systems for software engineering [28, 39, 40]. Since most reviews have only one code-reviewer (cf. Table I), evaluation metrics that consider all of the correct answers (e.g., Mean Average Precision) might not be appropriate for this evaluation.
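As an illustration, the two metrics can be sketched as follows. This is a minimal sketch under the commonly used definitions, which we assume apply here: a review counts as a hit for top-k accuracy if any of its actual code-reviewers appears among the first k recommendations, and its reciprocal rank is 1 divided by the position of the first correct recommendation (0 if none is recommended). The function names and the toy data are hypothetical and are not taken from our dataset.

```python
from typing import Sequence, Set


def top_k_accuracy(rankings: Sequence[Sequence[str]],
                   truths: Sequence[Set[str]],
                   k: int) -> float:
    """Fraction of reviews with at least one actual reviewer in the top k."""
    hits = 0
    for ranked, truth in zip(rankings, truths):
        # A review is a hit if any of the first k candidates is correct.
        if any(candidate in truth for candidate in ranked[:k]):
            hits += 1
    return hits / len(rankings)


def mean_reciprocal_rank(rankings: Sequence[Sequence[str]],
                         truths: Sequence[Set[str]]) -> float:
    """Average of 1/rank of the first correct candidate (0 if absent)."""
    total = 0.0
    for ranked, truth in zip(rankings, truths):
        for position, candidate in enumerate(ranked, start=1):
            if candidate in truth:
                total += 1.0 / position
                break  # only the first correct candidate counts
    return total / len(rankings)


# Hypothetical toy data: three reviews, each with a ranked recommendation
# list and the set of code-reviewers who actually reviewed it.
rankings = [
    ["alice", "bob", "carol"],
    ["bob", "alice", "carol"],
    ["carol", "bob", "alice"],
]
truths = [{"alice"}, {"alice"}, {"dave"}]

top1 = top_k_accuracy(rankings, truths, k=1)
top2 = top_k_accuracy(rankings, truths, k=2)
mrr = mean_reciprocal_rank(rankings, truths)
```

In this toy example, the first review is answered correctly at rank 1, the second at rank 2, and the third not at all, so top-1 accuracy is 1/3, top-2 accuracy is 2/3, and MRR is (1 + 1/2 + 0) / 3 = 0.5.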