Such an assumption and the consequent selection of neighbour can be sub-optimal This may lead to an inaccurate estimation, especially for those target genes situate by the edge of a cluster. The problem can be resolved by bringing information regarding data clusters into the imputation process, which is the basis of cluster-based KNN impute (CKNN) [11].