Conclusions
In this paper, we systematically studied the impact of noise
on missing value imputation methods when noise and missing
values distributed throughout the dataset. By observing
the behavior of the different imputation methods at different
noise levels, we drew the conclusion that noise has
great negative effects on imputation methods, especially
when the noise level is high. Meanwhile, we designed a
robust method RIBG based on GMDH to impute missing
values in noisy environment. Comparative studies have
shown that RIBG performs quite well in comparison with
other four popular imputation methods in the presence of
noise. Given the frequent occurrence of missing values and
noise, RIBG is a good choice in imputing incomplete data
and has great potential in real-world data mining applications.