a) What are the conditions for consistency of the ERM
(empirical risk minimization) principle?
b) How fast does the sequence of smallest empirical risk
values converge to the smallest actual risk?
c) How can one control the rate of convergence (the rate of
generalization) of the learning machine?
d) How can one construct algorithms that can control the
rate of generalization?