Locally Predictive Features
Because correlations are estimated globally (over all
training instances), CFS tends to select a core" subset
of features that has low redundancy and is strongly
predictive of the class. In some cases however, there
may be subsidiary features that are locally predictive
in a small area of the instance space. CFS includes a
heuristic to include locally predictive features and avoid
the re-introduction of redundancy.