Combinations of decision tree with other learning algorithms have been studied
in various ways before. An early example of a hybrid decision tree algorithm is
presented by Utgoff (1988). Here, a decision tree learner is introduced that uses
linear threshold units at the leaf nodes; however, pruning is not considered as
the algorithm was expected to work only on noise-free domains.