Calculate the entropy of every attribute using the data set S
Split the set S into subsets using the attribute for which entropy is minimum (or, equivalently, information gain is maximum)
Make a decision tree node containing that attribute
Recur on subsets using remaining attributes.