where wi,j denotes the weight of the connection from node j to i and o the output node. The performance is sensitive to the topology choice (H). A NN with H = 0 is equivalent to the MR model. By increasing H, more complex mappings can be performed, yet an excess value of H will overfit the data, leading to generalization