Rigler et al. (1991) have argued that in networks containing multiple weight layers, the changes to the weights in each layer are scaled disproportionately by the gradient term
Rigler et al. (1991) have argued that in networks containing multiple weight layers, the changes to the weights in each layer are scaled disproportionately by the gradient term