However, in spite of their effectiveness in classifying
short-time units such as individual phones and isolated words, neural networks are rarely
successful in continuous recognition tasks, largely because of their inability to
model temporal dependencies. Thus, one alternative approach is to use neural networks
as a pre-processing step (e.g., feature transformation or dimensionality reduction) for
HMM-based recognition.