For 100-words dataset, the experiment comprises of 1000 samples (100 aud io fIles for each speaker with a total of ten speakers). We adopt a 10 fold cross validation approach and calculate the mean and variance for accuracy and WER. The results are reported in Table II. Accuracy and Word Error Rate (WER) are calculated as: