A Comparison of Decision Tree Classifiers for Automatic Diagnosis of Speech Recognition Errors

keywords: Decision trees, classification, fault diagnosis, speech recognition
Present speech recognition systems are becoming more complex due to technology advances, optimizations and special requirements such as small computation and memory footprints. Proper handling of system failures can be seen as a kind of fault diagnosis. Motivated by the success of decision tree diagnosis in other scientific fields and by their successful application in speech recognition in the last decade, we contribute to the topic mainly in terms of comparison of different types of decision trees. Five styles are examined: CART (testing three different splitting criteria), C4.5, and then Minimum Message Length (MML), strict MML and Bayesian styles decision trees. We apply these techniques to data of computer speech recognition fed by intrinsically variable speech. We conclude that for this task, CART technique outperforms C4.5 in terms of better classification for ASR failures.
mathematics subject classification 2000: 68T10: Pattern recognition, speech recognition; 62H30: Classification and discrimination; cluster analysis; 62C05: General considerations; 62C12: Empirical decision procedures; empirical Bayes procedures
reference: Vol. 29, 2010, No. 3, pp. 489–501