In this paper, we describe the design of an ASR system that is based on identifying and extracting formulaic phrases from a corpus and then, rather than building statistical models of them, performing example-based recognition of these phrases. We describe a method for combining formulaic phrases into a bigram language model that results in a 13% decrease in WER on a monophone HMM recogniser over the baseline. We show that using this model with phrase templates in the example-based recogniser gives a significant improvement in WER compared to word templates, but performance still falls short of the HMM recogniser. We also describe an LDA decision tree classifier that reduces the search space of the DTW decoder by 40% while at the same time decreasing WER.
|Number of pages||4|
|Publication status||Published - Sep 2009|
|Event||10th Annual Conference of the International Speech Communication Association (INTERSPEECH) - Brighton, United Kingdom|
Duration: 6 Sep 2009 → 10 Sep 2009
|Conference||10th Annual Conference of the International Speech Communication Association (INTERSPEECH)|
|Period||6/09/09 → 10/09/09|