On Estimation of a Speaker's Confusion Matrix from Sparse Data

Research output: Contribution to conference (Paper)

2 Citations (Scopus)


Confusion matrices have been widely used to increase the accuracy of speech recognisers, but usually a mean confusion matrix, averaged over many speakers, is used. However, analysis shows that confusion matrices for individual speakers vary considerably, and so there is benefit in obtaining estimates of confusion matrices for individual speakers. Unfortunately, there is rarely enough data to make reliable estimates. We present a technique for estimating the elements of a speaker's confusion matrix given only sparse data from the speaker. It utilizes non-negative matrix factorisation to find structure within confusion matrices, and this structure is exploited to make improved estimates. Results show that under certain conditions, this technique can give estimates that are as good as those obtained with twice the number of utterances available from the speaker.
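
The abstract does not give the details of the factorisation or of how the structure is exploited, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes each training speaker's row-normalised confusion matrix is flattened into one row of a non-negative matrix V, factorises V with standard Lee-Seung multiplicative updates, and then fits a new speaker's sparse counts as a non-negative combination of the learned basis patterns. All names (`nmf`, `estimate_confusion`, `row_normalise`), the rank of 2, and the synthetic data are hypothetical choices made for illustration.

```python
import numpy as np

EPS = 1e-9  # small constant guarding against division by zero


def row_normalise(counts):
    """Turn a K x K count matrix into row-stochastic form; empty rows stay zero."""
    counts = np.asarray(counts, dtype=float)
    sums = counts.sum(axis=1, keepdims=True)
    return np.divide(counts, sums, out=np.zeros_like(counts), where=sums > 0)


def nmf(V, rank, n_iter=500, seed=0):
    """Approximate non-negative V as W @ H with multiplicative updates
    that minimise the Frobenius reconstruction error."""
    rng = np.random.default_rng(seed)
    W = rng.random((V.shape[0], rank)) + EPS
    H = rng.random((rank, V.shape[1])) + EPS
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + EPS)
        W *= (V @ H.T) / (W @ H @ H.T + EPS)
    return W, H


def estimate_confusion(sparse_counts, H, n_iter=500):
    """Fit non-negative weights w so that w @ H matches the observed rows of a
    speaker's sparse confusion matrix, then return the smoothed estimate."""
    counts = np.asarray(sparse_counts, dtype=float)
    K = counts.shape[0]
    v = row_normalise(counts).ravel()
    # Cells belonging to rows with no observations are excluded from the fit.
    mask = np.repeat(counts.sum(axis=1) > 0, K).astype(float)
    w = np.full(H.shape[0], 1.0 / H.shape[0])
    for _ in range(n_iter):
        w *= ((mask * v) @ H.T) / ((mask * (w @ H)) @ H.T + EPS)
    est = (w @ H).reshape(K, K)
    return est / np.maximum(est.sum(axis=1, keepdims=True), EPS)


# Synthetic demonstration (assumed data, not taken from the paper).
rng = np.random.default_rng(1)
K, n_speakers = 4, 30
base_a = row_normalise(np.eye(K) * 8 + rng.random((K, K)))  # accurate pattern
base_b = row_normalise(np.eye(K) * 2 + rng.random((K, K)))  # confusable pattern
mix = rng.random(n_speakers)
V = np.stack([(m * base_a + (1 - m) * base_b).ravel() for m in mix])
W, H = nmf(V, rank=2)

# A new speaker with only five observations per spoken class.
true_matrix = 0.3 * base_a + 0.7 * base_b
sparse = np.stack([rng.multinomial(5, p) for p in true_matrix])
estimate = estimate_confusion(sparse, H)
```

In this sketch, rows of the new speaker's matrix that have no observations are masked out of the fit and are filled in from the learned basis, which is one plausible way that structure shared across speakers could compensate for sparse data.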
Original language: English
Number of pages: 4
Publication status: Published - Sep 2008
Event: 9th Annual Conference of the International Speech Communication Association (INTERSPEECH) - Brisbane, Australia
Duration: 22 Sep 2008 to 26 Sep 2008


