Abstract
We address the problem of predicting pauses between the words in a sentence, which is of considerable interest for text to speech systems. In doing so, we show that the performance of both a generative classifier (naive Bayes, NB) and a discriminative classifier (maximum entropy, ME) can be significantly enhanced by application of the generalised probabilistic descent (GPD) algorithm. The features used for prediction of pauses in sentences are both local (derived from the neighbourhood of a word juncture) and global (derived from a parse tree of the sentence). We first compare the results of using the NB and ME classifiers on these features, and then develop the theory required for applying GPD to these classifiers. We show that GPD is particularly suitable for application within the maximum entropy framework and increases very significantly the discriminative power of both the NB and ME classifiers. The F-score of 81.2% obtained after application of GPD to an ME classifier is believed to be the best performance obtained on the Boston Radio Corpus.
Original language | English |
---|---|
Pages | 3229-3232 |
Number of pages | 4 |
Publication status | Published - Sep 2005 |
Event | 9th European Conference on Speech Communication and Technology - Lisbon, Portugal Duration: 4 Sep 2005 → 8 Sep 2005 |
Conference
Conference | 9th European Conference on Speech Communication and Technology |
---|---|
Abbreviated title | INTERSPEECH-2005 |
Country/Territory | Portugal |
City | Lisbon |
Period | 4/09/05 → 8/09/05 |