Bayesian channel equalisation and robust features for speech recognition

B. P. Milner, S. V. Vaseghi

Research output: Contribution to journalArticlepeer-review

4 Citations (Scopus)

Abstract

The use of a speech recognition system with telephone channel environments, or different microphones, requires channel equalisation. In speech recognition, the speech model provides a bank of statistical information that can be used in the channel identification and equalisation process. The authors consider HMM-based channel equalisation, and present results demonstrating that substantial improvement can be obtained through the equalisation process. An alternative method, for speech recognition, is to use a feature set which is more robust to channel distortion. Channel distortions result in an amplitude tilt of the speech cepstrum, and therefore differential cepstral features provide a measure of immunity to channel distortions. In particular the cepstral-time feature matrix, in addition to providing a framework for representing speech dynamics, can be made robust to channel distortions. The authors present results demonstrating that a major advantage of cepstral-time matrices is their channel insensitive character
Original languageEnglish
Pages (from-to)223-231
Number of pages9
JournalIEE Proceedings: Vision, Image and Signal Processing
Volume143
Issue number4
DOIs
Publication statusPublished - 1996

Cite this