Abstract
This work examines the issues involved in performing robust speech recognition over a packet-based network such as an IP network. Doing so requires combining robust speech recognition with a reliable method of sending speech data over the network. The format in which the speech is sent is considered, and results show that much better robustness is achieved when the front-end features are transmitted directly rather than encoding the speech with a codec. The problem of packet loss is addressed, and a novel detection and estimation scheme for missing frames is introduced to overcome it. This is shown to recover performance with 50% packet loss from 33% to 90%, which is only 3% below the no-loss case.
Original language | English |
---|---|
Pages | 1791-1794 |
Number of pages | 4 |
Publication status | Published - 2000 |
Event | IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP) - Istanbul, Turkey. Duration: 5 Jun 2000 → 9 Jun 2000 |
Conference
Conference | IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP) |
---|---|
Country/Territory | Turkey |
City | Istanbul |
Period | 5/06/00 → 9/06/00 |