Soft Decoding of Temporal Derivatives for Robust Distributed Speech Recognition in Packet Loss

A. B. James, B. P. Milner

Research output: Contribution to conferencePaper

5 Citations (Scopus)

Abstract

The aim of this work is to improve distributed speech recognition accuracy in packet loss by considering the effect of loss on the temporal derivatives of the feature vector. Analysis of temporal derivatives reveals they suffer severe distortion when static vectors are lost in times of packet loss. The application of missing feature theory and soft-decoding techniques are considered for compensating against packet loss at the decoding stage of recognition. An extension to these methods is developed which considers the static, velocity and acceleration components separately. A series of confidence measures for the temporal derivatives is devised and applied within the soft-decoding framework. Experimental results on both a connected digit task and a large vocabulary task demonstrate significant increases in recognition accuracy under a range of packet loss conditions.
Original languageEnglish
Pages345-348
Number of pages4
DOIs
Publication statusPublished - Mar 2005
EventIEEE International Conference on Acoustics Speech and Signal Processing (ICASSP) - Philadelphia, United States
Duration: 18 Mar 200523 Mar 2005

Conference

ConferenceIEEE International Conference on Acoustics Speech and Signal Processing (ICASSP)
Country/TerritoryUnited States
CityPhiladelphia
Period18/03/0523/03/05

Cite this