Statistical-based reconstruction methods for speech recognition in IP networks

A. M. Gómez, A. M. Peinado, V. Sánchez, B. P. Milner, A. J. Rubio

Research output: Contribution to conference › Paper

Abstract

This work evaluates statistical-based reconstruction techniques for speech feature vectors transmitted over a network with bursty packet loss in a distributed speech recognition (DSR) architecture. Two approaches that exploit prior information about the speech are outlined. The first models the sequence of quantized vectors through transition probabilities and bases its estimates on this data-source information, while the second uses prior knowledge of the means and covariances of the feature vector stream to make a maximum a posteriori (MAP) estimate of the lost vectors. Both methods give better results than the AURORA nearest-repetition scheme, especially in the presence of bursts of losses. However, they require either a considerable amount of memory or high time complexity. Therefore, a novel solution based on the previous methods is proposed and evaluated.
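The MAP idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it treats a single feature dimension as a stationary Gaussian process and assumes, purely for illustration, an AR(1)-style correlation `rho ** k` between frames `k` steps apart. The names `map_estimate`, `reconstruct_burst`, `mu`, and `rho` are hypothetical. Under a joint Gaussian model the MAP estimate equals the conditional mean, so each lost frame is estimated from the last frame received before the burst and the first frame received after it.

```python
def map_estimate(y_prev, y_next, a, b, mu, rho):
    """MAP (conditional-mean) estimate of one lost scalar feature.

    y_prev: value received `a` frames before the lost frame (a >= 1)
    y_next: value received `b` frames after the lost frame (b >= 1)
    mu:     prior mean of the feature (assumed known)
    rho:    assumed lag-1 correlation, |rho| < 1 (illustrative AR(1) model)
    """
    ra, rb = rho ** a, rho ** b      # correlation of lost frame with each neighbor
    c = rho ** (a + b)               # correlation between the two received frames
    det = 1.0 - c * c                # determinant of the normalized 2x2 covariance
    # Weights from Sigma_xy * inverse(Sigma_yy); the common variance cancels.
    w_prev = (ra - rb * c) / det
    w_next = (rb - ra * c) / det
    return mu + w_prev * (y_prev - mu) + w_next * (y_next - mu)


def reconstruct_burst(y_prev, y_next, n_lost, mu, rho):
    """Estimate every frame in a burst of n_lost consecutive losses."""
    return [
        map_estimate(y_prev, y_next, k, n_lost + 1 - k, mu, rho)
        for k in range(1, n_lost + 1)
    ]


if __name__ == "__main__":
    # A burst of 3 lost frames between received values 1.0 and 2.0.
    print(reconstruct_burst(1.0, 2.0, 3, mu=0.0, rho=0.9))
```

In this toy model the estimates decay toward the prior mean `mu` as a lost frame gets farther from its received neighbors, whereas nearest repetition simply copies the closest received frame regardless of burst length.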
Original language: English
Publication status: Published - Aug 2004
Event: COST278 and ISCA Tutorial and Research Workshop (ITRW) on Robustness Issues in Conversational Interaction (Robust2004) - University of East Anglia, Norwich, United Kingdom
Duration: 30 Aug 2004 - 31 Aug 2004

Conference

Conference: COST278 and ISCA Tutorial and Research Workshop (ITRW) on Robustness Issues in Conversational Interaction (Robust2004)
Country/Territory: United Kingdom
City: Norwich
Period: 30/08/04 - 31/08/04
