Clean speech reconstruction from noisy mel-frequency cepstral coefficients using a sinusoidal model

X. Shao, B. P. Milner

Research output: Contribution to conferenceOther

9 Citations (Scopus)

Abstract

This paper extends the technique of speech reconstruction from MFCC by considering the effect of noisy speech. To reconstruct a clean speech signal from noise contaminated MFCC an estimate of the clean mel-filterbank vector is required together with a robust estimate of the pitch. This work applies spectral subtraction to the mel-filterbank vector (derived from noisy MFCC) to provide a clean speech spectral estimate. To obtain a reliable estimate of pitch a robust extraction technique is used. Spectrograms and informal listening tests reveal that a clean speech signal can be successfully reconstructed from the noisy MFCC. Pitch errors are shown to manifest themselves as artificial sounding bursts in the reconstructed speech signal. Incorrect estimates of the spectral envelope introduce periods of noise into the reconstructed speech.
Original languageEnglish
PagesI-704-707
DOIs
Publication statusPublished - Apr 2003
EventIEEE International Conference on Acoustics Speech and Signal Processing (ICASSP) - Philadelphia, United States
Duration: 18 Mar 200523 Mar 2005

Conference

ConferenceIEEE International Conference on Acoustics Speech and Signal Processing (ICASSP)
Country/TerritoryUnited States
CityPhiladelphia
Period18/03/0523/03/05

Cite this