This paper extends the technique of speech reconstruction from MFCC by considering the effect of noisy speech. To reconstruct a clean speech signal from noise contaminated MFCC an estimate of the clean mel-filterbank vector is required together with a robust estimate of the pitch. This work applies spectral subtraction to the mel-filterbank vector (derived from noisy MFCC) to provide a clean speech spectral estimate. To obtain a reliable estimate of pitch a robust extraction technique is used. Spectrograms and informal listening tests reveal that a clean speech signal can be successfully reconstructed from the noisy MFCC. Pitch errors are shown to manifest themselves as artificial sounding bursts in the reconstructed speech signal. Incorrect estimates of the spectral envelope introduce periods of noise into the reconstructed speech.
|Publication status||Published - Apr 2003|
|Event||IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP) - Philadelphia, United States|
Duration: 18 Mar 2005 → 23 Mar 2005
|Conference||IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP)|
|Period||18/03/05 → 23/03/05|