Enhancement of throat microphone speech using Empirical Mode Decomposition (EMD)

Indraneel Misra and Md. Easir Arafat *

Computer Science and Engineering, Pundra University of Science and Technlogy, Bogura, Rajshahi, Bangladesh.
 
Research Article
International Journal of Science and Research Archive, 2024, 12(02), 2149–2156.
Article DOI: 10.30574/ijsra.2024.12.2.1497
Publication history: 
Received on 06 July 2024; revised on 17 August 2024; accepted on 19 August 2024
 
Abstract: 
This paper presents a novel approach for enhancing the quality of throat microphone (TM) speech using Empirical Mode Decomposition (EMD). TM speech is known for its robustness in noisy environments but often suffers from poor intelligibility and unnatural sound due to the absence of high-frequency components. To address this issue, we propose using EMD to decompose the TM speech signal into intrinsic mode functions (IMFs) and selectively enhance components that contribute to improved speech clarity. The performance of the proposed method is evaluated using Perceptual Evaluation of Speech Quality (PESQ) scores, Signal-to-Noise Ratio (SNR), comparison of Linear Predictive Coding (LPC) spectra and Spectrogram analysis. Results demonstrate significant improvements in speech quality, making the approach a promising solution for applications requiring reliable communication in adverse conditions.
 
Keywords: 
Throat Microphone; Empirical Mode Decomposition (EMD); Speech Enhancement; Perceptual Evaluation of Speech Quality (PESQ); Signal-to-Noise Ratio (SNR).
 
Full text article in PDF: