Jointly gaussian pdf based likelihood ratio test for voice activity detection

Górriz, J. M. and Lang, Elmar and Ramírez, J. and Puntonet, Carlos G. (2008) Jointly gaussian pdf based likelihood ratio test for voice activity detection. IEEE transactions on audio, speech and language processing: T-ASL 16 (8), pp. 1565-1578.

Full text not available from this repository.

Abstract

This paper presents a novel voice activity detector (VAD) for improving speech detection robustness in noisy environments and the performance of speech recognition systems in real-time applications. The algorithm is based on a generalized complex Gaussian (GCG) observation model and defines an optimal likelihood ratio test (LRT) involving multiple and correlated observations (MCO) based on jointly Gaussian probability distribution functions (jGpdf). An extensive analysis of the proposed methodology for a low dimensional observation model demonstrates 1) the improved robustness of the proposed approach by means of a clear reduction of the classification error as the number of observations is increased, and 2) the tradeoff between the number of observations and the detection performance. The proposed strategy is also compared to different VAD methods including the G.729, AMR, and AFE standards, as well as other recently reported algorithms showing a sustained advantage in speech/nonspeech detection accuracy and speech recognition performance.

Item Type:Article
Institutions: Biology, Preclinical Medicine > Institut für Biophysik und physikalische Biochemie > Prof. Dr. Elmar Lang
Identification Number:
ValueType
10.1109/TASL.2008.2004293 DOI
Subjects:500 Science > 570 Life sciences
Status:Published
Refereed:Unknown
Created at the University of Regensburg:Unknown
Owner:Gertraud Kellers
Deposited On:04 Oct 2010 11:36
Last Modified:04 Oct 2010 11:36
Item ID:16906
Owner Only: item control page