Computer Science and Information Systems 2011 Volume 8, Issue 3, Pages: 869-888
Full text ( 406 KB)

Voice activity detection method based on multivalued coarse-graining Lempel-Ziv complexity

Zhao Huan, Wang Gangjin, Xu Cheng, Yu Fei

One of the key issues in practical speech processing is to locate precisely endpoints of the input utterance to be free of nonspeech regions. Although lots of studies have been performed to solve this problem, the operation of existing voice activity detection (VAD) algorithms is still far away from ideal. This paper proposes a novel robust feature for VAD method that is based on multi-valued coarsegraining Lempel-Ziv Complexity (MLZC), which is an improved algorithm of the binary coarse-graining Lempel-Ziv Complexity (BLZC). In addition, we use fuzzy c-Means clustering algorithm and the Bayesian information criterion algorithm to estimate the thresholds of the MLZC characteristic, and adopt the dual-thresholds method for VAD. Experimental results on the TIMIT continuous speech database show that at low SNR environments, the detection performance of the proposed MLZC method is superior to the VAD in GSM ARM, G.729 and BLZC method.

Keywords: speech processing, voice activity detection, Lempel-Ziv complexity, multi-valued coarse-graining, fuzzy c-Means clustering algorithm, Bayesian information criterion algorithm

More data about this article available through SCIndeks