Article information

  • Title: Statistical Model-Based Voice Activity Detection Based on Second-Order Conditional MAP with Soft Decision
  • Author: Chang, Joon-Hyuk
  • Journal: ETRI Journal
  • Print ISSN: 1225-6463
  • Electronic ISSN: 2233-7326
  • Publication year: 2012
  • Volume: 34
  • Issue: 2
  • Pages: 184-189
  • DOI: 10.4218/etrij.12.0111.0344
  • Language: English
  • Publisher: Electronics and Telecommunications Research Institute
  • Abstract: In this paper, we propose a novel approach to statistical model-based voice activity detection (VAD) that incorporates a second-order conditional maximum a posteriori (CMAP) criterion. As a technical improvement for the first-order CMAP criterion in [1], we consider both the current observation and the voice activity decision in the previous two frames to take full consideration of the interframe correlation of voice activity. This is clearly different from the previous approach [1] in that we employ the voice activity decisions in the second-order (previous two frames) CMAP, which has quadruple thresholds with an additional degree of freedom, rather than the first-order (previous single frame). Also, a soft-decision scheme is incorporated, resulting in time-varying thresholds for further performance improvement. Experimental results show that the proposed algorithm outperforms the conventional CMAP-based VAD technique under various experimental conditions.
  • Keywords: Voice activity detection; second-order conditional MAP; soft decision; likelihood ratio test
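
The decision rule outlined in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: it only shows the idea of choosing one of four thresholds according to the voice activity decisions in the previous two frames and comparing the current frame's likelihood ratio against it. The function name `second_order_vad`, the `thresholds` mapping, the threshold values, and the initial state are all hypothetical, and the soft-decision scheme that makes the thresholds time-varying is not modeled here because the record gives no details of it.

```python
from typing import Dict, List, Sequence, Tuple


def second_order_vad(
    llr: Sequence[float],
    thresholds: Dict[Tuple[int, int], float],
    initial: Tuple[int, int] = (0, 0),
) -> List[int]:
    """Frame-wise VAD with a threshold conditioned on the two previous decisions.

    llr        -- per-frame (log-)likelihood ratio, assumed to be computed elsewhere
    thresholds -- hypothetical mapping (d[n-1], d[n-2]) -> threshold, four entries
    initial    -- assumed decisions for the two frames before the first one
    """
    prev1, prev2 = initial  # prev1 = d[n-1], prev2 = d[n-2]
    decisions: List[int] = []
    for value in llr:
        eta = thresholds[(prev1, prev2)]   # pick one of the four thresholds
        decision = 1 if value > eta else 0  # 1 = speech, 0 = non-speech
        decisions.append(decision)
        prev1, prev2 = decision, prev1      # shift the two-frame decision history
    return decisions
```

With illustrative thresholds such as `{(0, 0): 1.0, (0, 1): 0.8, (1, 0): 0.5, (1, 1): 0.2}`, a frame that follows two speech frames needs a much smaller likelihood ratio to be labeled speech than a frame that follows two non-speech frames, which is how the interframe correlation enters the decision in this sketch.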