应用科学学报 ›› 2014, Vol. 32 ›› Issue (2): 149-155.doi: 10.3969/j.issn.0255-8297.2014.02.006

• 信号与信息处理 • 上一篇    下一篇

基于动态匹配词格检索的关键词检测

郑永军, 张连海   

  1. 信息工程大学信息系统工程学院,郑州450002
  • 收稿日期:2013-07-15 修回日期:2013-10-22 出版日期:2014-03-25 发布日期:2013-10-22
  • 作者简介:张连海,博士,副教授,研究方向:语音信号处理、模式识别,E-mail: lianhaiz@sina.com
  • 基金资助:

    国家自然科学基金(No.61175017);全军军事学研究课题基金(No.2010JY0256-143)资助

Keyword Detection Based on Dynamic Match Lattice Spotting

ZHENG Yong-jun, ZHANG Lian-hai   

  1. Institute of Information Systems Engineering, Information Engineering University, Zhengzhou 450002, China
  • Received:2013-07-15 Revised:2013-10-22 Online:2014-03-25 Published:2013-10-22

摘要: 对生活中涌现的海量语音数据需要进行快速而准确的检索. 提出一种基于动态匹配词格检索的关键词检测方法,应用TRAP 特征和多层感知器创建更为精准的音素Lattice. 在索引阶段执行一个改进的维特比算法遍历Lattice 来创建一个固定长度的音素序列数据库,在检索阶段应用最小编辑距离作为置信度来实现关键词的检
出. 实验结果表明,该方法相比应用MFCC 和PLP 特征的基线系统具有一定的优势,召回率可提升5% 左右.

关键词: 关键词检测, 动态匹配词格检索, TRAP特征, 最小编辑距离

Abstract: The large amount of speech data requires techniques for rapid and accurate search. This paper proposes a keyword spotting method based on dynamic match Lattice spotting (DMLS). It generates more accurate phone Lattice with TRAP features and multilayer perceptron, and performs a modified Viterbi traversal to compile a database of fixed-length phone sequences in speech indexing. In the searching stage, a minimum edit distance is used as the confidence score to implement the keyword spotting. Tests show that the proposed method is superior to baseline systems with MFCC and PLP features with the recall rate improved by about 5%.

Key words: keyword spotting, dynamic match Lattice spotting, TRAP feature, minimum edit distance

中图分类号: