应用科学学报 ›› 2000, Vol. 18 ›› Issue (1): 80-84.

• 论文 • 上一篇    下一篇

基于二阶差分耳蜗模型的语音识别新方法

余小清, 万旺根, 陶安, 袁京贤   

  1. 上海大学通信与信息工程学院, 上海 200072
  • 收稿日期:1998-07-30 修回日期:1999-01-31 出版日期:2000-03-31 发布日期:2000-03-31
  • 作者简介:余小清(1958-),女,安徽黟县人,副教授,硕士.
  • 基金资助:
    国家自然科学基金(69501007)、上海市启明星计划(96QD14008)、上海市曙光计划(98SG38)资助课题

A New Approach of Speech Recognition Based on Second-Order Difference Cochlear Model

YU Xiao-qing, WAN Wang-gen, TAO An, YUAN Jing-xian   

  1. Communication and Information Engineering Institute, Shanghai University, Shanghai 200072, China
  • Received:1998-07-30 Revised:1999-01-31 Online:2000-03-31 Published:2000-03-31

摘要: 采用二阶差分耳蜗模型对语音信号进行特征参数提取,获得了基于听觉谱的语音识别前端特征参数,同时根据听觉谱特征提出了一种"幅和频差积"距离测度,识别算法采用端点放松两帧,路径斜率限制在1/2到2之间的改进型DTW算法.在小词汇量非特定人(SI)的识别环境下,计算机模拟结果表明此法在对0~9十个数字以及小词汇量的SI识别时,其正识率可达98%以上,且具有较好的鲁棒性.

关键词: 语音识别, 二阶差分耳蜗模型, 听觉谱特征

Abstract: In this paper, the second -order difference cochlear model is used to extract the speech parameters. A kind of speech recognition front-end parameters based on auditory spectrum is obtained. A new "amplitude sum multiplied by frequency difference" distance measure is proposed according to the feature of speech parameters. The recognition algorithm is an improved DTW algorithm that sets two free frames in the beginning of speech segments and has the trace slope between 1/2 and 2. Under the recognition condition of small vocabulary or digits vocabulary and speaker independence, computer simulation shows that the algorithm attains an recognition accuracy of at least 98 percent, and it has the quite good robustness as well.

Key words: speech recognition, second-order difference cochlear model, auditory spectrum based speech parameter

中图分类号: