应用科学学报 ›› 1998, Vol. 16 ›› Issue (3): 320-325.

• 论文 • 上一篇    下一篇

基于加权全局时频特征的易混淆词识别

顾明亮, 王太君, 史笑兴, 何振亚   

  1. 东南大学
  • 收稿日期:1997-04-26 修回日期:1997-10-12 出版日期:1998-09-30 发布日期:1998-09-30
  • 作者简介:顾明亮:博士生,东南大学无线电工程系,南京 210018
  • 基金资助:
    国家攀登计划认知科学(神经网络)重大关键资助

WGTF Feature-Based Confusing Word Recognition

GU MINGLIANG, WANG TAIJUN, SHI XIAOXING, HE ZHENYA   

  1. Southeast University, Nanjing 210018
  • Received:1997-04-26 Revised:1997-10-12 Online:1998-09-30 Published:1998-09-30

摘要: 针对易混淆词特征差异小、分类决策困难的特点,提出了一种新的语音识别特征.该特征可以根据待识单词的发音特点,通过选用合适的基函数及加权处理,突出混淆单词特征之间的差异性;同时,根据其矢量维数相等的特点,利用静态神经网络分类决策能力强、容错性好的优点进一步提高系统的识别性能.实验结果表明,所用方法比传统的DHMM方法和其他神经网络语音识别方法具有更好的识别效率.

关键词: 声道模型, 权函数, GTF特征, 语音识别, 神经网络

Abstract: This paper presents a novel feature (Weighted Global Time-Frequency feature, i.e WGTF) for confusing word speech recognition, which enhances the difference among different confusing words by selecting proper base fuctions and weighting functions. Meanwhile, the storng discriminative power of artificial neural network has been used as a classifier to further raise the recognition rate. The experiment shows that the proposed method outperforms the standard DHMM and other ANN-based method.

Key words: vocal tract model, GTF feature, neural network, speech recognition, weighting function