应用科学学报 ›› 2011, Vol. 29 ›› Issue (1): 51-55.doi: 10.3969/j.issn.0255-8297.2011.01.009

• 信号与信息处理 • 上一篇    下一篇

听觉模型倒谱系数及其在声目标识别中的应用

刘辉1;2, 杨俊安1;2, 周志增3   

  1. 1. 解放军电子工程学院信息工程系,合肥230037
    2. 安徽省电子制约技术重点实验室,合肥230037
    3. 63889部队,河南孟州454750
  • 收稿日期:2010-08-14 修回日期:2010-11-17 出版日期:2011-01-26 发布日期:2011-01-25
  • 作者简介:刘辉,博士生,研究方向:声目标识别、智能计算,E-mail: christ592604@yahoo.com.cn;杨俊安,教授,博导,研究方向:模式识别、智能计算,E-mail: Jun-anyang@ustc.edu.cn
  • 基金资助:

    国家自然科学基金(No.60872113)资助

Cepstrum Coefficient Based on Human Auditory Model with Application to Recognition of Acoustic Targets

LIU Hui1;2, YANG Jun-an1;2, ZHOU Zhi-zeng3   

  1. 1. Department of Information Engineering, Electronic Engineering Institute of PLA, Hefei 230037, China
    2. Key Laboratory of Electronic Restriction of Anhui Provnice, Hefei 230037, China
    3. The Army 63889 Unit, Mengzhou 454750,Henan Province, China
  • Received:2010-08-14 Revised:2010-11-17 Online:2011-01-26 Published:2011-01-25

摘要:

 针对目前广泛采用的美尔倒谱系数(MFCC)鲁棒性不足的问题,基于人类听觉模型提出了一种可用于战场声目标识别的倒谱系数. 用小波包变换代替了传统的傅里叶变换,克服了傅里叶变换在频域上单分辨率的缺陷和对噪声的敏感性. 用指数压缩替换固定的对数压缩,较好地模拟了人耳处理信号的非线性能力. 在SensIT实验数据和外场实际采集的低空目标数据上的实验结果表明:相对于经典的美尔倒谱系数,本文提出的倒谱系数在识别准确性和抗噪声能力方面都有较明显的提高.

关键词: 声目标识别, 美尔倒谱系数, 听觉模型

Abstract:

 This paper proposes a cepstrum coefficient model to overcome deficiency in the classic Mel-frequency cepstral coefficients (MFCC). Based on the human auditory model, two improvements are made in computing the new cepstrum features for acoustic target recognition. First, by allowing variable time-frequency resolution and to be immune to noise, wavelet packet transform (WPT) is used instead of FFT in the computation of classic MFCC. This overcomes deficiency of single resolution and reduces sensitivity to complicated battlefield environments. Second, based on the human auditory model, compression with the exponential-law is used instead of the fixed logarithm amplitude spectrum used in the classic MFCC. Experimental results show that the new algorithm can achieve significant advancement over the former method in accuracy and robustness.

Key words: acoustic target recognition, MFCC, auditory mode