信号与信息处理

听觉模型倒谱系数及其在声目标识别中的应用

展开
  • 1. 解放军电子工程学院信息工程系,合肥230037
    2. 安徽省电子制约技术重点实验室,合肥230037
    3. 63889部队,河南孟州454750
刘辉,博士生,研究方向:声目标识别、智能计算,E-mail: christ592604@yahoo.com.cn;杨俊安,教授,博导,研究方向:模式识别、智能计算,E-mail: Jun-anyang@ustc.edu.cn

收稿日期: 2010-08-14

  修回日期: 2010-11-17

  网络出版日期: 2011-01-25

基金资助

国家自然科学基金(No.60872113)资助

Cepstrum Coefficient Based on Human Auditory Model with Application to Recognition of Acoustic Targets

Expand
  • 1. Department of Information Engineering, Electronic Engineering Institute of PLA, Hefei 230037, China
    2. Key Laboratory of Electronic Restriction of Anhui Provnice, Hefei 230037, China
    3. The Army 63889 Unit, Mengzhou 454750,Henan Province, China

Received date: 2010-08-14

  Revised date: 2010-11-17

  Online published: 2011-01-25

摘要

 针对目前广泛采用的美尔倒谱系数(MFCC)鲁棒性不足的问题,基于人类听觉模型提出了一种可用于战场声目标识别的倒谱系数. 用小波包变换代替了传统的傅里叶变换,克服了傅里叶变换在频域上单分辨率的缺陷和对噪声的敏感性. 用指数压缩替换固定的对数压缩,较好地模拟了人耳处理信号的非线性能力. 在SensIT实验数据和外场实际采集的低空目标数据上的实验结果表明:相对于经典的美尔倒谱系数,本文提出的倒谱系数在识别准确性和抗噪声能力方面都有较明显的提高.

本文引用格式

刘辉1;2, 杨俊安1;2, 周志增3 . 听觉模型倒谱系数及其在声目标识别中的应用[J]. 应用科学学报, 2011 , 29(1) : 51 -55 . DOI: 10.3969/j.issn.0255-8297.2011.01.009

Abstract

 This paper proposes a cepstrum coefficient model to overcome deficiency in the classic Mel-frequency cepstral coefficients (MFCC). Based on the human auditory model, two improvements are made in computing the new cepstrum features for acoustic target recognition. First, by allowing variable time-frequency resolution and to be immune to noise, wavelet packet transform (WPT) is used instead of FFT in the computation of classic MFCC. This overcomes deficiency of single resolution and reduces sensitivity to complicated battlefield environments. Second, based on the human auditory model, compression with the exponential-law is used instead of the fixed logarithm amplitude spectrum used in the classic MFCC. Experimental results show that the new algorithm can achieve significant advancement over the former method in accuracy and robustness.

文章导航

/