应用科学学报 ›› 2009, Vol. 27 ›› Issue (5): 514-519.

• 信号与信息处理 • 上一篇    下一篇

改进波形相似叠加算法的音频时长调整

许雪琼;余小清;李昌莲;万旺根   

  1. 上海大学通信与信息工程学院,上海200072
  • 收稿日期:2009-03-05 修回日期:2009-05-06 出版日期:2009-09-25 发布日期:2009-09-25
  • 通信作者: 万旺根,教授,博导,研究方向:音频信号处理、数据挖掘、虚拟现实等,E-mail: wanwg@staff.shu.edu.cn
  • 基金资助:
    国家自然科学基金(No.60872115);上海市科委国际合作基金(No.075107035);上海市教委电路与系统重点学科基金(No.J50104);上海市重点学科和科委重点实验室基金(No.S30108)资助项目

Time-Scale Modification of Audio Signal Using Improved WSOLA Algorithm

  1. School of Communication and Information Engineering, Shanghai University, Shanghai 200072, China
  • Received:2009-03-05 Revised:2009-05-06 Online:2009-09-25 Published:2009-09-25

摘要:

针对波形相似叠加算法在处理高采样率音频时效率低的缺点,提出由短时均值包络到细化波形的逐步匹配方法. 首先基于短时均值包络进行粗匹配,在此基础上细化包络,进行再匹配以实现音频时长调整. 该算法降低了计算量,提高了运算效率. 在进行音频时长调整过程中,还利用音频的优化低能量率特征参数动态调整分析窗长度,实验表明这种处理方法对混合音频的处理效果有很大改进.

关键词: 音频时长调整, 调整因子, 短时均值包络, 互相关系数, 优化低能量率

Abstract:

To improve efficiency of the waveform similarity overlap-and-add (WSOLA) algorithm in audio signal processing at high sampling rate, this paper proposes a matching method that is progressively performed from the short time mean envelop to the signal waveform. We compute a rough matching envelop based on short time mean envelop, and then perform an exact waveform matching for time-scale modification of the audio signal. The
algorithm reduces computation complexity, and improves efficiency with good outcome. In addition, the length of
analysis windows is dynamically adjusted based on the modified low energy ratio parameter. Experiments show that it significantly improves processing results of mixed audio.

Key words: time-scale modification of audio signal, time-scaling factor, short time mean envelop, cross-correlation coefficient, modified low energy ratio

中图分类号: