Automatic Classification of Bamboo Flute Playing Skills Based on Deep Learning

doi:10.3969/j.issn.0255-8297.2021.04.015

Journal of Applied Sciences (应用科学学报), 2021, Vol. 39, Issue 4: 685-694. doi: 10.3969/j.issn.0255-8297.2021.04.015

• CCF NCCA 2020 Special Issue •

GUO Yubo (郭毓博)¹, LU Jun (陆军)¹,², DUAN Pengqi (段鹏启)¹

1. College of Computer Science and Technology, Heilongjiang University, Harbin 150080, Heilongjiang, China;
2. Key Laboratory of Database and Parallel Computing of Heilongjiang Province, Heilongjiang University, Harbin 150080, Heilongjiang, China

Received: 2020-08-26 Published: 2021-08-04
Corresponding author: LU Jun, professor; research interests include artificial intelligence, deep learning, and data mining. E-mail: lujun111_lily@sina.com

Abstract

This paper proposes Breath, a dataset for classifying bamboo flute playing techniques, together with two neural network reference models for the task, Breath1d and Breath2d, and identifies the best-performing method for each classification task on the dataset. The Breath dataset is divided into subsets, and a multi-layer perceptron serves as the performance baseline. The Breath1d and Breath2d models are first trained and evaluated on each subset, and a long short-term memory (LSTM) network is then used for auxiliary testing, yielding the most suitable reference model for each subtask. For classification over the whole dataset, the Breath2d and Breath1d models are fused and data augmentation is applied, raising the classification accuracy on the full dataset to 0.913 (91.3%). Compared with traditional audio classification tasks, this work extends the scope of music classification research and supports the modernization of traditional Chinese music.

Key words: artificial intelligence, pattern recognition, neural network, deep learning, audio classification
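The abstract does not say how the Breath1d and Breath2d models are fused. One common option, shown here purely as an illustrative sketch, is late fusion: each model scores the same clip, the scores are converted to class probabilities, and the two probability vectors are averaged. The function names, the equal weighting, and the example logits below are all assumptions and are not taken from the paper.

```python
import math


def softmax(logits):
    """Convert raw class scores to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fuse(probs_1d, probs_2d, weight_1d=0.5):
    """Late fusion: weighted average of two class-probability vectors.

    weight_1d is a hypothetical tuning knob; 0.5 means plain averaging.
    """
    if len(probs_1d) != len(probs_2d):
        raise ValueError("both models must score the same set of classes")
    return [weight_1d * a + (1.0 - weight_1d) * b
            for a, b in zip(probs_1d, probs_2d)]


def predict(probs):
    """Return the index of the most probable class."""
    return max(range(len(probs)), key=probs.__getitem__)


# Made-up logits for one audio clip over three technique classes.
logits_1d = [2.0, 1.0, 0.1]   # stand-in for a 1D (waveform) model's output
logits_2d = [0.5, 2.5, 0.3]   # stand-in for a 2D (spectrogram) model's output

fused = fuse(softmax(logits_1d), softmax(logits_2d))
print(predict(fused))
```

In this toy case the two models disagree, and the fused prediction follows the model that is more confident in its top class. A weighted average (adjusting `weight_1d`) or a small classifier trained on the concatenated outputs would be equally plausible alternatives.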

CLC number: TP391.4

GUO Yubo, LU Jun, DUAN Pengqi. Automatic Classification of Bamboo Flute Playing Skills Based on Deep Learning[J]. Journal of Applied Sciences, 2021, 39(4): 685-694.
