[1] Alías F, Socoró J, Sevillano X. A review of physical and perceptual feature extraction techniques for speech, music and environmental sounds[J]. Applied Sciences, 2016, 6(5):143. [2] Tripathi A M, Mishra A. Environment sound classification using an attention-based residual neural network[J]. Neurocomputing, 2021, 460:409-423. [3] Piczak K J. Environmental sound classification with convolutional neural networks[C]//2015 IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP), 2015:1-6. [4] Tripathi A M, Mishra A. Self-supervised learning for environmental sound classification[J]. Applied Acoustics, 2021, 182:108183. [5] Su Y, Zhang K, Wang J Y, et al. Performance analysis of multiple aggregated acoustic features for environment sound classification[J]. Applied Acoustics, 2020, 158:107050. [6] Peng N, Chen A B, Zhou G X, et al. Environment sound classification based on visual multi-feature fusion and GRU-AWS[J]. IEEE Access, 2020, 8:191100-191114. [7] Mushtaq Z, Su S F, Tran Q V. Spectral images based environmental sound classification using CNN with meaningful data augmentation[J]. Applied Acoustics, 2021, 172:107581. [8] Li S B, Yao Y, Hu J, et al. An ensemble stacked convolutional neural network model for environmental event sound recognition[J]. Applied Sciences, 2018, 8(7):1152. [9] Nanni L, Maguolo G, Brahnam S, et al. An ensemble of convolutional neural networks for audio classification[J]. Applied Sciences, 2021, 11(13):5796. [10] Luz J S, Oliveira M C, Araújo F H D, et al. Ensemble of handcrafted and deep features for urban sound classification[J]. Applied Acoustics, 2021, 175:107819. [11] Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[DB/OL]. 2014[2021-09-24]. https://arxiv.org/abs/1409.1556. [12] Piczak K J. ESC:dataset for environmental sound classification[C]//23rd ACM international conference on Multimedia, 2015:1015-1018. [13] Boddapati V, Petef A, Rasmusson J, et al. Classifying environmental sounds using image recognition networks[J]. Procedia Computer Science, 2017, 112:2048-2056. |