应用科学学报 ›› 2022, Vol. 40 ›› Issue (5): 838-849.doi: 10.3969/j.issn.0255-8297.2022.05.013

• 计算机科学与应用 • 上一篇    下一篇

神经网络剪枝技术研究综述

姜晓勇1,2, 李忠义1, 黄朗月1, 彭孟乐1, 徐书杨1   

  1. 1. 浙江科技学院 机械与能源工程学院, 浙江 杭州 310023;
    2. 浙江大学 机械工程学院, 浙江 杭州 310058
  • 收稿日期:2021-09-12 出版日期:2022-09-30 发布日期:2022-09-30
  • 通信作者: 姜晓勇,教授级高工,研究方向为机器视觉。E-mail:11525074@zju.edu.cn E-mail:11525074@zju.edu.cn

Review of Neural Network Pruning Techniques

JIANG Xiaoyong1,2, LI Zhongyi1, HUANG Langyue1, PENG Mengle1, XU Shuyang1   

  1. 1. School of Mechanical and Energy Engineering, Zhejiang University of Science and Technology, Hangzhou 310023, Zhejiang, China;
    2. School of Mechanical Engineering, Zhejiang University, Hangzhou 310058, Zhejiang, China
  • Received:2021-09-12 Online:2022-09-30 Published:2022-09-30

摘要: 本文梳理了神经网络剪枝技术的起源与研究进展,将其分为对权重参数稀疏化的非结构化剪枝和粗粒度的结构化剪枝,分别介绍了两者近年来具有代表性的方法。由于剪枝减少了模型参数,压缩了模型大小,使得深度模型能应用于嵌入式设备,表现出剪枝在深度学习模型压缩领域中的重要性。针对现有剪枝技术,阐述了一些在实际应用和衡量标准上存在的问题,并对未来的研究发展方向进行了展望。

关键词: 深度卷积神经网络, 深度学习, 模型压缩, 剪枝

Abstract: This paper summaries the origin and research progress of neural network pruning technologies, divides them into two categories of unstructured pruning with sparse weight parameters and coarse-grained structured pruning, and introduces the representative methods of the two categories in recent years. Because pruning reduces model parameters and compresses the model size, depth models can be applied to embedded devices, showing the importance of pruning in the field of deep learning model compression. In view of the existing pruning technologies, this paper expounds the problems existing in practical applications and measurement standards, and prospects the research and development tendency in the future.

Key words: deep convolutional neural network, deep learning, model compression, pruning

中图分类号: