Computer Science and Applications

Research on Pipe Sorting Based on Improved Mask R-CNN Using Feature Fusion and Region Generation Network

Expand
  • 1. School of Computer Science and Technology, North University of China, Taiyuan 030051, Shanxi, China;
    2. Shanxi Key Laboratory of Machine Vision and Virtual Reality, Taiyuan 030051, Shanxi, China;
    3. Shanxi Province's Vision Information Processing and Intelligent Robot Engineering Research Center, Taiyuan 030051, Shanxi, China

Received date: 2022-07-18

  Online published: 2023-09-28

Abstract

In order to solve the problems of difficulty in distinguishing various pipe fittings and interference of light and shadow on segmentation accuracy, a pipe fittings sorting algorithm based on improved Mask R-CNN is proposed. The feature fusion network is improved by incorporating low-level feature map, improving the recognition rate of small pipe fittings. The generation box of area growth network is modified according to the size ratio of pipe fitting, so to accelerate the convergence rate of the model. Introduction of the channel and space attention module enhances the identification accuracy of pipe fitting and mask effect. The improved Mask R-CNN is applied to the sorting task of four types of pipe fittings, demonstrating increased mAP and mRecall values for mask detection (1.5% and 1.7% improvement, respectively). The robot exhibits enhanced capabilities in discriminating the location, type and size of pipe fittings, thereby meeting the accuracy requirements of sorting pipe fittings in actual production.

Cite this article

HAN Huiyan, WU Weizhou, WANG Wenjun, HAN Xie . Research on Pipe Sorting Based on Improved Mask R-CNN Using Feature Fusion and Region Generation Network[J]. Journal of Applied Sciences, 2023 , 41(5) : 840 -854 . DOI: 10.3969/j.issn.0255-8297.2023.05.010

References

[1] 张慧, 王坤峰, 王飞跃. 深度学习在目标视觉检测中的应用进展与展望[J]. 自动化学报, 2017, 43(8):1289-1305. Zhang H, Wang K F, Wang F Y. Advances and perspectives on applications of deep learning in visual object detection[J]. Acta Automatica Sinica, 2017, 43(8):1289-1305. (in Chinese)
[2] 贾鹤鸣, 李瑶, 姜子超, 等. 基于改进共生生物搜索算法的林火图像多阈值分割[J]. 计算机应用, 2021, 41(5):1465-1470. Jia H M, Li Y, Jiang Z C, et al. Multi-threshold segmentation of forest fire images based on modified symbiotic organisms search algorithm[J]. Journal of Computer Applications, 2021, 41(5):1465-1470. (in Chinese)
[3] Hariharan B, Arbeláez P, Girshick R, et al. Simultaneous detection and segmentation[C]//European Conference on Computer Vision, 2014:297-312.
[4] Pinheiro P O, Collobert R, Dollár P. Learning to segment object candidates[J]. Advances in Neural Information Processing Systems, 2015, 28.
[5] He K, Gkioxari G, Dollár P, et al. Mask R-CNN[C]//Proceedings of the IEEE International Conference on Computer Vision, 2017:2961-2969.
[6] Ren S, He K, Girshick R, et al. Faster R-CNN:towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2017, 39(6):1137-1149.
[7] Zhang Y, Cheng W. Vision-based robot sorting system[C]//IOP Conference Series:Materials Science and Engineering, 2019, 592(1):012154.
[8] 梁新宇, 林洗坤, 权冀川, 等. 基于深度学习的图像实例分割技术研究进展[J]. 电子学报, 2020, 48(12):2476-2486. Liang X Y, Lin X K, Quan J C, et al. Research on the progress of image instance segmentation based on deep learning[J]. Acta Electronica Sinica, 2020, 48(12):2476-2486. (in Chinese)
[9] 王诗宇. 智能化工业机器人视觉系统关键技术研究[D]. 沈阳:中国科学院大学(中国科学院沈阳计算技术研究所), 2021.
[10] Qi C R, Su H, Mo K, et al. Pointnet:deep learning on point sets for 3D classification and segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017:652-660.
[11] Liang H, Ma X, Li S, et al. Pointnetgpd:detecting grasp configurations from point sets[C]//2019 International Conference on Robotics and Automation. IEEE, 2019:3629-3635.
[12] Lin T Y, Dollár P, Girshick R, et al. Feature pyramid networks for object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017:2117-2125.
[13] Nagaoka Y, Miyazaki T, Sugaya Y, et al. Text detection using multi-stage region proposal network sensitive to text scale[J]. Sensors, 2021, 21(4):1232.
[14] 戚裕绮, 范成杰, 陈欣, 等. 基于无锚框关键点算法的工件表面缺陷检测[J]. 计算机工程与设计, 2023, 44(1):203-209. Qi Y Q, Fan C J, Chen X, et al. Defect detection of workpiece surface based on anchorfree and keypoint algorithm[J]. Computer Engineering and Design, 2023, 44(1):203-209. (in Chinese)
[15] 林凯瀚, 赵慧民, 吕巨建, 等. 基于Mask R-CNN的人脸检测与分割方法[J]. 计算机工程, 2020, 46(6):274-280. Lin K H, Zhao H M, Lyu J J, et al. Face detection and segmentation method based on Mask R-CNN[J]. Computer Engineering, 2020, 46(6):274-280. (in Chinese)
[16] Hu J, Shen L, Sun G. Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018:7132-7141.
[17] 季秀怡, 李建华. 基于双路注意力机制的化学结构图像识别[J]. 计算机工程, 2020, 46(9):213-220. Ji X L, Li J H. Chemical structure image recognition based on dual attention mechanism[J]. Computer Engineering, 2020, 46(9):213-220. (in Chinese)
[18] 周鑫, 钟琴. 基于离散余弦变换的高光谱图像复原方法[J]. 激光杂志, 2023, 44(2):118-122. Zhou X, Zhong Q. Hyperspectral image restoration method based on discrete cosine transform[J]. Laser Journal, 2023, 44(2):118-122. (in Chinese)
[19] Qin Z, Zhang P, Wu F, et al. FCANET:frequency channel attention networks[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021:783-792.
[20] Barukab O, Ali F, Alghamdi W, et al. DBP-CNN:Deep learning-based prediction of DNAbinding proteins by coupling discrete cosine transform with two-dimensional convolutional neural network[J]. Expert Systems with Applications, 2022, 197:116729.
[21] 李昂, 孙士杰, 张朝阳, 等. 改进YOLOv5s的轨道障碍物检测模型轻量化研究[J]. 计算机工程与应用, 2023, 59(4):197-207. Li A, Sun S J, Zhang Z Y, et al. Research on lightweight of improved YOLOv5s track obstacle detection model[J]. Computer Engineering and Applications, 2023, 59(4):197-207. (in Chinese)
[22] 唐超, 左文涛, 李小飞. 结合修剪均值与高斯加权中值滤波的图像去噪算法[J]. 计算机工程, 2021, 47(9):210-216. Tang C, Zuo W T, Li X F. Image denoising algorithm combining trimmed mean and gaussian weighted median filtering[J]. Computer Engineering, 2021, 47(9):210-216. (in Chinese)
[23] Wang X, Kong T, Shen C, et al. Solo:segmenting objects by locations[C]//Computer Vision-ECCV 2020, 2020:649-665.
Outlines

/