应用科学学报 ›› 2023, Vol. 41 ›› Issue (6): 967-977.doi: 10.3969/j.issn.0255-8297.2023.06.005

• 信号与信息处理 • 上一篇    下一篇

基于SASK和双分支结构的服装图像识别方法

周啸辉, 余磊, 张睿婷, 熊邦书, 欧巧凤   

  1. 南昌航空大学 图像处理与模式识别江西省重点实验室, 江西 南昌 330063
  • 收稿日期:2021-12-03 出版日期:2023-11-30 发布日期:2023-11-30
  • 通信作者: 余磊,副教授,研究方向为图像处理及应用。E-mail:yulei@nchu.edu.cn E-mail:yulei@nchu.edu.cn
  • 基金资助:
    国家自然科学基金(No.62162044,No.61866027);江西省重点研发计划项目基金(20212BBE53017);江西省自然科学基金(No.20202BAB202016);南昌航空大学研究生创新专项基金(No.YC2020043)资助

Clothing Image Recognition Method Based on SASK and Double Branch Structure

ZHOU Xiaohui, YU Lei, ZHANG Ruiting, XIONG Bangshu, OU Qiaofeng   

  1. Key Laboratory of Image Processing and Pattern Recognition of Jiangxi Province Nanchang Hangkong University, Nanchang 330063, Jiangxi, China
  • Received:2021-12-03 Online:2023-11-30 Published:2023-11-30

摘要: 服装图像具有明暗不一、尺度各异的特性,这使得已有识别方法表现不佳。为解决此问题,本文基于空间注意力选择核(space attention selective kernel,SASK)模块和双分支结构搭建神经网络模型对服装图像进行识别。首先,结合跳跃连接、稠密连接和多尺度、通道拆分的思想,设计双分支神经网络,充分提取服装对象的整体特征。其次,基于空间注意力机制,设计SASK模块,使网络可以更多地关注服装对象的形态特征信息,从而提升识别效果。实验结果表明,本文所提方法不但在典型服装数据集上能够取得优于现有主流方法的识别精度,而且在具有明暗不一、尺度各异特性的其他图像数据集上同样表现良好。

关键词: 服装图像识别, 空间注意力选择核, 双分支神经网络, 明暗不一, 尺度各异

Abstract: Due to the limited performance of the existing recognition methods for clothing images with varying brightness and scales, we propose a neural network model based on spatial attention selective kernel(SASK) and double branch structure in this paper.Firstly, a double branch neural network that incorporates jump connection, dense connection, multi-scale and channel splitting is established to fully extract the overall features of clothing images. Secondly, the SASK module based on the spatial attention mechanism is designed to enable the network to focus on the morphological feature information of clothing images for accurate recognition. Experimental results show that our method not only improves the recognition accuracy of typical clothing datasets compared to existing mainstream methods, but also performs effectively on image datasets with different brightness and scales.

Key words: clothing image recognition, spatial attention selective kernel(SASK), twobranch neural network, different brightness, different scales

中图分类号: