应用科学学报 ›› 2019, Vol. 37 ›› Issue (2): 282-290.doi: 10.3969/j.issn.0255-8297.2019.02.013

• 信号与信息处理 • 上一篇    下一篇

利用深度残差网络的高分遥感影像语义分割

李欣1, 唐文莉1, 杨博2,3   

  1. 1. 武汉大学遥感信息工程学院, 武汉 430079;
    2. 武汉大学地球空间信息技术协同创新中心, 武汉 430079;
    3. 武汉大学测绘遥感信息工程国家重点实验室, 武汉 430079
  • 收稿日期:2018-02-05 修回日期:2018-04-15 出版日期:2019-03-31 发布日期:2019-03-31
  • 作者简介:李欣,教授,研究方向:近景摄影测量、工业测量,E-mail:xli2126@whu.edu.cn
  • 基金资助:
    国家自然科学基金(No.41371426,No.41271431)资助

Semantic Segmentation of High-Resolution Remote Sensing Image Based on Deep Residual Network

LI Xin1, TANG Wen-li1, YANG Bo2,3   

  1. 1. School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China;
    2. Collaborative Innovation Center of Geospatial Technology, Wuhan University, Wuhan 430079, China;
    3. State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430079, China
  • Received:2018-02-05 Revised:2018-04-15 Online:2019-03-31 Published:2019-03-31

摘要: 遥感影像分割是影像解译与分析的必要过程,随着深度学习在特征表达上的优势逐步显现,以深度网络为基础模型的影像语义分割已成为自动分割的主要研究趋势.该文提出了一种基于深度残差网络的多尺度语义分割模型,旨在针对小样本遥感影像数据集,提高具有不同尺度分割对象的遥感影像分割精度.首先将深度残差网络以全卷积网络形式进行微调,实现端到端语义分割模型结构构建;然后针对全卷积网络粗糙分割输出的问题,引入Atrous卷积精细化模型上采样过程,进而提高输出标签图精度;最后针对小样本数据进行随机多尺度数据增强,通过样本扩充提高模型分类精度和鲁棒性.试验基于ISPRS 2D Vaihingen语义分割数据集,影像分割结果的分类精度达到89.7%,尤其在小尺度对象上具有较好分割效果.

关键词: 遥感影像语义分割, 深度残差网络, Atrous卷积, 多尺度数据增强

Abstract: As an important part of image interpretation and analysis, segmentation of remote sensing images has been widely researched. However, traditional segmentation method based on hand-crafted features has its limitations on accuracy and generalization, state-of-the-art methods are mainly relied on deep learning in recent years. In this paper, we propose a new segmentation method based on multi-scale deep residual neural networks, which aims at improving segmentation accuracy, especially on small-scale objects. We frstly utilize Residual Network (ResNet) and transform it to fully convolution networks (FCN), in which, Atrous convolution is introduced during the up-sampling process to ensure the feld of view on each layer. Then we add multi-scale data augmentation to improve the robustness for small objects. The proposed approach is applied on ISPRS 2D Vaihingen semantic labeling contest dataset, and yields high accuracy at 89.7%, outperforming most state-of-the-art methods.

Key words: semantic segmentation of remote sensing image, deep residual network, Atrous convolution, multi-scale data augmentation

中图分类号: