应用科学学报 ›› 2022, Vol. 40 ›› Issue (1): 131-144.doi: 10.3969/j.issn.0255-8297.2022.01.012

• 计算机应用专辑 • 上一篇    下一篇

基于MFANet和上下文特征融合的遥感影像目标检测

汪鹏, 郑文凤, 史进, 金硕, 刘子豪   

  1. 河北工业大学 人工智能与数据科学学院, 天津 300401
  • 收稿日期:2021-07-23 出版日期:2022-01-28 发布日期:2022-01-28
  • 通信作者: 史进,硕士,助理研究员,研究方向为人工智能。E-mail:shijin@hebut.edu.cn E-mail:shijin@hebut.edu.cn
  • 基金资助:
    国家重点研发计划基金(No.2019YFC1904601);国家自然科学基金(No.61902106,No.61806072);河北省自然科学基金(No.F2020202028)资助

Remote Sensing Image Object Detection Based on MFANet and Contextual Features Fusion

WANG Peng, ZHENG Wenfeng, SHI Jin, JIN Shuo, LIU Zihao   

  1. School of Artificial Intelligence, Hebei University of Technology, Tianjin 300401, China
  • Received:2021-07-23 Online:2022-01-28 Published:2022-01-28

摘要: 针对遥感影像背景复杂、目标尺度变化较大、类间相似性较高等特点而导致目标检测效果欠佳的问题,提出一种基于Faster R-CNN的有效且鲁棒的遥感影像目标检测方法。首先,引入可变形卷积、调制机制和空洞卷积,构造调制的特征自适应网络,提取更准确、更完整的目标信息。其次,构造上下文特征金字塔网络,提取更丰富且更具判别性的特征表示来解决高层语义信息不足和多尺寸感受野之间缺乏有效沟通的问题。最后,在边界框回归中引入CIoU (complete IoU) LOSS,进一步提高目标检测的精度。为了验证所提方法的有效性,在公共数据集DIOR、RSOD和NWPU VHR-10上进行实验。结果表明:与Faster R-CNN with FPN方法相比,IF-RCNN在3个数据集上的平均检测精度分别获得了8.43%、7.5%和8.0%的绝对增益,证明了所提方法的有效性。

关键词: 目标检测, 遥感影像, 特征金字塔, 特征自适应

Abstract: Remote sensing images have the characteristics of complex background, large variations of object sizes and inter-class similarity, which lead to poor object detection results. An effective and robust remote sensing image object detection method based on Faster R-CNN is proposed. First, we introduce deformable convolution, feature modulation mechanisms and dilated convolution to construct a modulated feature adaptation network named MFANet, which can extract more accurate and complete object information. Second, a contextual feature pyramid network named CFPN is introduced to exploit richer and more discriminative feature representations. CFPN can solve the problems of insufficient high-level semantic information in the process of feature transfer and lack of effective communication between multi-size receptive fields. Finally, complete IoU (CIoU) loss is introduced into bounding box regression to further improve the accuracy of object detection. To verify the validity of the proposed method, we conduct experiments on public datasets DIOR, RSOD, and NWPU VHR-10. Experimental results show that compared with the Faster R-CNN with FPN method, IF-RCNN obtains an absolute gain of 8.43%, 7.5% and 8.0% in the average detection accuracy on the three datasets, respectively, which suggests that our proposed method is more effective and robust.

Key words: object detection, remote sensing image, feature pyramid, feature adaptation

中图分类号: