A multi-scale small object detection algorithm SMA-YOLO for UAV remote sensing images.

Shilong Zhou, Haijin Zhou, Lei Qian
Author Information
  1. Shilong Zhou: Key Laboratory of Environmental Optics and Technology, Anhui Institute of Optics and Fine Mechanics, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, 230031, China.
  2. Haijin Zhou: Key Laboratory of Environmental Optics and Technology, Anhui Institute of Optics and Fine Mechanics, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, 230031, China. hjzhou@aiofm.ac.cn.
  3. Lei Qian: Key Laboratory of Environmental Optics and Technology, Anhui Institute of Optics and Fine Mechanics, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, 230031, China.

Abstract

Detecting small objects in complex remote sensing environments presents significant challenges, including insufficient extraction of local spatial information, rigid feature fusion, and limited global feature representation. In addition, improving model performance requires a delicate balance between improving accuracy and managing computational complexity. To address these challenges, we propose the SMA-YOLO algorithm. First, we introduce the Non-Semantic Sparse Attention (NSSA) mechanism in the backbone network, which efficiently extracts non-semantic features related to the task, thus improving the model's sensitivity to small objects. In the model's throat, we design a Bidirectional Multi-Branch Auxiliary Feature Pyramid Network (BIMA-FPN), which integrates high-level semantic information with low-level spatial details, improving small object detection while expanding multi-scale receptive fields. Finally, we incorporate a Channel-Space Feature Fusion Adaptive Head (CSFA-Head), which fully handles multi-scale features and adaptively handles consistency problems of different scales, further improving the robustness of the model in complex scenarios. Experimental results on the VisDrone2019 dataset show that SMA-YOLO achieves a 13% improvement in mAP compared to the baseline model, demonstrating exceptional adaptability in small object detection tasks for remote sensing imagery. These results provide valuable insights and new approaches to further advance research in this area.

Keywords

References

  1. IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149 [PMID: 27295650]
  2. IEEE Trans Image Process. 2021;30:9099-9111 [PMID: 34735334]
  3. Sensors (Basel). 2023 Aug 15;23(16): [PMID: 37631727]

Word Cloud

Created with Highcharts 10.0.0smallimprovingsensingdetectionremotemodelSMA-YOLOFeatureobjectmulti-scaleobjectscomplexchallengesspatialinformationfeaturefusionalgorithmfeaturesmodel'shandlesresultsimagesDetectingenvironmentspresentssignificantincludinginsufficientextractionlocalrigidlimitedglobalrepresentationadditionperformancerequiresdelicatebalanceaccuracymanagingcomputationalcomplexityaddressproposeFirstintroduceNon-SemanticSparseAttentionNSSAmechanismbackbonenetworkefficientlyextractsnon-semanticrelatedtaskthussensitivitythroatdesignBidirectionalMulti-BranchAuxiliaryPyramidNetworkBIMA-FPNintegrateshigh-levelsemanticlow-leveldetailsexpandingreceptivefieldsFinallyincorporateChannel-SpaceFusionAdaptiveHeadCSFA-HeadfullyadaptivelyconsistencyproblemsdifferentscalesrobustnessscenariosExperimentalVisDrone2019datasetshowachieves13%improvementmAPcomparedbaselinedemonstratingexceptionaladaptabilitytasksimageryprovidevaluableinsightsnewapproachesadvanceresearchareaUAVMulti-branchauxiliaryObjectRemote

Similar Articles

Cited By