YOLO-DA: An Efficient YOLO-Based Detector for Remote Sensing Object Detection

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS(2023)

引用 1|浏览9
暂无评分
摘要
In the past few decades, many efficient object detectors have been proposed for natural scene image object detection (OD). However, due to the complex scenes and high interclass similarity of optical remote sensing (RS) images, applying these detectors to optical RS images directly is not very effective. Most of the recent detectors pursue higher accuracy while ignoring the balance between detection accuracy and speed, which hinders the practical application of these detectors, especially in embedded devices. To meet these challenges, a fast and accurate detector based on you only look once (YOLO) with decoupled attention head (YOLO-DA) is proposed, which effectively improves detection performance while only introducing minimal complexity. Specifically, an attention module at the end of the detector is designed for guiding a neural network to extract more efficient features from the complex background while also minimizing the amount of additional computation. Moreover, a lightweight decoupled detection head with enhanced classification and localization capability is developed to detect objects with high interclass similarity. In the experiments, the proposed method effectively solves the problem of high interclass similarity and improves the mean average precision (mAP) by 6.8% on the fine-grained optical RS dataset SIMD, compared with YOLOv5-L. In addition, the proposed method improves the mAP by 1.0%, 1.7%, and 0.6% on the other three publicly open optical RS datasets, respectively. Experimental results on detection accuracy and inference time demonstrate that our method achieves the best trade-off between detection performance and speed.
更多
查看译文
关键词
Detectors,Head,Feature extraction,Optical imaging,Optical detectors,Task analysis,Object detection,Attention,convolutional neural networks (CNNs),object detection (OD),remote sensing (RS) images
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要