Hybrid Convolutional-Transformer framework for drone-based few-shot weakly supervised object detection

Shengming Li,Linsong Xue,Lin Feng,Cuili Yao,Dong Wang

Computers and Electrical Engineering（2022）

引用 1|浏览3

暂无评分

摘要

Drone delivery is becoming a new trend in the logistics system, but few researches are developed in this field. Locating the target buildings in the drone camera is a crucial technique. However, it is difficult to collect extensive drone-view images and their bounding box annotations for supervised training. Therefore, we address this problem by formulating it as a weakly supervised task and using small amount of category labels as supervision. To extract representative features of cross-view and cross-device images, we propose a Hybrid Convolutional-Transformer (HCT) framework for detection given the very few image-level annotated images. To better evaluate the proposed method in the realistic drone delivery task, we build a drone-view object detection dataset based on the University-1652 benchmark by annotating bounding boxes of target buildings. Extensive experimental results demonstrate the effectiveness of the proposed method.

查看译文

关键词

Vision Transformer,Few-shot learning,Weakly supervised learning,Object detection

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要