Hybrid Convolutional-Transformer framework for drone-based few-shot weakly supervised object detection

Computers and Electrical Engineering (2022)

Abstract
Drone delivery is becoming a new trend in logistics, but little research has been done in this field. Locating the target buildings in the drone camera view is a crucial technique. However, it is difficult to collect extensive drone-view images with bounding-box annotations for supervised training. We therefore formulate the problem as a weakly supervised task and use a small number of category labels as supervision. To extract representative features from cross-view and cross-device images, we propose a Hybrid Convolutional-Transformer (HCT) framework that performs detection given very few images with image-level annotations. To better evaluate the proposed method on a realistic drone delivery task, we build a drone-view object detection dataset based on the University-1652 benchmark by annotating bounding boxes of target buildings. Extensive experimental results demonstrate the effectiveness of the proposed method.
Keywords
Vision Transformer, Few-shot learning, Weakly supervised learning, Object detection