ViT-YOLO:Transformer-Based YOLO for Object Detection
2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW)(2021)
Key words
improved backbone MHSA-Darknet,differentiated features,object detection,multihead self-attention,path-aggregation neck,cross-scale feature fusion,time-test augmentation,test-challenge data,drone,distractors,general object detectors,convolutional networks,vision backbone architectures,global context information,effective weighted bidirectional feature pyramid network,wighted boxes fusion,ViT-YOLO
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined