Pedestrian Tracking Based on Receptive Field Improvement: A One-Shot Multiobject Tracking Approach Based on Vision Sensors

IEEE SENSORS JOURNAL(2023)

Cited 0|Views9
No score
Abstract
Multiobject tracking (MOT) in video sequences has gradually become one of the most essential fields in computer vision tasks. As the use of two separate models for feature extraction in two-shot methods precludes them from achieving real-time inference, one-shot methods have been developed to accelerate the running speed by integrating the position estimation of interested targets and extraction of the corresponding appearance embedding features. However, the one-shot MOT accuracy is not as good as that of the two-shot methods. In our study, we design a reinforced one-shot MOT system to further promote the MOT capability. First of all, for the problem of target scale transformation, a receptive field module (RFM) that applies dilated convolutions to acquire diverse receptive fields is presented. Then, we design an attention mechanism network to extract channel and positional information. Finally, we combine the circle loss with Euclidean distance optimization and cross-entropy loss to enhance the learning of discriminative embeddings. Three public pedestrian tracking datasets are applied to verify the effectiveness and superiority of our presented algorithm. In particular, we achieve 79.0 MOT accuracy (MOTA) and 78.4 identification F-score (IDF1) on the train set of MOT16 with 24.1 frames/s inference speed on an NVIDIA GeForce RTX 3090 GPU.
More
Translated text
Key words
Autonomous driving,deep learning (DL),multiobject tracking (MOT),one-shot,receptive field
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined