Local and Long-Range Collaborative Learning for Remote Sensing Scene Classification.

IEEE Trans. Geosci. Remote. Sens.(2023)

Cited 4|Views15
No score
Abstract
With the development of high-resolution satellites, more and more attention has been paid to remote sensing (RS) scene classification. Convolutional neural networks (CNNs), which replace the traditional handcrafted features with a learning-based feature extraction mechanism, are widely used in scene classification. But, CNNs are less effective in deriving long-range contextual relations, which limits further improvement. Visual transformer (VT), an emerging image processing method, provides a new perspective for RS scene classification by directly acquiring long-range features. Although there have been limited works combining CNN and VT through simple concatenation, the collaborations between them are insufficient. To address these issues, we propose a local and long-range collaborative framework (L2RCF). First, we design a dual-stream structure to extract the local and long-range features. Second, a cross-feature calibration (CFC) module is designed for them to improve the representation of the fusion features. Then, combining deep supervision (DS) and deep mutual learning (DML), a novel joint loss is proposed to enhance the dual-stream feature extractor and further improve the fused features. Finally, a two-stage semi-supervised training strategy is designed to improve performance with unlabeled samples. To demonstrate the effectiveness of L2RCF, we conducted experiments on three widely used RS scene classification datasets: RSSCN7, AID, and NWPU. The results show that the L2RCF performs significantly better compared with some state-of-the-art scene classification methods.
More
Translated text
Key words
Convolutional neural network (CNN),cross-feature calibration (CFC),deep mutual learning (DML),deep supervision (DS),remote sensing (RS),scene classification,semi-supervised,visual transformer (VT)
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined