From Multi-Scale Grids to Dynamic Regions: Dual-relation Enhanced Transformer for Image Captioning
Knowledge-Based Systems(2025)
Key words
Image captioning,Transformer,Multi-scale features,Dynamic region selection,Adaptive gating mechanism
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined