Chrome Extension
WeChat Mini Program
Use on ChatGLM

Automatic Captioning Based on Visible and Infrared Images

Wang Yan,Lou Shuli, Wang Kai,Yuan Xiaohu,Liu Huaping

ICRA 2024(2024)

Cited 0|Views2
No score
Abstract
In this paper, we tackle the task of image captioning with the complementarity of visible light images and infrared images. To address this problem, we propose an RGB-IR image fusion captioning model, which can take full advantage of visible light images and infrared images under different conditions. Meanwhile, we develop a wearable environment-assisted system. In addition, we collect and annotate a new dataset containing 3510 pairs of RGB-IR images to support model training. Finally, we conduct extensive experiments to evaluate the model and system. Experimental results show that our new method and system significantly outperform baselines on multiple metrics and have potential practical value.
More
Translated text
Key words
Automation Technologies for Smart Cities,Human-Centered Automation
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined