Fine-grained image emotion captioning based on Generative Adversarial Networks

Chunmiao Yang,Yang Wang,Liying Han, Xiran Jia, Hebin Sun

Multimedia Tools and Applications(2024)

引用 0|浏览0
暂无评分
摘要
Image captioning, which combines natural language processing and computer vision, has developed rapidly in recent years. It tends to be applied in data retrieval, blind navigation, intelligent transportation, smart home, medical assistance, news media and other domains. In order to elevate the consistency and abundance of image captioning languages and express people's subjective emotions effectively, a Generative Adversarial Network (GAN) is applied in this paper to obtain multi-stylized image emotion captions and generate two captions containing positive and negative emotions, respectively. Among them, Residual Network (ResNet) and Gate Recurrent Unit (GRU) are integrated into the generator, while the capsule neural network is applied to the discriminator. We conduct experiments on the popular MSCOCO and Senticap datasets to validate the model and demonstrate its satisfied performance in comparison to current advanced image captioning approaches.
更多
查看译文
关键词
Generative Adversarial Network (GAN),Gate Recurrent Unit (GRU),Capsule neural network,Image emotion captioning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要