Multi-Modality Latent Interaction Network for Visual Question Answering
2019 IEEE/CVF International Conference on Computer Vision (ICCV)(2019)
Key words
visual language information,multimodality latent interaction module,cross-modality relationships,latent visual language summarizations,latent representations,cross-modality information,latent summarizations,visual word features,multimodality latent interaction network,visual question answering,multimodality features,VQA benchmark,TDIUC benchmark
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined