BCRNet: Bidirectional contrastive representation network for deep multimodal learning of exercise representations in online education systems

Neurocomputing(2023)

引用 0|浏览5
暂无评分
摘要
In online education systems, learning exercise representation is a fundamental task in many applications, such as exercise retrieval and recommendation. To capture the heterogeneous data information of exercises (i.e., texts and images), deep multimodal approaches show the promising performance. However, these methods have two limitations: (1) they only care about context on one side, which fails to utilize the future context chunks in the exercises; (2) they cannot ensure the presentation ability due to the scarcity of labelled data. In this paper, we propose a bidirectional contrastive representation network (BCRNet) to tackle these issues. First, we construct a representation module with a masking constraint loss to take into account the bidirectional context contents of exercises. Second, we design a contrastive learning approach which uses a multimodal contrastive loss to reshape the multimodal representation space and improve model presentation ability without labelled data. Moreover, a text-image matching strategy is designed to provide semantic links between texts and images. On the real-world dataset, experiments demonstrate BCRNet performs significantly better than many strong baselines.
更多
查看译文
关键词
bidirectional contrastive representation network,exercise representations,deep multimodal learning,online education systems
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要