End-to-End Multilingual Text Recognition Based on Byte Modeling.

Jiajia Wu, Kun Zhao, Zhengyan Yang, Bing Yin, Cong Liu, Li-Rong Dai

Image and Graphics: 12th International Conference, ICIG 2023, Nanjing, China, September 22–24, 2023, Proceedings, Part III (2023)

Abstract
Multilingual text recognition is increasingly widely used in computer vision. In practical applications, however, modeling each language independently cannot fully exploit the information shared across languages and consumes substantial hardware resources, which makes unified modeling of multiple languages necessary. A natural approach to unified multilingual modeling is to combine the modeling units (characters, subwords, or words) of all languages into one large vocabulary and then apply a sequence-to-sequence model. However, this vocabulary is often very large, which makes modeling difficult. In this paper, we propose a byte-based multilingual text recognition method that reduces the vocabulary size to only 256 and thereby effectively solves the problem of unified modeling. Experiments show that our method effectively utilizes the information shared between languages and outperforms the independently modeled baseline by a large margin.
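To illustrate why byte-level targets keep the vocabulary at 256 regardless of how many scripts are covered, here is a minimal sketch. It assumes UTF-8 as the byte encoding, which the abstract does not specify, and the helper names `text_to_byte_ids` and `byte_ids_to_text` are hypothetical; it is not the authors' implementation.

```python
# Minimal sketch (assumption: UTF-8 byte encoding; helper names are hypothetical).
# Any string, in any script, maps to a sequence of integers in [0, 255],
# so one output layer of size 256 covers every language.

def text_to_byte_ids(text: str) -> list[int]:
    """Encode a transcription as a sequence of byte IDs (0..255)."""
    return list(text.encode("utf-8"))


def byte_ids_to_text(ids: list[int]) -> str:
    """Decode predicted byte IDs back to text.

    Invalid byte sequences are replaced rather than raising, since a
    recognizer may emit bytes that do not form valid UTF-8.
    """
    return bytes(ids).decode("utf-8", errors="replace")


if __name__ == "__main__":
    for sample in ["hello", "你好", "привет"]:
        ids = text_to_byte_ids(sample)
        # Every ID fits in the same 256-entry vocabulary.
        assert all(0 <= i < 256 for i in ids)
        # The mapping is lossless for valid text.
        assert byte_ids_to_text(ids) == sample
        print(f"{sample!r}: {len(sample)} characters -> {len(ids)} bytes")
```

The trade-off visible in this sketch is sequence length: under UTF-8, non-Latin scripts expand to several bytes per character, so the target sequence grows while the vocabulary stays fixed.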
Keywords
multilingual text recognition, byte modeling, end-to-end