Layout Analysis and Text Column Segmentation for Historical Vietnamese Steles

Proceedings of the 5th International Workshop on Historical Document Imaging and Processing(2019)

引用 10|浏览8
暂无评分
摘要
Stone engravings in Historical Vietnamese steles allow historians to study the life of common people in the villages. Only recently, a large amount of images of such engravings have become available. For supporting the historians, automatic document analysis systems are needed for reading the ancient Chu Nôm characters that are written in columns from top to bottom. In this paper, we study the problem of layout analysis, which is the first step of automatic reading. Semantic segmentation is applied at pixel-level to find the title, main text, label, and reference number on the page using deep convolutional neural networks. Afterwards, seam carving is used to segment the text columns within the main text. We present baseline results for hundred exemplary pages, discuss error cases, and outline lines of future research.
更多
查看译文
关键词
document layout analysis, historical Vietnamese steles, seam carving, semantic segmentation, text column segmentation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要