A robust table registration method for batch table OCR processing

MOCR '13: Proceedings of the 4th International Workshop on Multilingual OCR(2013)

引用 2|浏览0
暂无评分
摘要
A robust table registration method is proposed in this paper for a better understanding on structured information from scanned table images. Scanned images can be heavily degraded because of scanning effects, binarization or purely document itself. For batch processing images with the same table structure, normally the table model is provided and can be used to overcome most challenging quality factors. The given table model is used as the ground truth in this paper. However, only rough precision is needed on table cell dimensions and this makes providing the table model an easier task. The method was tested on Multilingual Automatic Document Classification Analysis and Translation (MADCAT) images and a promising performance is achieved.
更多
查看译文
关键词
scanned table image,table model,table structure,robust table registration method,challenging quality factor,better understanding,table cell dimension,batch table ocr processing,multilingual automatic document classification,batch processing image,scanned image,document processing
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要