Efficient Joint Rectification of Photometric and Geometric Distortions in Document Images

Hao Tang, Junyuan Guo,Teng Wang,Yanwei Yu,Chao Wang

ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)(2024)

引用 0|浏览0
暂无评分
摘要
Document images captured with cameras often exhibit photometric and geometric distortions. Here, we propose a novel learning-based approach for efficient joint rectification of document images. Inspired by the strong correlation between visual shadows and physical deformations, we design a shared encoder architecture to fully leverage structured document features. A cross-attention module is introduced to facilitate information exchange between deformation and coordinate domains. Our method effectively addresses both geometric and photometric distortions in an end-to-end manner, making it highly valuable for applications involving camera-captured document images.
更多
查看译文
关键词
Computer vision,document image rectification,cross-attention
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要