End-to-end text recognition with convolutional neural networks

ICPR(2012)

引用 1127|浏览123
暂无评分
摘要
Full end-to-end text recognition in natural images is a challenging problem that has received much attention recently. Traditional systems in this area have relied on elaborate models incorporating carefully hand-engineered features or large amounts of prior knowledge. In this paper, we take a different route and combine the representational power of large, multilayer neural networks together with recent developments in unsupervised feature learning, which allows us to use a common framework to train highly-accurate text detector and character recognizer modules. Then, using only simple off-the-shelf methods, we integrate these two modules into a full end-to-end, lexicon-driven, scene text recognition system that achieves state-of-the-art performance on standard benchmarks, namely Street View Text and ICDAR 2003.
更多
查看译文
关键词
off-the-shelf method,street view text,convolutional neural network,text detection,scene text recognition system,multilayer perceptrons,feature extraction,character recognizer module,lexicon driven recognition system,natural scenes,handwritten character recognition,icdar 2003,unsupervised feature learning,multilayer neural network,unsupervised learning,end-to-end text recognition,natural image processing
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要