Representative Image Selection for Data Efficient Word Spotting

DAS (2020)

Abstract
This paper compares three word image representations as the basis for label-free sample selection for word spotting in historical handwritten documents. The representations are a temporal pyramid representation based on pixel counts, a graph-based representation, and a pyramidal histogram of characters (PHOC) representation predicted by a PHOCNet trained on synthetic data. We show that the PHOC representation can reduce the number of required training samples by up to 69%, depending on the dataset, if it is learned iteratively in an active-learning-like fashion. While this works for larger datasets containing about 1,700 images, for smaller datasets with 100 images we find that the temporal pyramid and graph representations perform better.
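To make the PHOC representation mentioned in the abstract concrete, the following is a minimal sketch of how a PHOC vector can be built from a transcription string. It is not the paper's code: in the paper the vector is predicted from a word image by a PHOCNet, whereas here it is computed directly from text to show what such a network is trained to output. The function name `phoc`, the 36-symbol alphabet, and the pyramid levels 2 to 5 are assumptions made for illustration, and the bigram bins used in some PHOC variants are left out.

```python
import string

# Assumed alphabet for illustration: lowercase letters plus digits (36 symbols).
ALPHABET = string.ascii_lowercase + string.digits


def phoc(word, levels=(2, 3, 4, 5), alphabet=ALPHABET):
    """Return a binary PHOC vector for `word` as a flat list of 0/1 values.

    At each pyramid level the word is split into `level` equal regions.
    A character is assigned to a region when at least half of the span it
    occupies in the word falls inside that region.
    """
    word = word.lower()
    n = len(word)
    vec = []
    for level in levels:
        for region in range(level):
            # Work in integer units of 1 / (n * level) to avoid float rounding.
            r_start, r_end = region * n, (region + 1) * n
            bits = [0] * len(alphabet)
            for i, ch in enumerate(word):
                if ch not in alphabet:
                    continue  # characters outside the assumed alphabet are skipped
                c_start, c_end = i * level, (i + 1) * level
                overlap = min(c_end, r_end) - max(c_start, r_start)
                # Character width is `level` units; require >= 50% overlap.
                if 2 * overlap >= level:
                    bits[alphabet.index(ch)] = 1
            vec.extend(bits)
    return vec
```

With these assumed settings, each word maps to a fixed-length binary vector of 36 × (2 + 3 + 4 + 5) = 504 entries, so words of different lengths can be compared with ordinary vector distances during sample selection.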
Keywords
representative image selection, data efficient word