The IBEM dataset: A large printed scientific image dataset for indexing and searching mathematical expressions

Pattern Recognition Letters (2023)

Abstract
• The new IBEM dataset has more than 160,000 mathematical expressions extracted from 600 documents comprising 8,200 page images.
• The IBEM dataset includes ground truth consisting of the page-level location of each formula and its associated LaTeX markup transcription.
• The use cases of the IBEM dataset are mathematical expression detection, extraction, recognition, and retrieval.
• The paper provides benchmark results on symbol classification and mathematical expression recognition for comparison purposes.
Keywords
scientific image dataset, indexing, IBEM