Web acquired image datasets need curation: an examplar pipeline evaluated on Greek food images
2021 IEEE International Conference on Imaging Systems and Techniques (IST)(2021)
摘要
Mining Web data to create AI-usable datasets, is still non-trivial. Unfortunately, despite the free data access, the formation of a dataset useful for machine learning applications cannot rely solely on a data mining phase. For any given query, the retrieved sample may include duplicated, misclassified or completely irrelevant content. The consequence of not “cleaning” those datasets is to end up ...
更多查看译文
关键词
Conferences,Pipelines,Imaging,Machine learning,Ontologies,Cleaning,Data mining
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要