A Content-Based Dataset Recommendation System for Biomedical Datasets

Zitong Zhang,Yaseen Ashraf

ICICT(2023)

引用 0|浏览4
暂无评分
摘要
Nowadays, with the rapid development of cloud data and online collaboration platforms, there is a growing trend among researchers to make their data publicly available for experimental reproducibility and data reusability. On one hand, sharing data with collaborators increases the visibility of the work. On the other hand, the abundance of data on multiple platforms makes it hard for researchers to find data relevant to their own research. To overcome this challenge, a dataset recommendation system capable of finding relevant datasets from multiple resources would be helpful. In the past two decades, few dataset recommendation methods have been implemented, that are mostly domain-specific or simply recommend datasets based on keywords. We believe a general dataset recommender system that recommends datasets with information either extracted from another dataset or supplied by researchers can enhance researchers’ efficiency in searching for relevant data and significantly improve their research efficiency. This work adopts an information retrieval (IR) paradigm for dataset recommendation. By extracting summary information from each dataset and generating a profile for each, we use and compare multiple content-based recommendation methods to recommend the most-relevant datasets in GEO, SRA, and several other repositories. Our results and evaluations prove the usefulness and need for such system.
更多
查看译文
关键词
dataset recommendation,content-based recommender,genomics data
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要