Image annotation and curation in radiology: an overview for machine learning practitioners

European Radiology Experimental(2024)

引用 0|浏览1
暂无评分
摘要
“Garbage in, garbage out” summarises well the importance of high-quality data in machine learning and artificial intelligence. All data used to train and validate models should indeed be consistent, standardised, traceable, correctly annotated, and de-identified, considering local regulations. This narrative review presents a summary of the techniques that are used to ensure that all these requirements are fulfilled, with special emphasis on radiological imaging and freely available software solutions that can be directly employed by the interested researcher. Topics discussed include key imaging concepts, such as image resolution and pixel depth; file formats for medical image data storage; free software solutions for medical image processing; anonymisation and pseudonymisation to protect patient privacy, including compliance with regulations such as the Regulation (EU) 2016/679 “General Data Protection Regulation” (GDPR) and the 1996 United States Act of Congress “Health Insurance Portability and Accountability Act” (HIPAA); methods to eliminate patient-identifying features within images, like facial structures; free and commercial tools for image annotation; and techniques for data harmonisation and normalisation. Relevance statement This review provides an overview of the methods and tools that can be used to ensure high-quality data for machine learning and artificial intelligence applications in radiology. Key points • High-quality datasets are essential for reliable artificial intelligence algorithms in medical imaging. • Software tools like ImageJ and 3D Slicer aid in processing medical images for AI research. • Anonymisation techniques protect patient privacy during dataset preparation. • Machine learning models can accelerate image annotation, enhancing efficiency and accuracy. • Data curation ensures dataset integrity, compliance, and quality for artificial intelligence development. Graphical Abstract
更多
查看译文
关键词
Artificial intelligence,Data curation,Image processing (computer-assisted),Machine learning,Privacy
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要