Fully automatic resolution of untargeted GC-MS data with deep learning assistance.

Talanta(2022)

引用 10|浏览10
暂无评分
摘要
DeepResolution (Deep learning-assisted multivariate curve Resolution) has been proposed to solve the co-eluting problem for GC-MS data. However, DeepResolution models must be retrained when encountering unknown components, which is undoubtedly time-consuming and burdensome. In this study, a new pipeline named DeepResoution2 was proposed to overcome these limitations. DeepResolution2 utilizes deep neural networks to divide the profile into segments, estimate the number of components in each segment, and predict the elution region of each component. Subsequently, the information obtained by these deep learning models is used to assist the multivariate curve resolution procedure. Only seven models (1 + 1 + 5) are required to automate the whole analysis procedure of untargeted GC-MS data, which is an important improvement over DeepResolution. These seven models are stable and universal. Once established, they can be used to resolve most GC-MS data. Compared with MS-DIAL, ADAP-GC, and AMDIS, DeepResolution2 can obtain more reasonable mass spectra, chromatograms and peak areas to identify and quantify compounds. DeepResoution2 (0.955) outperformed AMDIS (0.939), MS-DIAL (0.948) and ADAP-GC (0.860) in terms of the linear correlation between concentrations and peak areas on overlapped peaks in fatty acid dataset. In real biological samples of human male infertility plasma, the peak areas and mass spectra of 136 untargeted GC-MS files were automatically extracted by DeepResolution2 without any prior information and manual intervention. DeepResolution2 includes all the functions for analyzing untargeted GC-MS datasets from the feature extraction of raw data files to the establishment of discriminant models.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要