Electron ionization mass spectrometry feature peak relationships combined with deep classification model to assist similarity algorithm for fast and accurate identification of compounds.

Jiajiang Zhang, Junhui Li, Binbin Ding, Qusheng Qiao, Haixing Wang, Yanchao Liang, Peihong Song, Yong Wu, Chenlu Wang,Keqi Tang,Jiancheng Yu

Rapid communications in mass spectrometry : RCM(2024)

Cited 0|Views1
No score
Abstract
RATIONALE:Gas chromatography-mass spectrometry (GC-MS) combines chromatography and MS, providing full play to the advantages of high separation efficiency of GC, strong qualitative ability of MS, and high sensitivity of detector. In GC-MS data processing, determining the experimental compounds is one of the most important analytical steps, which is usually realized by one-to-one similarity calculations between the experimental mass spectrum and the standard mass spectrum library. Although the accuracy of the algorithm has been improved in recent years, it is still difficult to distinguish structurally similar mass spectra, especially isomers. At the same time, the library capacity is very large and increasing every year, and the algorithm needs to perform large numbers of calculations with irrelevant compounds in the library to recognize unknown compounds, which leads to a significant reduction in efficiency. METHODS:This work proposed to exclude a large number of irrelevant mass spectra by presearching, perform preliminary similarity calculations using similarity algorithms, and finally improve the accuracy of similarity calculations using deep classification models. The replica library of NIST17 is used as the query data, and the master library is used as the reference database. RESULTS:Compared with the traditional recognition algorithm, the preprocessing algorithm has reduced the time by 4.2 h, and by adding the deep learning models 1 and 2 as the final determination, the recognition accuracy has been improved by 1.9% and 6.5%, respectively, based on the original algorithm. CONCLUSIONS:This method improves the recognition efficiency compared to conventional algorithms and at the same time has better recognition accuracy for structurally similar mass spectra and isomers.
More
Translated text
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined