Single-molecule real-time sequencing of the full-length transcriptome of purple garlic (Allium sativum L. cv. Leduzipi) and identification of serine O-acetyltransferase family proteins involved in cysteine biosynthesis

Le Wang, Chao Zhang,Wei Yin,Wei Wei, Yonghong Wang,Wei Sa,Jian Liang

JOURNAL OF THE SCIENCE OF FOOD AND AGRICULTURE(2022)

引用 3|浏览16
暂无评分
摘要
BACKGROUND: Garlic (Allium sativum L.), whose bioactive components are mainly organosulfur compounds (OSCs), is a herbaceous perennial widely consumed as a green vegetable and a condiment. Yet, the metabolic enzymes involved in the biosynthesis of OSCs are not identified in garlic. RESULTS: Here, a full-length transcriptome of purple garlic was generated via PacBio and Illumina sequencing, to characterize the garlic transcriptome and identify key proteins mediating the biosynthesis of OSCs. Overall, 22.56 Gb of clean data were generated, resulting in 454 698 circular consensus sequence (CCS) reads, of which 83.4% (379 206) were identified as being fulllength non-chimeric reads - their further transcript clustering facilitated identification of 36 571 high-quality consensus reads. Once corrected, their genome-wide mapping revealed that 6140 reads were novel isoforms of known genes, and 2186 reads were novel isoforms from novel genes. We detected 1677 alternative splicing events, finding 2902 genes possessing either two or more poly(A) sites. Given the importance of serine O-acetyltransferase (SERAT) in cysteine biosynthesis, we investigated the five SERAT homologs in garlic. Phylogenetic analysis revealed a three-tier classification of SERAT proteins, each featuring a serine acetyltransferase domain (N-terminal) and one or two hexapeptide transferase motifs. Template-based modeling showed that garlic SERATs shared a common homo-trimeric structure with homologs from bacteria and other plants. The residues responsible for substrate recognition and catalysis were highly conserved, implying a similar reaction mechanism. In profiling the five SERAT genes' transcript levels, their expression pattern varied significantly among different tissues. CONCLUSION: This study's findings deepen our knowledge of SERAT proteins, and provide timely genetic resources that could advance future exploration into garlic's genetic improvement and breeding. (C) 2021 Society of Chemical Industry. Supporting information may be found in the online version of this article.
更多
查看译文
关键词
garlic, full-length transcriptome, organosulfur compounds, cysteine, serine O-acetyltransferase
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要