A k-mer-based pangenome approach for cataloging seed-storage-protein genes in wheat to facilitate genotype-to-phenotype prediction and improvement of end-use quality

Molecular Plant(2024)

Cited 0|Views2
No score
Abstract
Wheat (Triticum aestivum L.) is a staple food for more than 35% of the world's population, with flour used to make hundreds of baked goods. Superior end-use quality is a major breeding target, however, improving it is especially time-consuming and expensive. Furthermore, genes encoding seed-storage proteins (SSPs) form multi-gene families and are repetitive, with gaps commonplace in several genome assemblies. To overcome these barriers and efficiently identify superior wheat SSP alleles, we developed ‘PanSK’ (Pan-SSP k-mer) for genotype-to-phenotype prediction based on a SSP-based pangenome resource. PanSK uses 29-mer sequences that represent each SSP gene at the pangenomic level to reveal untapped diversity across landraces and modern cultivars. Genome-wide association studies with k-mer identified 23 SSP genes associated with end-use quality representing novel targets for improvement. We evaluated the effect of rye secalin genes on end-use quality and found that removing ω-secalins from 1BL/1RS wheat translocation lines is associated with enhanced end-use quality. Finally, using machine-learning-based prediction inspired by PanSK, we predict quality phenotypes with high accuracy from genotype alone. This study provides an effective approach for genome design based on SSP genes, enabling breeding of wheat varieties with superior processing capabilities and improved end-use quality.
More
Translated text
Key words
wheat,seed-storage protein,end-use quality,k-mer,pangenome,genomic prediction
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined