Sequencing refractory regions in bird genomes are hotspots for accelerated protein evolution

R. Huttener,L. Thorrez,T. In’t Veld, M. Granvik,L. Van Lommel, E. Waelkens, R. Derua,K. Lemaire, L. Goyvaerts, S. De Coster, J. Buyse,F. Schuit

BMC ECOLOGY AND EVOLUTION(2021)

引用 7|浏览21
暂无评分
摘要
Background Approximately 1000 protein encoding genes common for vertebrates are still unannotated in avian genomes. Are these genes evolutionary lost or are they not yet found for technical reasons? Using genome landscapes as a tool to visualize large-scale regional effects of genome evolution, we reexamined this question. Results On basis of gene annotation in non-avian vertebrate genomes, we established a list of 15,135 common vertebrate genes. Of these, 1026 were not found in any of eight examined bird genomes. Visualizing regional genome effects by our sliding window approach showed that the majority of these "missing" genes can be clustered to 14 regions of the human reference genome. In these clusters, an additional 1517 genes (often gene fragments) were underrepresented in bird genomes. The clusters of “missing” genes coincided with regions of very high GC content, particularly in avian genomes, making them “hidden” because of incomplete sequencing. Moreover, proteins encoded by genes in these sequencing refractory regions showed signs of accelerated protein evolution. As a proof of principle for this idea we experimentally characterized the mRNA and protein products of four "hidden" bird genes that are crucial for energy homeostasis in skeletal muscle: ALDOA , ENO3 , PYGM and SLC2A4 . Conclusions A least part of the “missing” genes in bird genomes can be attributed to an artifact caused by the difficulty to sequence regions with extreme GC% (“hidden” genes). Biologically, these “hidden” genes are of interest as they encode proteins that evolve more rapidly than the genome wide average. Finally we show that four of these “hidden” genes encode key proteins for energy metabolism in flight muscle.
更多
查看译文
关键词
Avian genomes, Evolution, Accelerated, Sequencing artifacts, Transcript landscapes, Missing genes, GLUT4, ENO3, ALDOA, PYGM
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要