Assembly and analysis of the genome of Notholithocarpus densiflorus

Ying Cai, Ellis Anderson, Wen Xue, Sylvia Wong, Luman Cui,Xiaofang Cheng,Ou Wang, Qing Mao, Sophie Jia Liu,John T. Davis, Paulo R. Magalang,Douglas Schmidt,Takao Kasuga, Matteo Garbelotto, Radoje Drmanac,Chai-Shian Kua,Charles Cannon,Julin N. Maloof,Brock A. Peters

G3-GENES GENOMES GENETICS(2024)

引用 0|浏览4
暂无评分
摘要
Tanoak (Notholithocarpus densiflorus) is an evergreen tree in the Fagaceae family found in California and southern Oregon. Historically, tanoak acorns were an important food source for Native American tribes, and the bark was used extensively in the leather tanning process. Long considered a disjunct relictual element of the Asian stone oaks (Lithocarpus spp.), phylogenetic analysis has determined that the tanoak is an example of convergent evolution. Tanoaks are deeply divergent from oaks (Quercus) of the Pacific Northwest and comprise a new genus with a single species. These trees are highly susceptible to "sudden oak death" (SOD), a plant pathogen (Phytophthora ramorum) that has caused widespread deaths of tanoaks. In this study, we set out to assemble the genome and perform comparative studies among a number of individuals that demonstrated varying levels of susceptibility to SOD. First, we sequenced and de novo assembled a draft reference genome of N. densiflorus using cobarcoded library processing methods and an MGI DNBSEQ-G400 sequencer. To increase the contiguity of the final assembly, we also sequenced Oxford Nanopore long reads to 30x coverage. To our knowledge, the draft genome reported here is one of the more contiguous and complete genomes of a tree species published to date, with a contig N50 of similar to 1.2 Mb, a scaffold N50 of similar to 2.1 Mb, and a complete gene score of 95.5% through BUSCO analysis. In addition, we sequenced 11 genetically distinct individuals and mapped these onto the draft reference genome, enabling the discovery of almost 25 million single nucleotide polymorphisms and similar to 4.4 million small insertions and deletions. Finally, using cobarcoded data, we were able to generate a complete haplotype coverage of all 11 genomes.
更多
查看译文
关键词
stLFR,cobarcoding,tanoak,SOD
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要