A genome sequence for the threatened whitebark pine

Neale, David B.; Zimin, Aleksey V.; Meltzer, Amy; Bhattarai, Akriti; Amee, Maurice (ORCID:0000000203801634); Figueroa Corona, Laura (ORCID:0000000328725428); Allen, Brian J.; Puiu, Daniela; Wright, Jessica; De La Torre, Amanda R. (ORCID:000000016647723X); McGuire, Patrick E.; Timp, Winston (ORCID:0000000320836027); Salzberg, Steven L. (ORCID:0000000288597432); Wegrzyn, Jill L. (ORCID:0000000159230888); Ingvarsson, P. (ed.)

doi:10.1093/g3journal/jkae061

Abstract: Whitebark pine (WBP, Pinus albicaulis) is a white pine of subalpine regions in the Western contiguous United States and Canada. WBP has become critically threatened throughout a significant part of its natural range due to mortality from the introduced fungal pathogen white pine blister rust (WPBR, Cronartium ribicola) and additional threats from mountain pine beetle (Dendroctonus ponderosae), wildfire, and maladaptation due to changing climate. Vast acreages of WBP have suffered nearly complete mortality. Genomic technologies can contribute to a faster, more cost-effective approach to the traditional practices of identifying disease-resistant, climate-adapted seed sources for restoration. With deep-coverage Illumina short reads of haploid megagametophyte tissue and Oxford Nanopore long reads of diploid needle tissue, followed by a hybrid, multistep assembly approach, we produced a final assembly containing 27.6 Gb of sequence in 92,740 contigs (N50 537,007 bp) and 34,716 scaffolds (N50 2.0 Gb). Approximately 87.2% (24.0 Gb) of total sequence was placed on the 12 WBP chromosomes. Annotation yielded 25,362 protein-coding genes, and over 77% of the genome was characterized as repeats. WBP has demonstrated the greatest variation in resistance to WPBR among the North American white pines. Candidate genes for quantitative resistance include disease resistance genes known as nucleotide-binding leucine-rich repeat receptors (NLRs). A combination of protein domain alignments and direct genome scanning was employed to fully describe the 3 subclasses of NLRs. Our high-quality reference sequence and annotation provide a marked improvement in NLR identification compared to previous assessments that leveraged de novo-assembled transcriptomes.
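
The contig and scaffold N50 values quoted in the abstract are contiguity statistics: N50 is the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly. The following is a minimal sketch of that standard definition, not the authors' pipeline. The function name `n50` and the toy lengths are hypothetical and are not WBP data.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths (in bp).

    Standard definition: sort lengths from longest to shortest and
    return the length at which the running total first reaches at
    least half of the summed length.
    """
    if not lengths:
        raise ValueError("lengths must be non-empty")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical toy contig lengths, for illustration only (not WBP data).
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))  # prints 70: 80 + 70 = 150, half of the 300 total
```

On a real assembly, the lengths would typically be read from the FASTA index or a similar per-sequence length table before being passed to such a function.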
