Title: The chicken or the egg? Plastome evolution and an independent loss of the inverted repeat in papilionoid legumes
Summary The plastid genome (plastome), while surprisingly constant in gene order and content across most photosynthetic angiosperms, exhibits variability in several unrelated lineages. During the diversification history of the legume family Fabaceae, plastomes have undergone many rearrangements, including inversions, expansion, contraction and loss of the typical inverted repeat (IR), gene loss and repeat accumulation in both shared and independent events. While legume plastomes have been the subject of study for some time, most work has focused on agricultural species in the IR‐lacking clade (IRLC) and the plant model Medicago truncatula. The subfamily Papilionoideae, which contains virtually all of the agricultural legume species, also comprises most of the plastome variation detected thus far in the family. In this study three non‐papilionoids were included among 34 newly sequenced legume plastomes, along with 33 publicly available sequences, to assess plastome structural evolution in the subfamily. In an effort to examine plastome variation across the subfamily, approximately 20% of the sampling represents the IRLC with the remainder selected to represent the early‐branching papilionoid clades. A number of IR‐related and repeat‐mediated changes were identified and examined in a phylogenetic context. Recombination between direct repeats associated with ycf2 resulted in intraindividual plastome heteroplasmy. Although loss of the IR has not been reported in legumes outside of the IRLC, one genistoid taxon was found to completely lack the typical plastome IR. The role of the IR and non‐IR repeats in the progression of plastome change is discussed.
Award ID(s):
1853024
PAR ID:
10449648
Publisher / Repository:
Wiley-Blackwell
Journal Name:
The Plant Journal
Volume:
107
Issue:
3
ISSN:
0960-7412
Page Range / eLocation ID:
p. 861-875
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract The inverted repeat (IR) lacking clade (IRLC) is a monophyletic group within the Papilionoideae subfamily of Fabaceae where plastid genomes (plastomes) do not contain the large IR typical of land plants. Recently, an IRLC legume, Medicago minima, was found to have regrown a ~9 kb IR that contained a number of canonical IR genes, and closely related M. lupulina contained an incomplete IR of ~425 bp. Complete plastomes were generated for seven additional species, putative members of the M. minima clade. Polymerase chain reaction was employed to investigate the presence of the IR across M. minima and M. lupulina including individuals of nine and eight Eurasian and North African accessions and 15 and 14 Texas populations, respectively. While no sequence similar to the ~9 kb IR was detected among the seven newly sequenced plastomes, all Eurasian and North African accessions of M. minima contained the IR. Variation in IR extent was detected within and between the Texas populations. Expansions of 13 bp and 11 bp occurred at the boundaries of both IR/small single‐copy regions, and populations had one or the other expansion, but not both. Expansion of the IR was not detected in the accessions from Eurasia and North Africa, suggesting recent mutations yielded at least two additional plastid haplotypes in M. minima.
  2. Abstract Although plastid genome (plastome) structure is highly conserved across most seed plants, investigations during the past two decades have revealed several disparately related lineages that experienced substantial rearrangements. Most plastomes contain a large inverted repeat and two single‐copy regions, and a few dispersed repeats; however, the plastomes of some taxa harbour long repeat sequences (>300 bp). These long repeats make it challenging to assemble complete plastomes using short‐read data, leading to misassemblies and consensus sequences with spurious rearrangements. Single‐molecule, long‐read sequencing has the potential to overcome these challenges, yet there is no consensus on the most effective method for accurately assembling plastomes using long‐read data. We generated a pipeline, plastid Genome Assembly Using Long‐read data (ptGAUL), to address the problem of plastome assembly using long‐read data from Oxford Nanopore Technologies (ONT) or Pacific Biosciences platforms. We demonstrated the efficacy of the ptGAUL pipeline using 16 published long‐read data sets. We showed that ptGAUL quickly produces accurate and unbiased assemblies using only ~50× coverage of plastome data. Additionally, we deployed ptGAUL to assemble four new Juncus (Juncaceae) plastomes using ONT long reads. Our results revealed many long repeats and rearrangements in Juncus plastomes compared with basal lineages of Poales. The ptGAUL pipeline is available on GitHub: https://github.com/Bean061/ptgaul.
  3. Comprising 501 genera and around 14,000 species, Papilionoideae is not only the largest subfamily of Fabaceae (Leguminosae; legumes), but also one of the most extraordinarily diverse clades among angiosperms. Papilionoids are a major source of food and forage, are ecologically successful in all major biomes, and display dramatic variation in both floral architecture and plastid genome (plastome) structure. Plastid DNA-based phylogenetic analyses have greatly improved our understanding of relationships among the major groups of Papilionoideae, yet the backbone of the subfamily phylogeny remains unresolved. In this study, we sequenced and assembled 39 new plastomes covering key genera that represent the morphological diversity in the subfamily. From 244 total taxa, we produced eight datasets for maximum likelihood (ML) analyses based on entire plastomes and/or concatenated sequences of 77 protein-coding sequences (CDS) and two datasets for multispecies coalescent (MSC) analyses based on individual gene trees. We additionally produced a combined nucleotide dataset comprising CDS plus matK gene sequences only, in which most papilionoid genera were sampled. A ML tree based on the entire plastome maximally supported all of the deep and most recent divergences of papilionoids (223 out of 236 nodes). The Swartzieae, ADA (Angylocalyceae, Dipterygeae, and Amburaneae), Cladrastis, Andira, and Exostyleae clades formed a grade to the remainder of the Papilionoideae, concordant with nine ML and two MSC trees. Phylogenetic relationships among the remaining five papilionoid lineages (Vataireoid, Dermatophyllum, Genistoid s.l., Dalbergioid s.l., and Baphieae + Non-Protein Amino Acid Accumulating or NPAAA clade) remained uncertain, because of insufficient support and/or conflicting relationships among trees. Our study fully resolved most of the deep nodes of Papilionoideae; however, some relationships require further exploration. More genome-scale data and rigorous analyses are needed to disentangle phylogenetic relationships among the five remaining lineages.
  4. Abstract The complete chloroplast and mitochondrial genomes of Charophyta have shed new light on land plant terrestrialization. Here, we report the organellar genomes of the Zygnema circumcarinatum strain UTEX 1559, and a comparative genomics investigation of 33 plastomes and 18 mitogenomes of Chlorophyta, Charophyta (including UTEX 1559 and its conspecific relative SAG 698-1a), and Embryophyta. Gene presence/absence was determined across these plastomes and mitogenomes. A comparison between the plastomes of UTEX 1559 (157 548 bp) and SAG 698-1a (165 372 bp) revealed very similar gene contents, but substantial genome rearrangements. Surprisingly, the two plastomes share only 85.69% nucleotide sequence identity. The UTEX 1559 mitogenome size is 215 954 bp, the largest among all sequenced Charophyta. Interestingly, this large mitogenome contains a 50 kb region without homology to any other organellar genomes, which is flanked by two 86 bp direct repeats and contains 15 ORFs. These ORFs have significant homology to proteins from bacteria and plants with functions such as primase, RNA polymerase, and DNA polymerase. We conclude that (i) the previously published SAG 698-1a plastome is probably from a different Zygnema species, and (ii) the 50 kb region in the UTEX 1559 mitogenome might be recently acquired as a mobile element. 
  5. Introgression can produce novel genetic variation in organisms that hybridize. Sympatric species pairs in the carnivorous plant genus Sarracenia L. frequently hybridize, and all known hybrids are fertile. Despite the genus being a desirable system for studying the evolutionary consequences of hybridization, knowledge of the extent to which introgression occurs in it is limited to a few species at only two field sites. Previous phylogenomic analysis of Sarracenia estimated a highly resolved species tree from 199 nuclear genes, but revealed a plastid genome that is highly discordant with the species tree. Such cytonuclear discordance could be caused by chloroplast introgression (i.e. chloroplast capture) or incomplete lineage sorting (ILS). To better understand the extent to which introgression is occurring in Sarracenia, the chloroplast capture and ILS hypotheses were formally evaluated. Plastomes were assembled de novo from sequencing reads generated from 17 individuals in addition to reads obtained from the previous study. Assemblies of 14 whole plastomes were generated and annotated, and the remaining fragmented assemblies were scaffolded to these whole-plastome assemblies. Coding sequences from 79 homologous genes were aligned and concatenated for maximum-likelihood phylogeny estimation. The plastome tree is extremely discordant with the published species tree. Plastome trees were simulated under the coalescent and tree distance from the species tree was calculated to generate a null distribution of discordance that is expected under ILS alone. A t-test rejected the null hypothesis that ILS could cause the level of discordance seen in the plastome tree, suggesting that chloroplast capture must be invoked to explain the discordance. Due to the extreme level of discordance in the plastome tree, it is likely that chloroplast capture has been common in the evolutionary history of Sarracenia.