skip to main content

Title: Novel genetic code and record-setting AT-richness in the highly reduced plastid genome of the holoparasitic plant Balanophora

Plastid genomes (plastomes) vary enormously in size and gene content among the many lineages of nonphotosynthetic plants, but key lineages remain unexplored. We therefore investigated plastome sequence and expression in the holoparasitic and morphologically bizarre Balanophoraceae. The twoBalanophoraplastomes examined are remarkable, exhibiting features rarely if ever seen before in plastomes or in any other genomes. At 15.5 kb in size and with only 19 genes, they are among the most reduced plastomes known. They have no tRNA genes for protein synthesis, a trait found in only three other plastid lineages, and thusBalanophoraplastids must import all tRNAs needed for translation.Balanophoraplastomes are exceptionally compact, with numerous overlapping genes, highly reduced spacers, loss of allcis-spliced introns, and shrunken protein genes. With A+T contents of 87.8% and 88.4%, theBalanophoragenomes are the most AT-rich genomes known save for a single mitochondrial genome that is merely bloated with AT-rich spacer DNA. Most plastid protein genes inBalanophoraconsist of ≥90% AT, with several between 95% and 98% AT, resulting in the most biased codon usage in any genome described to date. A potential consequence of its radical compositional evolution is the novel genetic code used byBalanophoraplastids, in which TAG has been reassigned from stop to tryptophan. Despite its more » many exceptional properties, theBalanophoraplastome must be functional because all examined genes are transcribed, its only intron is correctlytrans-spliced, and its protein genes, although highly divergent, are evolving under various degrees of selective constraint.

« less
Authors:
; ; ; ; ; ; ; ; ;
Publication Date:
NSF-PAR ID:
10082478
Journal Name:
Proceedings of the National Academy of Sciences
Volume:
116
Issue:
3
Page Range or eLocation-ID:
p. 934-943
ISSN:
0027-8424
Publisher:
Proceedings of the National Academy of Sciences
Sponsoring Org:
National Science Foundation
More Like this
  1. Background

    In the past three decades, several studies have predominantly relied on a small sample of the plastome to infer deep phylogenetic relationships in the species-rich Melastomataceae. Here, we report the first full plastid sequences of this family, compare general features of the sampled plastomes to other sequenced Myrtales, and survey the plastomes for highly informative regions for phylogenetics.

    Methods

    Genome skimming was performed for 16 species spread across the Melastomataceae. Plastomes were assembled, annotated and compared to eight sequenced plastids in the Myrtales. Phylogenetic inference was performed using Maximum Likelihood on six different data sets, where putative biases were taken into account. Summary statistics were generated for all introns and intergenic spacers with suitable size for polymerase chain reaction (PCR) amplification and used to rank the markers by phylogenetic information.

    Results

    The majority of the plastomes sampled are conserved in gene content and order, as well as in sequence length and GC content within plastid regions and sequence classes. Departures include the putative presence ofrps16andrpl2pseudogenes in some plastomes. Phylogenetic analyses of the majority of the schemes analyzed resulted in the same topology with high values of bootstrap support. Although there is still uncertainty in some relationships, in the highest supported topologies onlymore »two nodes received bootstrap values lower than 95%.

    Discussion

    Melastomataceae plastomes are no exception for the general patterns observed in the genomic structure of land plant chloroplasts, being highly conserved and structurally similar to most other Myrtales. Despite the fact that the full plastome phylogeny shares most of the clades with the previously widely used and reduced data set, some changes are still observed and bootstrap support is higher. The plastome data set presented here is a step towards phylogenomic analyses in the Melastomataceae and will be a useful resource for future studies.

    « less
  2. Comprising 501 genera and around 14,000 species, Papilionoideae is not only the largest subfamily of Fabaceae (Leguminosae; legumes), but also one of the most extraordinarily diverse clades among angiosperms. Papilionoids are a major source of food and forage, are ecologically successful in all major biomes, and display dramatic variation in both floral architecture and plastid genome (plastome) structure. Plastid DNA-based phylogenetic analyses have greatly improved our understanding of relationships among the major groups of Papilionoideae, yet the backbone of the subfamily phylogeny remains unresolved. In this study, we sequenced and assembled 39 new plastomes that are covering key genera representing the morphological diversity in the subfamily. From 244 total taxa, we produced eight datasets for maximum likelihood (ML) analyses based on entire plastomes and/or concatenated sequences of 77 protein-coding sequences (CDS) and two datasets for multispecies coalescent (MSC) analyses based on individual gene trees. We additionally produced a combined nucleotide dataset comprising CDS plus matK gene sequences only, in which most papilionoid genera were sampled. A ML tree based on the entire plastome maximally supported all of the deep and most recent divergences of papilionoids (223 out of 236 nodes). The Swartzieae, ADA (Angylocalyceae, Dipterygeae, and Amburaneae), Cladrastis, Andira, andmore »Exostyleae clades formed a grade to the remainder of the Papilionoideae, concordant with nine ML and two MSC trees. Phylogenetic relationships among the remaining five papilionoid lineages (Vataireoid, Dermatophyllum , Genistoid s.l., Dalbergioid s.l., and Baphieae + Non-Protein Amino Acid Accumulating or NPAAA clade) remained uncertain, because of insufficient support and/or conflicting relationships among trees. Our study fully resolved most of the deep nodes of Papilionoideae, however, some relationships require further exploration. More genome-scale data and rigorous analyses are needed to disentangle phylogenetic relationships among the five remaining lineages.« less
  3. Species within the green algal order Cladophorales have an unconventional plastome structure where individual coding regions or small numbers of genes occur as linear single-stranded DNAs folded into hairpin structures. Another group of photosynthetic organisms with an equivalently reduced chloroplast genome are the peridinin dinoflagellates of the Alveolata eukaryotic lineage whose plastomes are mini-circles carrying one or a few genes required for photosynthesis. One unusual aspect of the Alveolata is the polyuridylylation of mRNA 30 ends among peridinin dinoflagellates and the chromerid algae. This study was conducted to understand if an unconventional highly reduced plastome structure co-occurs with unconventional RNA processing. To address this, the 50 and 30 mRNA termini of the known chloroplast genes of Pithophora roettleri (order Cladophorales) were analyzed for evidence of post-transcriptional processing. Circular Reverse Transcriptase PCR (cRT-PCR) followed by deep sequencing of the amplicons was used to analyze 50 and 30 mRNA termini. Evidence of several processing events were collected, most notably the 30 termini of six of the eight genes were polyuridylylated, which has not been reported for any lineage outside of the Alveolata. Other processing events include poly(A) and heteropolymeric 30 additions, 50 primary transcript start sites, as well as the presence ofmore »circularized RNAs. Five other species representing other green algal lineages were also tested and poly(U) additions appear to be limited to the order Cladophorales. These results demonstrate that chloroplast mRNA polyuridylylation is not the sole provenance of photosynthetic alveolates and may have convergently evolved in two distinct photosynthetic lineages.« less
  4. Species within the green algal order Cladophorales have an unconventional plastome structure where individual coding regions or small numbers of genes occur as linear single-stranded DNAs folded into hairpin structures. Another group of photosynthetic organisms with an equivalently reduced chloroplast genome are the peridinin dinoflagellates of the Alveolata eukaryotic lineage whose plastomes are mini-circles carrying one or a few genes required for photosynthesis. One unusual aspect of the Alveolata is the polyuridylylation of mRNA 30 ends among peridinin dinoflagellates and the chromerid algae. This study was conducted to understand if an unconventional highly reduced plastome structure co-occurs with unconventional RNA processing. To address this, the 50 and 30 mRNA termini of the known chloroplast genes of Pithophora roettleri (order Cladophorales) were analyzed for evidence of post-transcriptional processing. Circular Reverse Transcriptase PCR (cRT-PCR) followed by deep sequencing of the amplicons was used to analyze 50 and 30 mRNA termini. Evidence of several processing events were collected, most notably the 30 termini of six of the eight genes were polyuridylylated, which has not been reported for any lineage outside of the Alveolata. Other processing events include poly(A) and heteropolymeric 30 additions, 50 primary transcript start sites, as well as the presence ofmore »circularized RNAs. Five other species representing other green algal lineages were also tested and poly(U) additions appear to be limited to the order Cladophorales. These results demonstrate that chloroplast mRNA polyuridylylation is not the sole provenance of photosynthetic alveolates and may have convergently evolved in two distinct photosynthetic lineages.« less
  5. Abstract The complete chloroplast and mitochondrial genomes of Charophyta have shed new light on land plant terrestrialization. Here, we report the organellar genomes of the Zygnema circumcarinatum strain UTEX 1559, and a comparative genomics investigation of 33 plastomes and 18 mitogenomes of Chlorophyta, Charophyta (including UTEX 1559 and its conspecific relative SAG 698-1a), and Embryophyta. Gene presence/absence was determined across these plastomes and mitogenomes. A comparison between the plastomes of UTEX 1559 (157 548 bp) and SAG 698-1a (165 372 bp) revealed very similar gene contents, but substantial genome rearrangements. Surprisingly, the two plastomes share only 85.69% nucleotide sequence identity. The UTEX 1559 mitogenome size is 215 954 bp, the largest among all sequenced Charophyta. Interestingly, this large mitogenome contains a 50 kb region without homology to any other organellar genomes, which is flanked by two 86 bp direct repeats and contains 15 ORFs. These ORFs have significant homology to proteins from bacteria and plants with functions such as primase, RNA polymerase, and DNA polymerase. We conclude that (i) the previously published SAG 698-1a plastome is probably from a different Zygnema species, and (ii) the 50 kb region in the UTEX 1559 mitogenome might be recently acquired as a mobilemore »element.« less