skip to main content


Title: Protein Structure, Models of Sequence Evolution, and Data Type Effects in Phylogenetic Analyses of Mitochondrial Data: A Case Study in Birds
Phylogenomic analyses have revolutionized the study of biodiversity, but they have revealed that estimated tree topologies can depend, at least in part, on the subset of the genome that is analyzed. For example, estimates of trees for avian orders differ if protein-coding or non-coding data are analyzed. The bird tree is a good study system because the historical signal for relationships among orders is very weak, which should permit subtle non-historical signals to be identified, while monophyly of orders is strongly corroborated, allowing identification of strong non-historical signals. Hydrophobic amino acids in mitochondrially-encoded proteins, which are expected to be found in transmembrane helices, have been hypothesized to be associated with non-historical signals. We tested this hypothesis by comparing the evolution of transmembrane helices and extramembrane segments of mitochondrial proteins from 420 bird species, sampled from most avian orders. We estimated amino acid exchangeabilities for both structural environments and assessed the performance of phylogenetic analysis using each data type. We compared those relative exchangeabilities with values calculated using a substitution matrix for transmembrane helices estimated using a variety of nuclear- and mitochondrially-encoded proteins, allowing us to compare the bird-specific mitochondrial models with a general model of transmembrane protein evolution. To complement our amino acid analyses, we examined the impact of protein structure on patterns of nucleotide evolution. Models of transmembrane and extramembrane sequence evolution for amino acids and nucleotides exhibited striking differences, but there was no evidence for strong topological data type effects. However, incorporating protein structure into analyses of mitochondrially-encoded proteins improved model fit. Thus, we believe that considering protein structure will improve analyses of mitogenomic data, both in birds and in other taxa.  more » « less
Award ID(s):
1655683
NSF-PAR ID:
10359424
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Diversity
Volume:
13
Issue:
11
ISSN:
1424-2818
Page Range / eLocation ID:
555
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Summary

    Malonyl‐CoA is a key intermediate in a number of metabolic processes associated with its role as a substrate in acylation and condensation reactions. These types of reactions occur in plastids, the cytosol and mitochondria, and although carboxylation of acetyl‐CoA is the known mechanism for generating the distinct plastidial and cytosolic pools, the metabolic origin of the mitochondrial malonyl‐CoA pool is still unclear. In this study we demonstrate that malonyl‐CoA synthetase encoded by the ArabidopsisAAE13(AT3G16170) gene is localized in both the cytosol and the mitochondria. These isoforms are translated from two types of transcripts, one that contains and one that does not contain a mitochondrial‐targeting pre‐sequence. Whereas the cytosolicAAE13 protein is not essential, due to the presence of a redundant malonyl‐CoA generating system provided by a cytosolic acetyl‐CoA carboxylase, the mitochondrialAAE13 protein is essential for plant growth. Phenotypes of theaae13‐1mutant are transgenically reversed only if the mitochondrial pre‐sequence is present in the ectopically expressedAAE13 proteins. Theaae13‐1mutant exhibits typical metabolic phenotypes associated with a deficiency in the mitochondrial fatty acid synthase system, namely depleted lipoylation of the H subunit of the photorespiratory enzyme glycine decarboxylase, increased accumulation of glycine and glycolate and reduced levels of sucrose. Most of these metabolic alterations, and associated morphological changes, are reversed when theaae13‐1mutant is grown in a non‐photorespiratory condition (i.e. a 1%CO2atmosphere), demonstrating that they are a consequence of the deficiency in photorespiration due to the inability to generate lipoic acid from mitochondrially synthesized fatty acids.

     
    more » « less
  2. Introduction

    Eukaryotic life depends on the functional elements encoded by both the nuclear genome and organellar genomes, such as those contained within the mitochondria. The content, size, and structure of the mitochondrial genome varies across organisms with potentially large implications for phenotypic variance and resulting evolutionary trajectories. Among yeasts in the subphylum Saccharomycotina, extensive differences have been observed in various species relative to the model yeastSaccharomyces cerevisiae, but mitochondrial genome sampling across many groups has been scarce, even as hundreds of nuclear genomes have become available.

    Methods

    By extracting mitochondrial assemblies from existing short-read genome sequence datasets, we have greatly expanded both the number of available genomes and the coverage across sparsely sampled clades.

    Results

    Comparison of 353 yeast mitochondrial genomes revealed that, while size and GC content were fairly consistent across species, those in the generaMetschnikowiaandSaccharomycestrended larger, while several species in the order Saccharomycetales, which includesS. cerevisiae, exhibited lower GC content. Extreme examples for both size and GC content were scattered throughout the subphylum. All mitochondrial genomes shared a core set of protein-coding genes for Complexes III, IV, and V, but they varied in the presence or absence of mitochondrially-encoded canonical Complex I genes. We traced the loss of Complex I genes to a major event in the ancestor of the orders Saccharomycetales and Saccharomycodales, but we also observed several independent losses in the orders Phaffomycetales, Pichiales, and Dipodascales. In contrast to prior hypotheses based on smaller-scale datasets, comparison of evolutionary rates in protein-coding genes showed no bias towards elevated rates among aerobically fermenting (Crabtree/Warburg-positive) yeasts. Mitochondrial introns were widely distributed, but they were highly enriched in some groups. The majority of mitochondrial introns were poorly conserved within groups, but several were shared within groups, between groups, and even across taxonomic orders, which is consistent with horizontal gene transfer, likely involving homing endonucleases acting as selfish elements.

    Discussion

    As the number of available fungal nuclear genomes continues to expand, the methods described here to retrieve mitochondrial genome sequences from these datasets will prove invaluable to ensuring that studies of fungal mitochondrial genomes keep pace with their nuclear counterparts.

     
    more » « less
  3. Abstract Background In genus Rhinolophus , species in the Rhinolophus philippinensis and R. macrotis groups are unique because the horseshoe bats in these group have relatively low echolocation frequencies and flight speeds compared with other horseshoe bats with similar body size. The different characteristics among bat species suggest particular evolutionary processes may have occurred in this genus. To study the adaptive evidence in the mitochondrial genomes (mitogenomes) of rhinolophids, especially the mitogenomes of the species with low echolocation frequencies, we sequenced eight mitogenomes and used them for comparative studies of molecular phylogeny and adaptive evolution. Results Phylogenetic analysis using whole mitogenome sequences produced robust results and provided phylogenetic signals that were better than those obtained using single genes. The results supported the recent establishment of the separate macrotis group. The signals of adaptive evolution discovered in the Rhinolophus species were tested for some of the codons in two genes ( ND2 and ND6 ) that encode NADH dehydrogenases in oxidative phosphorylation system complex I. These genes have a background of widespread purifying selection. Signals of relaxed purifying selection and positive selection were found in ND2 and ND6 , respectively, based on codon models and physicochemical profiles of amino acid replacements. However, no pronounced overlap was found for non-synonymous sites in the mitogenomes of all the species with low echolocation frequencies. A signal of positive selection for ND5 was found in the branch-site model when R. philippinensis was set as the foreground branch. Conclusions The mitogenomes provided robust phylogenetic signals that were much more informative than the signals obtained using single mitochondrial genes. Two mitochondrial genes that encoding proteins in the oxidative phosphorylation system showed some evidence of adaptive evolution in genus Rhinolophus and the positive selection signals were tested for ND5 in R. philippinensis . These results indicate that mitochondrial protein-coding genes were targets of adaptive evolution during the evolution of Rhinolophus species, which might have contributed to a diverse range of acoustic adaptations in this genus. 
    more » « less
  4. Abstract

    Heteroplasmy is the presence of more than one type of mitochondrial genome within an individual, a condition commonly reported as unfavorable and affecting mitonuclear interactions. So far, no study has investigated heteroplasmy at protein level, and whether it occurs within tissues, cells, or even organelles. The only known evolutionarily stable and natural heteroplasmic system in Metazoa is the Doubly Uniparental Inheritance (DUI)—reported so far in ∼100 bivalve species—in which two mitochondrial lineages are present: one transmitted through eggs (F-type) and the other through sperm (M-type). Because of such segregation, mitochondrial oxidative phosphorylation proteins reach a high amino acid sequence divergence (up to 52%) between the two lineages in the same species. Natural heteroplasmy coupled with high sequence divergence between F- and M-type proteins provides a unique opportunity to study their expression and assess the level and extent of heteroplasmy. Here, for the first time, we immunolocalized F- and M-type variants of three mitochondrially-encoded proteins in the DUI species Ruditapes philippinarum, in germline and somatic tissues at different developmental stages. We found heteroplasmy at organelle level in undifferentiated germ cells of both sexes, and in male soma, whereas gametes were homoplasmic: eggs for the F-type and sperm for the M-type. Thus, during gametogenesis, only the sex-specific mitochondrial variant is maintained, likely due to a process of meiotic drive. We examine the implications of our results for DUI proposing a revised model, and we discuss interactions of mitochondria with germ plasm and their role in germline development. Molecular and phylogenetic evidence suggests that DUI evolved from the common Strictly Maternal Inheritance, so the two systems likely share the same underlying molecular mechanism, making DUI a useful system for studying mitochondrial biology.

     
    more » « less
  5. Abstract

    In the age of next-generation sequencing, the number of loci available for phylogenetic analyses has increased by orders of magnitude. But despite this dramatic increase in the amount of data, some phylogenomic studies have revealed rampant gene-tree discordance that can be caused by many historical processes, such as rapid diversification, gene duplication, or reticulate evolution. We used a target enrichment approach to sample 400 single-copy nuclear genes and estimate the phylogenetic relationships of 13 genera in the lichen-forming family Lobariaceae to address the effect of data type (nucleotides and amino acids) and phylogenetic reconstruction method (concatenation and species tree approaches). Furthermore, we examined datasets for evidence of historical processes, such as rapid diversification and reticulate evolution. We found incongruence associated with sequence data types (nucleotide vs. amino acid sequences) and with different methods of phylogenetic reconstruction (species tree vs. concatenation). The resulting phylogenetic trees provided evidence for rapid and reticulate evolution based on extremely short branches in the backbone of the phylogenies. The observed rapid and reticulate diversifications may explain conflicts among gene trees and the challenges to resolving evolutionary relationships. Based on divergence times, the diversification at the backbone occurred near the Cretaceous-Paleogene (K-Pg) boundary (65 Mya) which is consistent with other rapid diversifications in the tree of life. Although some phylogenetic relationships within the Lobariaceae family remain with low support, even with our powerful phylogenomic dataset of up to 376 genes, our use of target-capturing data allowed for the novel exploration of the mechanisms underlying phylogenetic and systematic incongruence.

     
    more » « less