skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Ancient Host-Virus Gene Transfer Hints at a Diverse Pre-LECA Virosphere
Abstract The details surrounding the early evolution of eukaryotes and their viruses are largely unknown. Several key enzymes involved in DNA synthesis and transcription are shared between eukaryotes and large DNA viruses in the phylumNucleocytoviricota, but the evolutionary relationships between these genes remain unclear. In particular, previous studies of eukaryotic DNA and RNA polymerases often show deep-branching clades of eukaryotes and viruses indicative of ancient gene exchange. Here, we performed updated phylogenetic analysis of eukaryotic and viral family B DNA polymerases, multimeric RNA polymerases, and mRNA-capping enzymes to explore their evolutionary relationships. Our results show that viral enzymes form clades that are typically adjacent to eukaryotes, suggesting that they originate prior to the emergence of the Last Eukaryotic Common Ancestor (LECA). The machinery for viral DNA replication, transcription, and mRNA capping are all key processes needed for the maintenance of virus factories, which are complex structures formed by many nucleocytoviruses during infection, indicating that viruses capable of making these structures are ancient. These findings hint at a diverse and complex pre-LECA virosphere and indicate that large DNA viruses may encode proteins that are relics of extinct proto-eukaryotic lineages.  more » « less
Award ID(s):
2141862
PAR ID:
10659158
Author(s) / Creator(s):
; ;
Publisher / Repository:
Springer Nature
Date Published:
Journal Name:
Journal of Molecular Evolution
Volume:
93
Issue:
3
ISSN:
0022-2844
Page Range / eLocation ID:
295 to 305
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Parent, Kristin N (Ed.)
    ABSTRACT Members of the phylumNucleocytoviricota, which include “giant viruses” known for their large physical dimensions and genome lengths, are a diverse group of dsDNA viruses that infect a wide range of eukaryotic hosts. The genomes of nucleocytoviruses frequently encode eukaryotic signature proteins (ESPs) such as RNA- and DNA-processing proteins, vesicular trafficking factors, cytoskeletal components, and proteins involved in ubiquitin signaling. Despite the prevalence of these genes in many nucleocytoviruses, the timing and number of gene acquisitions remains unclear. While the presence of DNA- and RNA-processing proteins in nucleocytoviruses likely reflects ancient gene transfers, the origins and evolutionary history of other proteins are largely unknown. In this study, we examined the distribution and evolutionary history of a subset of viral-encoded ESPs (vESPs) that are widespread in nucleocytoviruses. Our results demonstrate that most vESPs involved in vesicular trafficking were acquired multiple times independently by nucleocytoviruses at different time points after the emergence of the eukaryotic supergroups, while viral proteins associated with cytoskeletal and ubiquitin system proteins exhibited a more complex evolutionary pattern exhibited by both shallow and deep branching viral clades. This pattern reveals a dynamic interplay between the co-evoluton of eukaryotes and their viruses, suggesting that the viral acquisition of many genes involved in cellular processes has occurred both through ancient and more recent horizontal gene transfers. The timing and frequency of these gene acquisitions may provide insight into their role and functional significance during viral infection.IMPORTANCEThis research is pertinent for understanding the evolution of nucleocytoviruses and their interactions with eukaryotic hosts. By investigating the distribution and evolutionary history of viral-encoded eukaryotic signature proteins, the study reveals gene transfer patterns, highlighting how viruses acquire genes that allow them to manipulate host cellular processes. Identifying the timing and frequency of gene acquisitions related to essential cellular functions provides insights into their roles during viral infections. This work expands our understanding of viral diversity and adaptability, contributing valuable knowledge to virology and evolutionary biology, while offering new perspectives on the relationship between viruses and their hosts. 
    more » « less
  2. Viruses are the most abundant and diverse biological entities on the planet and constitute a significant proportion of Earth’s genetic diversity. Most of this diversity is not represented by isolated viral-host systems and has only been observed through sequencing of viral metagenomes (viromes) from environmental samples. Viromes provide snapshots of viral genetic potential, and a wealth of information on viral community ecology. These data also provide opportunities for exploring the biochemistry of novel viral enzymes. The in vitro biochemical characteristics of novel viral DNA polymerases were explored, testing hypothesized differences in polymerase biochemistry according to protein sequence phylogeny. Forty-eight viral DNA Polymerase I (PolA) proteins from estuarine viromes, hot spring metagenomes, and reference viruses, encompassing a broad representation of currently known diversity, were synthesized, expressed, and purified. Novel functionality was shown in multiple PolAs. Intriguingly, some of the estuarine viral polymerases demonstrated moderate to strong innate DNA strand displacement activity at high enzyme concentration. Strand-displacing polymerases have important technological applications where isothermal reactions are desirable. Bioinformatic investigation of genes neighboring these strand displacing polymerases found associations with SNF2 helicase-associated proteins. The specific function of SNF2 family enzymes is unknown for prokaryotes and viruses. In eukaryotes, SNF2 enzymes have chromatin remodeling functions but do not separate nucleic acid strands. This suggests the strand separation function may be fulfilled by the DNA polymerase for viruses carrying SNF2 helicase-associated proteins. Biochemical data elucidated from this study expands understanding of the biology and ecological behavior of unknown viruses. Moreover, given the numerous biotechnological applications of viral DNA polymerases, novel viral polymerases discovered within viromes may be a rich source of biological material for further in vitro DNA amplification advancements. 
    more » « less
  3. Abstract N6-adenine methylation occurs in both DNA and RNA (referred to as 6mA and m6A, respectively). As an extensively characterized epi-transcriptomic mark found in virtually all eukaryotes, m6A in mRNA is deposited by METTL3-METTL14 complex. As a transcription-associated epigenetic mark abundantly present in many unicellular eukaryotes, 6mA is coordinately maintained by two AMT1 complexes, distinguished by their mutually exclusive subunits, AMT6 and AMT7. These are all members of MT-A70 family methyltransferases (MTases). Despite their functional importance, no structure for holo-complexes with cognate DNA/RNA substrate has been resolved. Here, we employ AlphaFold3 (AF3) and molecular dynamics (MD) simulations for structural modeling ofTetrahymenaAMT1 complexes, with emphasis on ternary holo-complexes with double-stranded DNA (dsDNA) substrate and cofactor. Key structural features observed in these models are validated by mutagenesis and various other biophysical and biochemical approaches. Our analysis reveals the structural basis for DNA substrate recognition, base flipping, and catalysis in the prototypical eukaryotic DNA 6mA-MTase. It also allows us to delineate the reaction pathway for processive DNA methylation involving translocation of the closed form AMT1 complex along dsDNA. As the active site is highly conserved across MT-A70 family of eukaryotic 6mA/m6A-MTases, the structural insight will facilitate rational design of small molecule inhibitors, especially for METTL3-METTL14, a promising target in cancer therapeutics. 
    more » « less
  4. AbstractReverse transcriptases (RT), or RNA-dependent DNA polymerases, are unorthodox enzymes that originally added a new angle to the conventional view of the unidirectional flow of genetic information in the cell from DNA to RNA to protein. First discovered in vertebrate retroviruses, RTs were since re-discovered in most eukaryotes, bacteria, and archaea, spanning essentially all domains of life. For retroviruses, RTs provide the ability to copy the RNA genome into DNA for subsequent incorporation into the host genome, which is essential for their replication and survival. In cellular organisms, most RT sequences originate from retrotransposons, the type of self-replicating genetic elements that rely on reverse transcription to copy and paste their sequences into new genomic locations. Some retroelements, however, can undergo domestication, eventually becoming a valuable addition to the overall repertoire of cellular enzymes. They can be beneficial yet accessory, like the diversity-generating elements, or even essential, like the telomerase reverse transcriptases. Nowadays, ever-increasing numbers of domesticated RT-carrying genetic elements are being discovered. It may be argued that domesticated RTs and reverse transcription in general is more widespread in cellular organisms than previously thought, and that many important cellular functions, such as chromosome end maintenance, may evolve from an originally selfish process of converting RNA into DNA. 
    more » « less
  5. Champion, Patricia A (Ed.)
    ABSTRACT Tuberculosis is caused by the bacteriumMycobacterium tuberculosis(Mtb). While eukaryotic species employ several specialized RNA polymerases (Pols) to fulfill the RNA synthesis requirements of the cell, bacterial species use a single RNA polymerase (RNAP). To contribute to the foundational understanding of how Mtb and the related non-pathogenic mycobacterial species,Mycobacterium smegmatis(Msm), perform the essential function of RNA synthesis, we performed a series ofin vitrotranscription experiments to define the unique enzymatic properties of Mtb and Msm RNAPs. In this study, we characterize the mechanism of nucleotide addition used by these bacterial RNAPs with comparisons to previously characterized eukaryotic Pols I, II, and III. We show that Mtb RNAP and Msm RNAP demonstrate similar enzymatic properties and nucleotide addition kinetics to each other but diverge significantly from eukaryotic Pols. We also show that Mtb RNAP and Msm RNAP uniquely bind a nucleotide analog with significantly higher affinity than canonical nucleotides, in contrast to eukaryotic RNA polymerase II. This affinity for analogs may reveal a vulnerability for selective inhibition of the pathogenic bacterial enzyme.IMPORTANCETuberculosis, caused by the bacteriumMycobacterium tuberculosis(Mtb), remains a severe global health threat. The World Health Organization (WHO) has reported that tuberculosis is second only to COVID-19 as the most lethal infection worldwide, with more annual deaths than HIV and AIDS (WHO.int). The first-line treatment for tuberculosis, Rifampin (or Rifampicin), specifically targets the Mtb RNA polymerase. This drug has been used for decades, leading to increased numbers of multi-drug-resistant infections (Stephanie,et al). To effectively treat tuberculosis, there is an urgent need for new therapeutics that selectively target vulnerabilities of the bacteria and not the host. Characterization of the differences between Mtb enzymes and host enzymes is critical to inform these ongoing drug design efforts. 
    more » « less