skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Identifying Candidate Genetic Markers of CDV Cross-Species Pathogenicity in African Lions
Canine distemper virus (CDV) is a multi-host pathogen with variable clinical outcomes of infection across and within species. We used whole-genome sequencing (WGS) to search for viral markers correlated with clinical distemper in African lions. To identify candidate markers, we first documented single-nucleotide polymorphisms (SNPs) differentiating CDV strains associated with different clinical outcomes in lions in East Africa. We then conducted evolutionary analyses on WGS from all global CDV lineages to identify loci subject to selection. SNPs that both differentiated East African strains and were under selection were mapped to a phylogenetic tree representing global CDV diversity to assess if candidate markers correlated with documented outbreaks of clinical distemper in lions (n = 3). Of 54 SNPs differentiating East African strains, ten were under positive or episodic diversifying selection and 20 occurred in the clinical strain despite strong purifying selection at those loci. Candidate markers were in functional domains of the RNP complex (n = 19), the matrix protein (n = 4), on CDV glycoproteins (n = 5), and on the V protein (n = 1). We found mutations at two loci in common between sequences from three CDV outbreaks of clinical distemper in African lions; one in the signaling lymphocytic activation molecule receptor (SLAM)-binding region of the hemagglutinin protein and another in the catalytic center of phosphodiester bond formation on the large polymerase protein. These results suggest convergent evolution at these sites may have a functional role in clinical distemper outbreaks in African lions and uncover potential novel barriers to pathogenicity in this species.  more » « less
Award ID(s):
1907022
PAR ID:
10341988
Author(s) / Creator(s):
; ; ; ; ; ; ;
Date Published:
Journal Name:
Pathogens
Volume:
9
Issue:
11
ISSN:
2076-0817
Page Range / eLocation ID:
872
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract A large number of genetic variations have been identified to be associated with Alzheimer’s disease (AD) and related quantitative traits. However, majority of existing studies focused on single types of omics data, lacking the power of generating a community including multi-omic markers and their functional connections. Because of this, the immense value of multi-omics data on AD has attracted much attention. Leveraging genomic, transcriptomic and proteomic data, and their backbone network through functional relations, we proposed a modularity-constrained logistic regression model to mine the association between disease status and a group of functionally connected multi-omic features, i.e. single-nucleotide polymorphisms (SNPs), genes and proteins. This new model was applied to the real data collected from the frontal cortex tissue in the Religious Orders Study and Memory and Aging Project cohort. Compared with other state-of-art methods, it provided overall the best prediction performance during cross-validation. This new method helped identify a group of densely connected SNPs, genes and proteins predictive of AD status. These SNPs are mostly expression quantitative trait loci in the frontal region. Brain-wide gene expression profile of these genes and proteins were highly correlated with the brain activation map of ‘vision’, a brain function partly controlled by frontal cortex. These genes and proteins were also found to be associated with the amyloid deposition, cortical volume and average thickness of frontal regions. Taken together, these results suggested a potential pathway underlying the development of AD from SNPs to gene expression, protein expression and ultimately brain functional and structural changes. 
    more » « less
  2. Pathogen emergence is a complex phenomenon that, despite its public health relevance, remains poorly understood. Vibrio vulnificus, an emergent human pathogen, can cause a deadly septicaemia with over 50% mortality rate. To date, the ecological drivers that lead to the emergence of clinical strains and the unique genetic traits that allow these clones to colonize the human host remain mostly unknown. We recently surveyed a large estuary in eastern Florida, where outbreaks of the disease frequently occur, and found endemic populations of the bacterium. We established two sampling sites and observed strong correlations between location and pathogenic potential. One site is significantly enriched with strains that belong to one phylogenomic cluster (C1) from which the majority of clinical strains belong to. Interestingly, strains isolated from this site exhibit phenotypic traits associated with clinical outcomes, whereas strains from the second site belong to a cluster that rarely causes disease in humans (C2). Analyses of C1 genomes indicate unique genetic markers in the form of clinical-associated alleles with potential role in virulence. Finally, metagenomic and physicochemical analyses of the sampling sites indicate that this marked cluster distribution and genetic traits are strongly associated with distinct biotic and abiotic factors (e.g. salinity, nutrients, or biodiversity), revealing how ecosystems generate selective pressures that facilitate the emergence of specific strains with pathogenic potential in a population. This knowledge can be applied to assess the risk of pathogen emergence from environmental sources, and integrated towards the development of novel strategies for the prevention of future outbreaks. 
    more » « less
  3. Björkroth, Johanna (Ed.)
    ABSTRACT Giardia duodenalis (syn. Giardia lamblia , Giardia intestinalis ) is the causative agent of giardiasis, one of the most common diarrheal infections in humans. Evolutionary relationships among G. duodenalis genotypes (or subtypes) of assemblage B, one of two genetic assemblages causing the majority of human infections, remain unclear due to poor phylogenetic resolution of current typing methods. In this study, we devised a methodology to identify new markers for a streamlined multilocus sequence typing (MLST) scheme based on comparisons of all core genes against the phylogeny of whole-genome sequences (WGS). Our analysis identified three markers with resolution comparable to that of WGS data. Using newly designed PCR primers for our novel MLST loci, we typed an additional 68 strains of assemblage B. Analyses of these strains and previously determined genome sequences showed that genomes of this assemblage can be assigned to 16 clonal complexes, each with unique gene content that is apparently tuned to differential virulence and ecology. Obtaining new genomes of Giardia spp. and other eukaryotic microbial pathogens remains challenging due to difficulties in culturing the parasites in the laboratory. Hence, the methods described here are expected to be widely applicable to other pathogens of interest and advance our understanding of their ecology and evolution. IMPORTANCE Giardia duodenalis assemblage B is a major waterborne pathogen and the most commonly identified genotype causing human giardiasis worldwide. The lack of morphological characters for classification requires the use of molecular techniques for strain differentiation; however, the absence of scalable and affordable next-generation sequencing (NGS)-based typing methods has prevented meaningful advancements in high-resolution molecular typing for further understanding of the evolution and epidemiology of assemblage B. Prior studies have reported high sequence diversity but low phylogenetic resolution at standard loci in assemblage B, highlighting the necessity of identifying new markers for accurate and robust molecular typing. Data from comparative analyses of available genomes in this study identified three loci that together form a novel high-resolution typing scheme with high concordance to whole-genome-based phylogenomics and which should aid in future public health endeavors related to this parasite. In addition, data from newly characterized strains suggest evidence of biogeographic and ecologic endemism. 
    more » « less
  4. Yoshizawa, Kazunori (Ed.)
    Abstract The order Psocodea includes the two historically recognized groups Psocoptera (free-living bark lice) and Phthiraptera (parasitic lice) that were once considered separate orders. Psocodea is divided in three suborders: Trogiomorpha, Troctomorpha, and Psocomorpha, the latter being the largest within the free-living groups. Despite the increasing number of transcriptomes and whole genome sequence (WGS) data available for this group, the relationships among the six known infraorders within Psocomorpha remain unclear. Here, we evaluated the utility of a bait set designed specifically for parasitic lice belonging to suborder Troctomorpha to extract UCE loci from transcriptome and WGS data of 55 bark louse species and explored the phylogenetic relationships within Psocomorpha using these UCE loci markers. Taxon sampling was heavily focused on the families Lachesillidae and Elipsocidae, whose relationships have been problematic in prior phylogenetic studies. We successfully recovered a total of 2,622 UCE loci, with a 40% completeness matrix containing 2,081 UCE loci and an 80% completeness matrix containing 178 UCE loci. The average number of UCE loci recovered for the 55 species was 1,401. The WGS data sets produced a larger number of UCE loci (1,495) on average than the transcriptome data sets (972). Phylogenetic relationships reconstructed with Maximum Likelihood and coalescent-based analysis were concordant regarding the paraphyly of Lachesillidae and Elipsocidae. Branch support values were generally lower in analyses that used a fewer number of loci, even though they had higher matrix completeness. 
    more » « less
  5. Abstract Selection that acts in a sex-specific manner causes the evolution of sexual dimorphism. Sex-specific phenotypic selection has been demonstrated in many taxa and can be in the same direction in the two sexes (differing only in magnitude), limited to one sex, or in opposing directions (antagonistic). Attempts to detect the signal of sex-specific selection from genomic data have confronted numerous difficulties. These challenges highlight the utility of “direct approaches,” in which fitness is predicted from individual genotype within each sex. Here, we directly measured selection on Single Nucleotide Polymorphisms (SNPs) in a natural population of the sexually dimorphic, dioecious plant, Silene latifolia. We measured flowering phenotypes, estimated fitness over one reproductive season, as well as survival to the next year, and genotyped all adults and a subset of their offspring for SNPs across the genome. We found that while phenotypic selection was congruent (fitness covaried similarly with flowering traits in both sexes), SNPs showed clear evidence for sex-specific selection. SNP-level selection was particularly strong in males and may involve an important gametic component (e.g., pollen competition). While the most significant SNPs under selection in males differed from those under selection in females, paternity selection showed a highly polygenic tradeoff with female survival. Alleles that increased male mating success tended to reduce female survival, indicating sexual antagonism at the genomic level. Perhaps most importantly, this experiment demonstrates that selection within natural populations can be strong enough to measure sex-specific fitness effects of individual loci. Males and females typically differ phenotypically, a phenomenon known as sexual dimorphism. These differences arise when selection on males differs from selection on females, either in magnitude or direction. Estimated relationships between traits and fitness indicate that sex-specific selection is widespread, occurring in both plants and animals, and explains why so many species exhibit sexual dimorphism. Finding the specific loci experiencing sex-specific selection is a challenging prospect but one worth undertaking given the extensive evolutionary consequences. Flowering plants with separate sexes are ideal organisms for such studies, given that the fitness of females can be estimated by counting the number of seeds they produce. Determination of fitness for males has been made easier as thousands of genetic markers can now be used to assign paternity to seeds. We undertook just such a study in S. latifolia, a short-lived, herbaceous plant. We identified loci under sex-specific selection in this species and found more loci affecting fitness in males than females. Importantly, loci with major effects on male fitness were distinct from the loci with major effects on females. We detected sexual antagonism only when considering the aggregate effect of many loci. Hence, even though males and females share the same genome, this does not necessarily impose a constraint on their independent evolution. 
    more » « less