Vertebrates have distinct tissues which are not present in invertebrate chordates nor other metazoans. The rise of these tissues also coincided with at least one round of whole-genome duplication as well as a suite of lineage-specific segmental duplications. Understanding whether novel genes lead to the origin and diversification of novel cell types, therefore, is of great importance in vertebrate evolution. Here we were particularly interested in the evolution of the vertebrate musculoskeletal system, the muscles and connective tissues that support a diversity of body plans. A major component of the musculoskeletal extracellular matrix (ECM) is fibrillar collagens, a gene family which has been greatly expanded upon in vertebrates. We thus asked whether the repertoire of fibrillar collagens in vertebrates reflects differences in the musculoskeletal system. To test this, we explored the diversity of fibrillar collagens in lamprey, a jawless vertebrate which diverged from jawed vertebrates (gnathostomes) more than five hundred million years ago and has undergone its own gene duplications. Some of the principal components of vertebrate hyaline cartilage are the fibrillar collagens type II and XI, but their presence in cartilage development across all vertebrate taxa has been disputed. We particularly emphasized the characterization of genes in the lamprey hyaline cartilage, testing if its collagen repertoire was similar to that in gnathostomes. Overall, we discovered thirteen fibrillar collagens from all known gene subfamilies in lamprey and were able to identify several lineage-specific duplications. We found that, while the collagen loci have undergone rearrangement, the Clade A genes have remained linked with the hox clusters, a phenomenon also seen in gnathostomes. While the lamprey muscular tissue was largely similar to that seen in gnathostomes, we saw considerable differences in the larval lamprey skeletal tissue, with distinct collagen combinations pertaining to different cartilage types. Our gene expression analyses were unable to identify type II collagen in the sea lamprey hyaline cartilage nor any other fibrillar collagen during chondrogenesis at the stages observed, meaning that sea lamprey likely no longer require these genes during early cartilage development. Our findings suggest that fibrillar collagens were multifunctional across the musculoskeletal system in the last common ancestor of vertebrates and have been largely conserved, but these genes alone cannot explain the origin of novel cell types.
more »
« less
Evolution of the nitric oxide synthase family in vertebrates and novel insights in gill development
Nitric oxide (NO) is an ancestral key signalling molecule essential for life and has enormous versatility in biological systems, including cardiovascular homeostasis, neurotransmission and immunity. Although our knowledge of NO synthases (Nos), the enzymes that synthesize NO in vivo , is substantial, the origin of a large and diversified repertoire of nos gene orthologues in fishes with respect to tetrapods remains a puzzle. The recent identification of nos3 in the ray-finned fish spotted gar, which was considered lost in this lineage, changed this perspective. This finding prompted us to explore nos gene evolution, surveying vertebrate species representing key evolutionary nodes. This study provides noteworthy findings: first, nos2 experienced several lineage-specific gene duplications and losses. Second, nos3 was found to be lost independently in two different teleost lineages, Elopomorpha and Clupeocephala. Third, the expression of at least one nos paralogue in the gills of developing shark, bichir, sturgeon, and gar, but not in lamprey, suggests that nos expression in this organ may have arisen in the last common ancestor of gnathostomes. These results provide a framework for continuing research on nos genes’ roles, highlighting subfunctionalization and reciprocal loss of function that occurred in different lineages during vertebrate genome duplications.
more »
« less
- Award ID(s):
- 2029216
- NSF-PAR ID:
- 10377221
- Date Published:
- Journal Name:
- Proceedings of the Royal Society B: Biological Sciences
- Volume:
- 289
- Issue:
- 1980
- ISSN:
- 0962-8452
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
ABSTRACT Most freshwater bacterial communities are characterized by a few dominant taxa that are often ubiquitous across freshwater biomes worldwide. Our understanding of the genomic diversity within these taxonomic groups is limited to a subset of taxa. Here, we investigated the genomic diversity that enables Limnohabitans , a freshwater genus key in funneling carbon from primary producers to higher trophic levels, to achieve abundance and ubiquity. We reconstructed eight putative Limnohabitans metagenome-assembled genomes (MAGs) from stations located along broad environmental gradients existing in Lake Michigan, part of Earth’s largest surface freshwater system. De novo strain inference analysis resolved a total of 23 strains from these MAGs, which strongly partitioned into two habitat-specific clusters with cooccurring strains from different lineages. The largest number of strains belonged to the abundant LimB lineage, for which robust in situ strain delineation had not previously been achieved. Our data show that temperature and nutrient levels may be important environmental parameters associated with microdiversification within the Limnohabitans genus. In addition, strains predominant in low- and high-phosphorus conditions had larger genomic divergence than strains abundant under different temperatures. Comparative genomics and gene expression analysis yielded evidence for the ability of LimB populations to exhibit cellular motility and chemotaxis, a phenotype not yet associated with available Limnohabitans isolates. Our findings broaden historical marker gene-based surveys of Limnohabitans microdiversification and provide in situ evidence of genome diversity and its functional implications across freshwater gradients. IMPORTANCE Limnohabitans is an important bacterial taxonomic group for cycling carbon in freshwater ecosystems worldwide. Here, we examined the genomic diversity of different Limnohabitans lineages. We focused on the LimB lineage of this genus, which is globally distributed and often abundant, and its abundance has shown to be largely invariant to environmental change. Our data show that the LimB lineage is actually comprised of multiple cooccurring populations for which the composition and genomic characteristics are associated with variations in temperature and nutrient levels. The gene expression profiles of this lineage suggest the importance of chemotaxis and motility, traits that had not yet been associated with the Limnohabitans genus, in adapting to environmental conditions.more » « less
-
Rogers, Rebekah (Ed.)Abstract Whole-genome duplications (WGDs) have shaped the gene repertoire of many eukaryotic lineages. The redundancy created by WGDs typically results in a phase of massive gene loss. However, some WGD–derived paralogs are maintained over long evolutionary periods, and the relative contributions of different selective pressures to their maintenance are still debated. Previous studies have revealed a history of three successive WGDs in the lineage of the ciliate Paramecium tetraurelia and two of its sister species from the Paramecium aurelia complex. Here, we report the genome sequence and analysis of 10 additional P. aurelia species and 1 additional out group, revealing aspects of post-WGD evolution in 13 species sharing a common ancestral WGD. Contrary to the morphological radiation of vertebrates that putatively followed two WGD events, members of the cryptic P. aurelia complex have remained morphologically indistinguishable after hundreds of millions of years. Biases in gene retention compatible with dosage constraints appear to play a major role opposing post-WGD gene loss across all 13 species. In addition, post-WGD gene loss has been slower in Paramecium than in other species having experienced genome duplication, suggesting that the selective pressures against post-WGD gene loss are especially strong in Paramecium. A near complete lack of recent single-gene duplications in Paramecium provides additional evidence for strong selective pressures against gene dosage changes. This exceptional data set of 13 species sharing an ancestral WGD and 2 closely related out group species will be a useful resource for future studies on Paramecium as a major model organism in the evolutionary cell biology.more » « less
-
Goldman, Gustavo H. (Ed.)ABSTRACT Gene expression divergence through evolutionary processes is thought to be important for achieving programmed development in multicellular organisms. To test this premise in filamentous fungi, we investigated transcriptional profiles of 3,942 single-copy orthologous genes (SCOGs) in five related sordariomycete species that have morphologically diverged in the formation of their flask-shaped perithecia. We compared expression of the SCOGs to inferred gene expression levels of the most recent common ancestor of the five species, ranking genes from their largest increases to smallest increases in expression during perithecial development in each of the five species. We found that a large proportion of the genes that exhibited evolved increases in gene expression were important for normal perithecial development in Fusarium graminearum . Many of these genes were previously uncharacterized, encoding hypothetical proteins without any known functional protein domains. Interestingly, the developmental stages during which aberrant knockout phenotypes appeared largely coincided with the elevated expression of the deleted genes. In addition, we identified novel genes that affected normal perithecial development in Magnaporthe oryzae and Neurospora crassa , which were functionally and transcriptionally diverged from the orthologous counterparts in F. graminearum . Furthermore, comparative analysis of developmental transcriptomes and phylostratigraphic analysis suggested that genes encoding hypothetical proteins are generally young and transcriptionally divergent between related species. This study provides tangible evidence of shifts in gene expression that led to acquisition of novel function of orthologous genes in each lineage and demonstrates that several genes with hypothetical function are crucial for shaping multicellular fruiting bodies. IMPORTANCE The fungal class Sordariomycetes includes numerous important plant and animal pathogens. It also provides model systems for studying fungal fruiting body development, as its members develop fruiting bodies with a few well-characterized tissue types on common growth media and have rich genomic resources that enable comparative and functional analyses. To understand transcriptional divergence of key developmental genes between five related sordariomycete fungi, we performed targeted knockouts of genes inferred to have evolved significant upward shifts in expression. We found that many previously uncharacterized genes play indispensable roles at different stages of fruiting body development, which have undergone transcriptional activation in specific lineages. These novel genes are predicted to be phylogenetically young and tend to be involved in lineage- or species-specific function. Transcriptional activation of genes with unknown function seems to be more frequent than ever thought, which may be crucial for rapid adaption to changing environments for successful sexual reproduction.more » « less
-
null (Ed.)Abstract High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are available for only a few non-microbial species 1–4 . To address this issue, the international Genome 10K (G10K) consortium 5,6 has worked over a five-year period to evaluate and develop cost-effective methods for assembling highly accurate and nearly complete reference genomes. Here we present lessons learned from generating assemblies for 16 species that represent six major vertebrate lineages. We confirm that long-read sequencing technologies are essential for maximizing genome quality, and that unresolved complex repeats and haplotype heterozygosity are major sources of assembly error when not handled correctly. Our assemblies correct substantial errors, add missing sequence in some of the best historical reference genomes, and reveal biological discoveries. These include the identification of many false gene duplications, increases in gene sizes, chromosome rearrangements that are specific to lineages, a repeated independent chromosome breakpoint in bat genomes, and a canonical GC-rich pattern in protein-coding genes and their regulatory regions. Adopting these lessons, we have embarked on the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all of the roughly 70,000 extant vertebrate species and to help to enable a new era of discovery across the life sciences.more » « less