Abstract Although similar developmental regulatory networks can produce diverse phenotypes, different networks can also produce the same phenotype. In theory, as long as development can produce an acceptable end phenotype, the details of the process could be shielded from selection, leading to the possibility of developmental system drift, where the developmental mechanisms underlying a stable phenotype continue to evolve. Many examples exist of divergent developmental genetics underlying conserved traits. However, studies that elucidate how these differences arose and how other features of development accommodated them are rarer. In Caenorhabditis elegans, six GATA-type transcription factors (GATA factors) comprise the zygotic part of the endoderm specification network. Here we show that the core of this network - five of the genes - originated within the genus during a brief but explosive radiation of this gene family and that at least three of them evolved from a single ancestral gene with at least two different spatio-temporal expression patterns. Based on analyses of their evolutionary history, gene structure, expression, and sequence, we explain how these GATA factors were integrated into this network. Our results show how gene duplication fueled the developmental system drift of the endoderm network in a phylogenetically brief period in developmentally canalized worms.
Radiation and diversification of GATA-domain-containing proteins in the genus Caenorhabditis
Abstract Transcription factors are defined by their DNA-binding domains (DBDs). The binding affinities and specificities of a transcription factor to its DNA binding sites can be used by an organism to fine-tune gene regulation and so are targets for evolution. Here we investigate the evolution of GATA-type transcription factors (GATA factors) in the Caenorhabditis genus. Based upon comparisons of their DBDs, these proteins form 13 distinct groups. This protein family experienced a burst of gene duplication in several of these groups along two short branches in the species tree, giving rise to subclades with very distinct complements of GATA factors. By comparing extant gene structures, DBD sequences, genome locations, and selection pressures we reconstructed how these duplications occurred. Although the paralogs have diverged in various ways, the literature shows that at least eight of the DBD groups bind to similar G-A-T-A DNA sequences. Thus, despite gene duplications and divergence among DBD sequences, most Caenorhabditis GATA factors appear to have maintained similar binding preferences, which could create the opportunity for developmental system drift. We hypothesize that this limited divergence in binding specificities contributes to the apparent disconnect between the extensive genomic evolution that has occurred in this genus and the absence of significant anatomical changes.
- Award ID(s): 1936674
- PAR ID: 10552292
- Publisher / Repository: bioRxiv
- Date Published:
- Format(s): Medium: X
- Institution: bioRxiv
- Sponsoring Org: National Science Foundation
More Like this
-
Abstract Background: Identifying the DNA-binding specificities of transcription factors (TFs) is central to understanding gene networks that regulate growth and development. Such knowledge is lacking in oomycetes, a microbial eukaryotic lineage within the stramenopile group. Oomycetes include many important plant and animal pathogens such as the potato and tomato blight agent Phytophthora infestans, which is a tractable model for studying life-stage differentiation within the group. Results: Mining of the P. infestans genome identified 197 genes encoding proteins belonging to 22 TF families. Their chromosomal distribution was consistent with family expansions through unequal crossing-over, which were likely ancient since each family had similar sizes in most oomycetes. Most TFs exhibited dynamic changes in RNA levels through the P. infestans life cycle. The DNA-binding preferences of 123 proteins were assayed using protein-binding oligonucleotide microarrays, which succeeded with 73 proteins from 14 families. Binding sites predicted for representatives of the families were validated by electrophoretic mobility shift or chromatin immunoprecipitation assays. Consistent with the substantial evolutionary distance of oomycetes from traditional model organisms, only a subset of the DNA-binding preferences resembled those of human or plant orthologs. Phylogenetic analyses of the TF families within P. infestans often discriminated clades with canonical and novel DNA targets. Paralogs with similar binding preferences frequently had distinct patterns of expression suggestive of functional divergence. TFs were predicted to either drive life stage-specific expression or serve as general activators based on the representation of their binding sites within total or developmentally-regulated promoters. This projection was confirmed for one TF using synthetic and mutated promoters fused to reporter genes in vivo. Conclusions: We established a large dataset of binding specificities for P. infestans TFs, representing the first in the stramenopile group. This resource provides a basis for understanding transcriptional regulation by linking TFs with their targets, which should help delineate the molecular components of processes such as sporulation and host infection. Our work also yielded insight into TF evolution during the eukaryotic radiation, revealing both functional conservation and diversification across kingdoms.
-
Transcription factors must scan genomic DNA, recognize the cognate sequence of their control element(s), and bind tightly to them. The DNA recognition process is primarily carried out by their DNA binding domains (DBD), which interact with the cognate site with high affinity and more weakly with any other DNA sequence. DBDs are generally thought to bind to their cognate DNA without changing conformation (lock-and-key). Here, we used nuclear magnetic resonance and circular dichroism to investigate the interplay between DNA recognition and DBD conformation in the engrailed homeodomain (enHD), as a model case for the homeodomain family of eukaryotic DBDs. We found that the conformational ensemble of enHD is rather flexible and becomes gradually more disordered as ionic strength decreases, following a Debye–Hückel dependence. Our analysis indicates that enHD’s response to ionic strength is mediated by a built-in electrostatic spring-loaded latch that operates as a conformational transducer. We also found that, at moderate ionic strengths, enHD changes conformation upon binding to cognate DNA. This change is of larger amplitude and somewhat orthogonal to the response to ionic strength. As a consequence, very high ionic strengths (e.g., 700 mM) block the electrostatic spring-loaded latch and binding to cognate DNA becomes lock-and-key. However, the interplay between enHD conformation and cognate DNA binding is robust across a range of ionic strengths (i.e., 45 to 300 mM) that covers the physiologically relevant conditions. Therefore, our results demonstrate the presence of a mechanism for the conformational control of cognate DNA recognition on a eukaryotic DBD. This mechanism can function as a signal transducer that locks the DBD in place upon encountering the cognate site during active DNA scanning. The electrostatic spring-loaded latch of enHD can also enable the fine control of DNA recognition in response to transient changes in local ionic strength induced by various physiological processes.
-
The rapid evolution of repetitive DNA sequences, including satellite DNA, tandem duplications, and transposable elements, underlies phenotypic evolution and contributes to hybrid incompatibilities between species. However, repetitive genomic regions are fragmented and misassembled in most contemporary genome assemblies. We generated highly contiguous de novo reference genomes for the Drosophila simulans species complex (D. simulans, D. mauritiana, and D. sechellia), which speciated ∼250,000 yr ago. Our assemblies are comparable in contiguity and accuracy to the current D. melanogaster genome, allowing us to directly compare repetitive sequences between these four species. We find that at least 15% of the D. simulans complex species genomes fail to align uniquely to D. melanogaster owing to structural divergence, twice the number of single-nucleotide substitutions. We also find rapid turnover of satellite DNA and extensive structural divergence in heterochromatic regions, whereas the euchromatic gene content is mostly conserved. Despite the overall preservation of gene synteny, euchromatin in each species has been shaped by clade- and species-specific inversions, transposable elements, expansions and contractions of satellite and tRNA tandem arrays, and gene duplications. We also find rapid divergence among Y-linked genes, including copy number variation and recent gene duplications from autosomes. Our assemblies provide a valuable resource for studying genome evolution and its consequences for phenotypic evolution in these genetic model species.
-
Abstract Many eukaryotic transcription factors (TFs) form homodimer or heterodimer complexes to regulate gene expression. Dimerization of BASIC LEUCINE ZIPPER (bZIP) TFs is critical for their functions, but the molecular mechanism underlying the DNA binding and functional specificity of homo- versus heterodimers remains elusive. To address this gap, we present the double DNA Affinity Purification-sequencing (dDAP-seq) technique that maps heterodimer binding sites on endogenous genomic DNA. Using dDAP-seq we profile twenty pairs of C/S1 bZIP heterodimers and S1 homodimers in Arabidopsis and show that heterodimerization significantly expands the DNA binding preferences of these TFs. Analysis of dDAP-seq binding sites reveals the function of bZIP9 in abscisic acid response and the role of bZIP53 heterodimer-specific binding in seed maturation. The C/S1 heterodimers show distinct preferences for the ACGT elements recognized by plant bZIPs and motifs resembling the yeast GCN4 cis-elements. This study demonstrates the potential of dDAP-seq in deciphering the DNA binding specificities of interacting TFs that are key for combinatorial gene regulation.

