skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on December 1, 2025

Title: Machine Learning Inference of Gene Regulatory Networks in Developing Mimulus Seeds
The angiosperm seed represents a critical evolutionary breakthrough that has been shown to propel the reproductive success and radiation of flowering plants. Seeds promote the rapid diversification of angiosperms by establishing postzygotic reproductive barriers, such as hybrid seed inviability. While prezygotic barriers to reproduction tend to be transient, postzygotic barriers are often permanent and therefore can play a pivotal role in facilitating speciation. This property of the angiosperm seed is exemplified in the Mimulus genus. In order to further the understanding of the gene regulatory mechanisms important in the Mimulus seed, we performed gene regulatory network (GRN) inference analysis by using time-series RNA-seq data from developing hybrid seeds from a viable cross between Mimulus guttatus and Mimulus pardalis. GRN inference has the capacity to identify active regulatory mechanisms in a sample and highlight genes of potential biological importance. In our case, GRN inference also provided the opportunity to uncover active regulatory relationships and generate a reference set of putative gene regulations. We deployed two GRN inference algorithms—RTP-STAR and KBoost—on three different subsets of our transcriptomic dataset. While the two algorithms yielded GRNs with different regulations and topologies when working with the same data subset, there was still significant overlap in the specific gene regulations they inferred, and they both identified potential novel regulatory mechanisms that warrant further investigation.  more » « less
Award ID(s):
1856223
PAR ID:
10600351
Author(s) / Creator(s):
; ;
Publisher / Repository:
Plants
Date Published:
Journal Name:
Plants
Volume:
13
Issue:
23
ISSN:
2223-7747
Page Range / eLocation ID:
3297
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Endosperm is an angiosperm innovation central to their reproduction whose development, and thus seed viability, is controlled by genomic imprinting, where expression from certain genes is parent-specific. Unsuccessful imprinting has been linked to failed inter-specific and inter-ploidy hybridization. Despite their importance in plant speciation, the underlying mechanisms behind these endosperm-based barriers remain poorly understood. Here, we describe one such barrier between diploid Mimulus guttatus and tetraploid Mimulus luteus. The two parents differ in endosperm DNA methylation, expression dynamics, and imprinted genes. Hybrid seeds suffer from underdeveloped endosperm, reducing viability, or arrested endosperm and seed abortion when M. guttatus or M. luteus is seed parent, respectively, and transgressive methylation and expression patterns emerge. The two inherited M. luteus subgenomes, genetically distinct but epigenetically similar, are expressionally dominant over the M. guttatus genome in hybrid embryos and especially their endosperm, where paternal imprints are perturbed. In aborted seeds, de novo methylation is inhibited, potentially owing to incompatible paternal instructions of imbalanced dosage from M. guttatus imprints. We suggest that diverged epigenetic/regulatory landscapes between parental genomes induce epigenetic repatterning and global shifts in expression, which, in endosperm, may uniquely facilitate incompatible interactions between divergent imprinting schemes, potentially driving rapid barriers. 
    more » « less
  2. Abstract Postmating reproductive isolation can help maintain species boundaries when premating barriers to reproduction are incomplete. The strength and identity of postmating reproductive barriers are highly variable among diverging species, leading to questions about their genetic basis and evolutionary drivers. These questions have been tackled in model systems but are less often addressed with broader phylogenetic resolution. In this study we analyse patterns of genetic divergence alongside direct measures of postmating reproductive barriers in an overlooked group of sympatric species within the model monkeyflower genus, Mimulus. Within this Mimulus brevipes species group, we find substantial divergence among species, including a cryptic genetic lineage. However, rampant gene discordance and ancient signals of introgression suggest a complex history of divergence. In addition, we find multiple strong postmating barriers, including postmating prezygotic isolation, hybrid seed inviability and hybrid male sterility. M. brevipes and M. fremontii have substantial but incomplete postmating isolation. For all other tested species pairs, we find essentially complete postmating isolation. Hybrid seed inviability appears linked to differences in seed size, providing a window into possible developmental mechanisms underlying this reproductive barrier. While geographic proximity and incomplete mating isolation may have allowed gene flow within this group in the distant past, strong postmating reproductive barriers today have likely played a key role in preventing ongoing introgression. By producing foundational information about reproductive isolation and genomic divergence in this understudied group, we add new diversity and phylogenetic resolution to our understanding of the mechanisms of plant speciation. Abstract Hybrid seed inviability and other postmating reproductive barriers isolate species in Mimulus section Eunanus. Variation in seed size may help explain hybrid seed failure. Whole-genome sequencing indicates a complex history of divergence, including signals of ancient introgression and cryptic diversity. 
    more » « less
  3. Inferring gene regulatory networks (GRNs) from single-cell gene expression datasets is a challenging task. Existing methods are often designed heuristically for specific datasets and lack the flexibility to incorporate additional information or compare against other algorithms. Further, current GRN inference methods do not provide uncertainty estimates with respect to the interactions that they predict, making inferred networks challenging to interpret. To overcome these challenges, we introduce Probabilistic Matrix Factorization for Gene Regulatory Network inference (PMF-GRN). PMF-GRN uses single-cell gene expression data to learn latent factors representing transcription factor activity as well as regulatory relationships between transcription factors and their target genes. This approach incorporates available experimental evidence into prior distributions over latent factors and scales well to single-cell gene expression datasets. By utilizing variational inference, we facilitate hyperparameter search for principled model selection and direct comparison to other generative models. To assess the accuracy of our method, we evaluate PMF-GRN using the model organisms Saccharomyces cerevisiae and Bacillus subtilis, benchmarking against database-derived gold standard interactions. We discover that, on average, PMF-GRN infers GRNs more accurately than current state-of-the-art single-cell GRN inference methods. Moreover, our PMF-GRN approach offers well-calibrated uncertainty estimates, as it performs gene regulatory network (GRN) inference in a probabilistic setting. These estimates are valuable for validation purposes, particularly when validated interactions are limited or a gold standard is incomplete. 
    more » « less
  4. Abstract The genetic dissection of reproductive barriers between diverging lineages provides enticing clues into the origin of species. One strategy uses linkage analysis in experimental crosses to identify genomic locations involved in phenotypes that mediate reproductive isolation. A second framework searches for genomic regions that show reduced rates of exchange across natural hybrid zones. It is often assumed that these approaches will point to the same loci, but this assumption is rarely tested. In this perspective, we discuss the factors that determine whether loci connected to postzygotic reproductive barriers in the laboratory are inferred to reduce gene flow in nature. We synthesize data on the genetics of postzygotic isolation in house mice, one of the most intensively studied systems in speciation genetics. In a rare empirical comparison, we measure the correspondence of loci tied to postzygotic barriers via genetic mapping in the laboratory and loci at which gene flow is inhibited across a natural hybrid zone. We find no evidence that the two sets of loci overlap beyond what is expected by chance. In light of these results, we recommend avenues for empirical and theoretical research to resolve the potential incongruence between the two predominant strategies for understanding the genetics of speciation. 
    more » « less
  5. Divergent natural selection has the potential to drive the evolution of reproductive isolation. The euryhaline killifish Lucania parva has stable populations in both fresh water and salt water. Lucania parva and its sister species, the freshwater L. goodei , are isolated by both prezygotic and postzygotic barriers. To further test whether adaptation to salinity has led to the evolution of these isolating barriers, we tested for incipient reproductive isolation within L. parva by crossing freshwater and saltwater populations. We found no evidence for prezygotic isolation, but reduced hybrid survival indicated that postzygotic isolation existed between L. parva populations. Therefore, postzygotic isolation evolved before prezygotic isolation in these ecologically divergent populations. Previous work on these species raised eggs with methylene blue, which acts as a fungicide. We found this fungicide distorts the pattern of postzygotic isolation by increasing fresh water survival in L. parva , masking species/population differences, and underestimating hybrid inviability. 
    more » « less