Abstract Finding similarities between model parameters across different catchments has proved to be challenging. Existing approaches struggle due to catchment heterogeneity and non‐linear dynamics. In particular, attempts to correlate catchment attributes with hydrological responses have failed due to interdependencies among variables and consequent equifinality. Machine Learning (ML), particularly the Long Short‐Term Memory (LSTM) approach, has demonstrated strong predictive and spatial regionalization performance. However, understanding the nature of the regionalization relationships remains difficult. This study proposes a novel approach to partially decouple learning the representation of (a) catchment dynamics by using theHydroLSTMarchitecture and (b) spatial regionalization relationships by using aRandom Forest(RF) clustering approach to learn the relationships between the catchment attributes and dynamics. This coupled approach, calledRegional HydroLSTM, learns a representation of “potential streamflow” using a single cell‐state, while the output gate corrects it to correspond to the temporal context of the current hydrologic regime. RF clusters mediate the relationship between catchment attributes and dynamics, allowing identification of spatially consistent hydrological regions, thereby providing insight into the factors driving spatial and temporal hydrological variability. Results suggest that by combining complementary architectures, we can enhance the interpretability of regional machine learning models in hydrology, offering a new perspective on the “catchment classification” problem. We conclude that an improved understanding of the underlying nature of hydrologic systems can be achieved by careful design of ML architectures to target the specific things we are seeking to learn from the data.
more »
« less
Geometry of gene regulatory dynamics
Embryonic development leads to the reproducible and ordered appearance of complexity from egg to adult. The successive differentiation of different cell types that elaborate this complexity results from the activity of gene networks and was likened by Waddington to a flow through a landscape in which valleys represent alternative fates. Geometric methods allow the formal representation of such landscapes and codify the types of behaviors that result from systems of differential equations. Results from Smale and coworkers imply that systems encompassing gene network models can be represented as potential gradients with a Riemann metric, justifying the Waddington metaphor. Here, we extend this representation to include parameter dependence and enumerate all three-way cellular decisions realizable by tuning at most two parameters, which can be generalized to include spatial coordinates in a tissue. All diagrams of cell states vs. model parameters are thereby enumerated. We unify a number of standard models for spatial pattern formation by expressing them in potential form (i.e., as topographic elevation). Turing systems appear nonpotential, yet in suitable variables the dynamics are low dimensional and potential. A time-independent embedding recovers the original variables. Lateral inhibition is described by a saddle point with many unstable directions. A model for the patterning of theDrosophilaeye appears as relaxation in a bistable potential. Geometric reasoning provides intuitive dynamic models for development that are well adapted to fit time-lapse data.
more »
« less
- Award ID(s):
- 2013131
- PAR ID:
- 10308321
- Publisher / Repository:
- Proceedings of the National Academy of Sciences
- Date Published:
- Journal Name:
- Proceedings of the National Academy of Sciences
- Volume:
- 118
- Issue:
- 38
- ISSN:
- 0027-8424
- Page Range / eLocation ID:
- Article No. e2109729118
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Khila, Abderrahman (Ed.)The evolution of sexual secondary characteristics necessitates regulatory factors that confer sexual identity to differentiating tissues and cells. InColias eurythemebutterflies, males exhibit two specialized wing scale types—ultraviolet-iridescent (UVI) and spatulate scales—which are absent in females and likely integral to male courtship behavior. This study investigates the regulatory mechanisms and single-nucleus transcriptomics underlying these two sexually dimorphic cell types during wing development. We show thatDoublesex(Dsx) expression is itself dimorphic and required to repress the UVI cell state in females, while unexpectedly, UVI activation in males is independent fromDsx. In the melanic marginal band,Dsxis required in each sex to enforce the presence of spatulate scales in males, and their absence in females. Single-nucleus RNAseq reveals that UVI and spatulate scale cell precursors each show distinctive gene expression profiles at 40% of pupal development, with marker genes that include regulators of transcription, cell signaling, cytoskeletal patterning, and chitin secretion. Both male-specific cell types share a low expression of theBric-a-brac(Bab) transcription factor, a key repressor of the UVI fate. Bab ChIP-seq profiling suggests that Bab binds thecis-regulatory regions of gene markers associated to UVI fate, including potential effector genes involved in the regulation of cytoskeletal processes and chitin secretion, and loci showing signatures of recent selective sweeps in a UVI-polymorphic population. These findings open new avenues for exploring wing patterning and scale development, shedding light on the mechanisms driving the specification of sex-specific cell states and the differentiation of specialized cell ultrastructures.more » « less
-
Abstract BackgroundThe biophysics of an organism span multiple scales from subcellular to organismal and include processes characterized by spatial properties, such as the diffusion of molecules, cell migration, and flow of intravenous fluids. Mathematical biology seeks to explain biophysical processes in mathematical terms at, and across, all relevant spatial and temporal scales, through the generation of representative models. While non-spatial, ordinary differential equation (ODE) models are often used and readily calibrated to experimental data, they do not explicitly represent the spatial and stochastic features of a biological system, limiting their insights and applications. However, spatial models describing biological systems with spatial information are mathematically complex and computationally expensive, which limits the ability to calibrate and deploy them and highlights the need for simpler methods able to model the spatial features of biological systems. ResultsIn this work, we develop a formal method for deriving cell-based, spatial, multicellular models from ODE models of population dynamics in biological systems, and vice versa. We provide examples of generating spatiotemporal, multicellular models from ODE models of viral infection and immune response. In these models, the determinants of agreement of spatial and non-spatial models are the degree of spatial heterogeneity in viral production and rates of extracellular viral diffusion and decay. We show how ODE model parameters can implicitly represent spatial parameters, and cell-based spatial models can generate uncertain predictions through sensitivity to stochastic cellular events, which is not a feature of ODE models. Using our method, we can test ODE models in a multicellular, spatial context and translate information to and from non-spatial and spatial models, which help to employ spatiotemporal multicellular models using calibrated ODE model parameters. We additionally investigate objects and processes implicitly represented by ODE model terms and parameters and improve the reproducibility of spatial, stochastic models. ConclusionWe developed and demonstrate a method for generating spatiotemporal, multicellular models from non-spatial population dynamics models of multicellular systems. We envision employing our method to generate new ODE model terms from spatiotemporal and multicellular models, recast popular ODE models on a cellular basis, and generate better models for critical applications where spatial and stochastic features affect outcomes.more » « less
-
Summary Pluripotency, the ability of cells to self-renew and differentiate into all the cell types in an animal’s body, is vital for mammalian early development. This study presented a comprehensive comparative transcriptomic analysis of embryonic stem cells across multiple mammalian species, defining their progression through expanded/extended, naïve, formative, and primed pluripotency states. Our findings revealed both conserved and species-specific mechanisms underlying pluripotency regulation. We also emphasized the limitations of existing state-specific markers and their limited cross-species applicability, while identifyingde novopluripotency markers that can inform future research. Despite variability in gene expression dynamics, gene co-expression networks showed remarkable conservation across species. Among pluripotency states, the primed state demonstrated the highest conservation, evidenced by shared markers, preserved gene networks, and stronger selective pressures acting on its genes. These findings provide critical insights into the evolution and regulation of pluripotency, laying a foundation for refining stem cell models to enhance their translational potential in regenerative medicine, agriculture, and conservation biology.more » « less
-
Systematic enumeration and identification of unique 3D spatial topologies (STs) of complex engineering systems (such as automotive cooling systems, electric power trains, satellites, and aero-engines) are essential to navigation of these expansive design spaces with the goal of identifying new spatial configurations that can satisfy challenging system requirements. However, efficient navigation through discrete 3D ST options is a very challenging problem due to its combinatorial nature and can quickly exceed human cognitive abilities at even moderate complexity levels. This article presents a new, efficient, and scalable design framework that leverages mathematical spatial graph theory to represent, enumerate, and identify distinctive 3D topological classes for a generic 3D engineering system, given its system architecture (SA)—its components and their interconnections. First, spatial graph diagrams (SGDs) are generated for a given SA from zero to a specified maximum number of interconnect crossings. Then, corresponding Yamada polynomials for all the planar SGDs are generated. SGDs are categorized into topological classes, each of which shares a unique Yamada polynomial. Finally, within each topological class, 3D geometric models are generated using the SGDs having different numbers of interconnect crossings. Selected case studies are presented to illustrate the different features of our proposed framework, including an industrial engineering design application: ST enumeration of a 3D automotive fuel cell cooling system (AFCS). Design guidelines are also provided for practicing engineers to aid the application of this framework to different types of real-world problems such as configuration design and spatial packaging optimization.more » « less
An official website of the United States government
