Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Mathelier, Anthony (Ed.)Abstract MotivationStochastic gene expression and cell-to-cell heterogeneity have attracted increased interest in recent years, enabled by advances in single-cell measurement technologies. These studies are also increasingly complemented by quantitative biophysical modeling, often using the framework of stochastic biochemical kinetic models. However, inferring parameters for such models (i.e., the kinetic rates of biochemical reactions) remains a technical and computational challenge, particularly doing so in a manner that can leverage high-throughput single-cell sequencing data. ResultsIn this work, we develop a chemical master equation model reference library-based computational pipeline to infer kinetic parameters describing noisy mRNA distributions from single-cell RNA sequencing data, using the commonly applied stochastic telegraph model. The approach fits kinetic parameters via steady-state distributions, as measured across a population of cells in snapshot data. Our pipeline also serves as a tool for comprehensive analysis of parameter identifiability, in both a priori (studying model properties in the absence of data) and a posteriori (in the context of a particular dataset) use-cases. The pipeline can perform both of these tasks, i.e. inference and identifiability analysis, in an efficient and scalable manner, and also serves to disentangle contributions to uncertainty in inferred parameters from experimental noise versus structural properties of the model. We found that for the telegraph model, the majority of the parameter space is not practically identifiable from single-cell RNA sequencing data, and low experimental capture rates worsen the identifiability. Our methodological framework could be extended to other data types in the fitting of small biochemical network models. Availability and implementationAll code relevant to this work is available at https://github.com/Read-Lab-UCI/TelegraphLikelihoodInfer, archival DOI: https://doi.org/10.5281/zenodo.16915450.more » « lessFree, publicly-accessible full text available November 1, 2026
-
Abstract The 3D organization of the genome—in particular, which two regions of DNA are in contact with each other—plays a role in regulating gene expression. Several factors influence genome 3D organization. Nucleosomes (where ∼100 base pairs of DNA wrap around histone proteins) bend, twist, and compactify chromosomal DNA, altering its polymer mechanics. How much does the positioning of nucleosomes between gene loci influence contacts between those gene loci? And to what extent are polymer mechanics responsible for this? To address this question, we combine a stochastic polymer mechanics model of chromosomal DNA including twists and wrapping induced by nucleosomes with two data-driven pipelines. The first estimates nucleosome positioning from ATAC-seq data in regions of high accessibility. Most of the genome is low accessibility, so we combine this with a novel image analysis method that estimates the distribution of nucleosome spacing from electron microscopy data. There are no fit parameters in the biophysical model. We apply this method to IL-6, IL-15, CXCL9, and CXCL10, inflammatory marker genes in macrophages, before and after inflammatory stimulation, and compare the predictions with contacts measured by conformation capture experiments (4C-seq). We find that within a 500-kb genomic region, polymer mechanics with nucleosomes can explain 71% of close contacts. These results suggest that, while genome contacts on 100 kb scales are multifactorial, they may be amenable to mechanistic, physical explanation. Our work also highlights the role of nucleosomes, not just at the loci of interest, but between them, and not just the total number of nucleosomes, but their specific placement. The method generalizes to other genes, and can be used to address whether a contact is under active regulation by the cell (e.g. a macrophage during inflammatory stimulation).more » « less
-
Genome function requires regulated genome motion. However, tools to directly observe this motion in vivo have been limited in coverage and resolution. Here we introduce an approach to tile mammalian chromosomes with self-mapping fluorescent labels and track them at ultraresolution. We find that sequences separated by submegabase distances transition to proximity in tens of seconds. This rapid search is dependent on cohesin and is exhibited only within domains. Domain borders act as kinetic impediments to this search process, rather than structural boundaries. The genomic separation–dependent scaling of the search time for cis interactions violated predictions of diffusion, suggesting motor-driven folding. We also uncover cohesin-dependent processive motion at 2.7 kilobases per second. Together, these multiscale dynamics reveal the organization of the genome into kinetically associated domains.more » « lessFree, publicly-accessible full text available September 18, 2026
-
Euchromatin is an accessible phase of genetic material containing genes that encode proteins with increased expression levels. The structure of euchromatin in vitro has been described as a 30-nm fiber formed from ordered nucleosome arrays. However, recent advances in microscopy have revealed an in vivo euchromatin architecture that is much more disordered, characterized by variable-length linker DNA and sporadic nucleosome clusters. In this work, we develop a theoretical model to elucidate factors contributing to the disordered in vivo architecture of euchromatin. We begin by developing a 1D model of nucleosome positioning that captures the interactions between bound epigenetic reader proteins to predict the distribution of DNA linker lengths between adjacent nucleosomes. We then use the predicted linker lengths to construct 3D chromatin configurations consistent with the physical properties of DNA within the nucleosome array, and we evaluate the distribution of nucleosome cluster sizes in those configurations. Our model reproduces experimental cluster-size distributions, which are dramatically influenced by the local pattern of epigenetic marks and the concentration of reader proteins. Based on our model, we attribute the disordered arrangement of euchromatin to the heterogeneous binding of reader proteins and subsequent short-range interactions between bound reader proteins on adjacent nucleosomes. By replicating experimental results with our physics-based model, we propose a mechanism for euchromatin organization in the nucleus that impacts gene regulation and the maintenance of epigenetic marks.more » « less
An official website of the United States government
