An autonomous, environmentally synchronizable circadian rhythm is a ubiquitous feature of life on Earth. In multicellular organisms, this rhythm is generated by a transcription-translation feedback loop present in nearly every cell that drives daily expression of thousands of genes in a tissue-dependent manner. Identifying the genes that are under circadian control can elucidate the mechanisms by which physiological processes are coordinated in multicellular organisms. Today, transcriptomic profiling at the single-cell level provides an unprecedented opportunity to understand the function of cell-level clocks. However, while many cycling detection algorithms have been developed to identify genes under circadian control in bulk transcriptomic data, it is not known how best to adapt these algorithms to single-cell RNA-seq data. Here, we benchmark commonly used circadian detection methods on their reliability and efficiency when applied to single-cell RNA-seq data. Our results provide guidance on adapting existing cycling detection methods to the single-cell domain and elucidate opportunities for more robust and efficient rhythm detection in single-cell data. We also propose a subsampling procedure combined with harmonic regression as an efficient strategy to detect circadian genes in the single-cell setting.
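The abstract names subsampling combined with harmonic regression but does not spell out the procedure. The following is a minimal sketch of one way such a pipeline could look, not the authors' implementation. It assumes a fixed 24-hour period, a single gene whose per-cell expression values are grouped by collection time, and a uniform random draw of a fixed number of cells per time point. The function names (`subsample_cells`, `harmonic_regression`) and the input layout are hypothetical. The p-value comes from the standard F-test of the cosine and sine terms against a constant-only model.

```python
import math
import random


def subsample_cells(cells_by_time, n_per_time, rng):
    """Draw up to n_per_time cells at random from each time point.

    cells_by_time maps a collection time (hours) to a list of per-cell
    expression values for one gene (an assumed, illustrative layout).
    """
    times, values = [], []
    for t, cells in sorted(cells_by_time.items()):
        chosen = rng.sample(cells, min(n_per_time, len(cells)))
        times.extend([t] * len(chosen))
        values.extend(chosen)
    return times, values


def _solve(a, c):
    """Solve a small linear system by Gaussian elimination with pivoting."""
    n = len(c)
    m = [row[:] + [ci] for row, ci in zip(a, c)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("design is singular; sample more time points")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for k in range(col, n + 1):
                m[r][k] -= factor * m[col][k]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][k] * x[k] for k in range(i + 1, n))) / m[i][i]
    return x


def harmonic_regression(times, values, period=24.0):
    """Least-squares fit of y = mesor + a*cos(wt) + b*sin(wt)."""
    n = len(times)
    if n < 4:
        raise ValueError("need at least 4 observations")
    w = 2.0 * math.pi / period
    rows = [(1.0, math.cos(w * t), math.sin(w * t)) for t in times]
    # Normal equations: (X^T X) beta = X^T y
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xty = [sum(r[i] * y for r, y in zip(rows, values)) for i in range(3)]
    mesor, a, b = _solve(xtx, xty)
    mean_y = sum(values) / n
    rss = sum((y - (mesor + a * r[1] + b * r[2])) ** 2 for r, y in zip(rows, values))
    tss = sum((y - mean_y) ** 2 for y in values)
    # F-test with (2, n - 3) degrees of freedom. For 2 numerator degrees of
    # freedom the survival function has the closed form (rss / tss) ** ((n - 3) / 2).
    p_value = 1.0 if tss <= 0.0 else min(1.0, rss / tss) ** ((n - 3) / 2.0)
    return {
        "mesor": mesor,
        "amplitude": math.hypot(a, b),
        "peak_time": (math.atan2(b, a) / w) % period,  # hours
        "p_value": p_value,
    }


if __name__ == "__main__":
    rng = random.Random(1)
    w = 2.0 * math.pi / 24.0
    # Synthetic gene: 200 cells per time point, peak near hour 6, noisy.
    cells = {
        t: [5.0 + 2.0 * math.cos(w * (t - 6.0)) + rng.gauss(0.0, 1.0) for _ in range(200)]
        for t in range(0, 48, 4)
    }
    times, values = subsample_cells(cells, 20, rng)
    print(harmonic_regression(times, values))
```

Fitting on a subsample keeps the cost per gene proportional to the number of cells drawn rather than the number profiled. The draw size (20 cells per time point here) is an arbitrary illustrative choice.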
- Award ID(s): 1764421
- PAR ID: 10525843
- Publisher / Repository: bioRxiv
- Date Published:
- Format(s): Medium: X
- Institution: bioRxiv
- Sponsoring Org: National Science Foundation
More Like this
- The circadian rhythm drives the oscillatory expression of thousands of genes across all tissues, coordinating physiological processes. The effect of this rhythm on health has generated increasing interest in discovering genes under circadian control by searching for periodic patterns in transcriptomic time-series experiments. While algorithms for detecting cycling transcripts have advanced, there remains little guidance quantifying the effect of experimental design and analysis choices on cycling detection accuracy. We present TimeTrial, a user-friendly benchmarking framework using both real and synthetic data to investigate cycle detection algorithms’ performance and improve circadian experimental design. Results show that the optimal choice of analysis method depends on the sampling scheme, noise level, and shape of the waveform of interest, and provide guidance on the impact of sampling frequency and duration on cycling detection accuracy. The TimeTrial software is freely available for download and may also be accessed through a web interface. By supplying a tool to vary and optimize experimental design considerations, TimeTrial will enhance circadian transcriptomics studies.
- Motivation: The circadian rhythm drives the oscillatory expression of thousands of genes across all tissues. The recent revolution in high-throughput transcriptomics, coupled with the significant implications of the circadian clock for human health, has sparked an interest in circadian profiling studies to discover genes under circadian control. Results: We present TimeCycle, a topology-based rhythm detection method designed to identify cycling transcripts. For a given time series, the method reconstructs the state space using time-delay embedding, a data transformation technique from dynamical systems theory. Takens’ theorem implies that, in the embedded space, the dynamics of a rhythmic signal will exhibit circular patterns. The degree of circularity of the embedding is calculated as a persistence score using persistent homology, an algebraic method for discerning the topological features of data. Cycling genes are identified by comparing the persistence scores to a bootstrapped null distribution. Results in both synthetic and biological data highlight TimeCycle’s ability to identify cycling genes across a range of sampling schemes, numbers of replicates, and amounts of missing data. Comparison to competing methods highlights their relative strengths, providing guidance as to the optimal choice of cycling detection method. Availability and Implementation: A fully documented open-source R package implementing TimeCycle is available at: https://nesscoder.github.io/TimeCycle/.
- In mammals, T-cell migration is under circadian control, likely to anticipate daily rhythms in infection risk. Glucocorticoids are a major controller of circadian processes, and malnutrition is associated with increased glucocorticoid secretion. Previous studies suggest malnutrition may impart a “super-quiescent” phenotype to T-cells, enabling a greater number of naïve T-cells to survive short-term malnutrition, albeit with diminished function. Thus, we hypothesize that malnourished T-cells may conserve energy by disengaging from rhythmic migration under circadian control and/or foregoing migration to reside in the bone marrow instead. To test this hypothesis, the total number of nucleated cells and naïve CD4+ and CD8+ T-cells in the blood, spleen, bone marrow, and brachial and mesenteric lymph nodes were enumerated by flow cytometry every four hours over the course of one day in control and malnourished mice. Additionally, expression levels of CD127 and CXCR4 in both T-cell populations and the concentration of glucocorticoids in the blood were assessed. A better understanding of how malnutrition affects the circadian rhythm of T-cell migration will help reveal not only how circadian rhythms work but also how organisms’ circadian rhythms change in response to malnutrition. This knowledge of how malnutrition disrupts the circadian rhythm of T-cells may help improve vaccination strategies in malnourished children. Supported by NSF-MRI [DBI-1920116] and NSF-RUI [IOS-1951881].
- The circadian clock is a central driver of many biological and behavioral processes, regulating the levels of many genes and proteins, termed clock-controlled genes and proteins (CCGs/CCPs), to impart biological timing at the molecular level. While transcriptomic and proteomic data have been analyzed to find potential CCGs and CCPs, multi-omic modeling of circadian data, which has the potential to enhance the understanding of circadian control of biological timing, remains relatively rare due to several methodological hurdles. To address this gap, a Dual-approach Co-expression Analysis Framework (D-CAF) was created to perform perturbation-robust co-expression analysis on time-series measurements of both transcripts and proteins. Applying D-CAF to previously collected transcriptomic and proteomic data from mouse macrophages sampled over circadian time, we identified small, highly significant clusters of oscillating transcripts and proteins in the unweighted similarity matrices and larger, less significant clusters of oscillating transcripts and proteins using the weighted similarity network. Functional enrichment analysis of these clusters identified novel immunological response pathways that appear to be under circadian control. Overall, our findings suggest that D-CAF is a tool that can be used by the circadian community to integrate multi-omic circadian data to improve our understanding of the mechanisms of circadian regulation of molecular processes.
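The TimeCycle abstract in the list above describes reconstructing the state space of a time series by time-delay embedding, in which a rhythmic signal traces a circular pattern. The sketch below illustrates only that embedding step and is not taken from the TimeCycle package. The function name `delay_embed`, the embedding dimension, and the lag are illustrative assumptions, and the persistent-homology scoring and bootstrapped null distribution are omitted.

```python
import math


def delay_embed(series, dim=2, lag=1):
    """Map a 1-D series to points (x[i], x[i + lag], ..., x[i + (dim - 1) * lag])."""
    n_points = len(series) - (dim - 1) * lag
    if n_points <= 0:
        raise ValueError("series too short for this dim and lag")
    return [tuple(series[i + k * lag] for k in range(dim)) for i in range(n_points)]


if __name__ == "__main__":
    # A noise-free 24 h cosine sampled every 2 h for 48 h.
    w = 2.0 * math.pi / 24.0
    series = [math.cos(w * t) for t in range(0, 48, 2)]
    # A lag of 3 samples (6 h, a quarter period) pairs cos(wt) with -sin(wt),
    # so the embedded points fall on a circle.
    for x, y in delay_embed(series, dim=2, lag=3):
        print(f"{x:+.3f} {y:+.3f} radius={math.hypot(x, y):.3f}")
```

A quarter-period lag is chosen here only because it makes the circle easy to see for a pure sinusoid. A non-rhythmic series embedded the same way would not be expected to enclose a hole, which is the property a circularity score is meant to capture.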