skip to main content


Title: Statistical inference of discrete combinatorial functional dependency in biological systems
Inference of a combinatorial function from multiple independent variables (parents) to a dependent variable (child) in a discrete space can be useful in detecting nonlinear relationships in biological systems. Popular conditional independency measures, heavily used in combinatorial inference, are often insensitive to the direction of functional dependency. To address this issue, we define multivariate and conditional functional chi-squared statistics. We also present an algorithm called CFDF for bivariate discrete function inference via an exclusive-effect strategy, in order to identify a best parent set for a given child. It requires each parent to make sufficient contribution beyond any marginal effect. Simulation studies suggest a marked advantage of our framework over alternatives. Applying the method to transcriptome data in genetically perturbed biological systems, we reproduced combinatorial gene interactions known in the literature. Most importantly, we identified combinatorial patterns from joint RNA and protein data to rebut a dispute on the founding principle of molecular biology.  more » « less
Award ID(s):
1661331
NSF-PAR ID:
10168084
Author(s) / Creator(s):
;
Date Published:
Journal Name:
Proceedings of the 14th Machine Learning in Computational Biology (MLCB) Meeting
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. As severe dropout in single-cell RNA sequencing (scRNA-seq) degrades data quality, current methods for network inference face increased uncertainty from such data. To examine how dropout influences directional dependency inference from scRNA-seq data, we thus studied four methods based on discrete data that are model-free without parametric model assumptions. They include two established methods: conditional entropy and Kruskal-Wallis test, and two recent methods: causal inference by stochastic complexity and function index. We also included three non-directional methods for a contrast. On simulated data, function index performed most favorably at varying dropout rates, sample sizes, and discrete levels. On an scRNA-seq dataset from developing mouse cerebella, function index and Kruskal-Wallis test performed favorably over other methods in detecting expression of developmental genes as a function of time. Overall among the four methods, function index is most resistant to dropout for both directional and dependency inference. The next best choice, Kruskal-Wallis test, carries a directional bias towards a uniformly distributed variable. We conclude that a method robust to marginal distributions with a sufficiently large sample size can reap benefits of single-cell over bulk RNA sequencing in understanding molecular mechanisms at the cellular resolution. 
    more » « less
  2. Background

    Research to date has largely conceptualized irritability in terms of intraindividual differences. However, the role of interpersonal dyadic processes has received little consideration. Nevertheless, difficulties in how parent–child dyads synchronize during interactions may be an important correlate of irritably in early childhood. Innovations in developmentally sensitive neuroimaging methods now enable the use of measures of neural synchrony to quantify synchronous responses in parent–child dyads and can help clarify the neural underpinnings of these difficulties. We introduce the Disruptive Behavior Diagnostic Observation Schedule: Biological Synchrony (DB‐DOS:BioSync) as a paradigm for exploring parent–child neural synchrony as a potential biological mechanism for interpersonal difficulties in preschool psychopathology.

    Methods

    Using functional near‐infrared spectroscopy (fNIRS) 4‐ to 5‐year‐olds (N = 116) and their mothers completed the DB‐DOS:BioSync while assessing neural synchrony during mild frustration and recovery. Child irritability was measured using a latent irritability factor that was calculated from four developmentally sensitive indicators.

    Results

    Both the mild frustration and the recovery contexts resulted in neural synchrony. However, less neural synchrony during the recovery context only was associated with more child irritability.

    Conclusions

    Our results suggest that recovering after a frustrating period might be particularly challenging for children high in irritability and offer support for the use of the DB‐DOS:BioSync task to elucidate interpersonal neural mechanisms of developmental psychopathology.

     
    more » « less
  3. Despite its benefits for children’s skill development and parent-child bonding, many parents do not often engage in interactive storytelling by having story-related dialogues with their child due to limited availability or challenges in coming up with appropriate questions. While recent advances made AI generation of questions from stories possible, the fully-automated approach excludes parent involvement, disregards educational goals, and underoptimizes for child engagement. Informed by need-finding interviews and participatory design (PD) results, we developed StoryBuddy, an AI-enabled system for parents to create interactive storytelling experiences. StoryBuddy’s design highlighted the need for accommodating dynamic user needs between the desire for parent involvement and parent-child bonding and the goal of minimizing parent intervention when busy. The PD revealed varied assessment and educational goals of parents, which StoryBuddy addressed by supporting configuring question types and tracking child progress. A user study validated StoryBuddy’s usability and suggested design insights for future parent-AI collaboration systems. 
    more » « less
  4. null (Ed.)
    Parent-child similarities and discrepancies at multiple levels provide a window to understand the cultural transmission process. Although prior research has examined parent-child similarities at the belief, behavioral, and physiological levels across cultures, little is known about parent-child similarities at the neural level. The current review introduces an interdisciplinary computational cultural neuroscience approach, which utilizes computational methods to understand neural and psychological processes being involved during parent-child interactions at intra- and inter-personal level. This review provides three examples, including the application of intersubject representational similarity analysis to analyze naturalistic neuroimaging data, the usage of computer vision to capture non-verbal social signals during parent-child interactions, and unraveling the psychological complexities involved during real-time parent-child interactions based on their simultaneous recorded brain response patterns. We hope that this computational cultural neuroscience approach can provide researchers an alternative way to examine parent-child similarities and discrepancies across different cultural contexts and gain a better understanding of cultural transmission processes. 
    more » « less
  5. In the absence of data from a randomized trial, researchers may aim to use observational data to draw causal inference about the effect of a treatment on a time-to-event outcome. In this context, interest often focuses on the treatment-specific survival curves, that is, the survival curves were the population under study to be assigned to receive the treatment or not. Under certain conditions, including that all confounders of the treatment-outcome relationship are observed, the treatment-specific survival curve can be identified with a covariate-adjusted survival curve. In this article, we propose a novel cross-fitted doubly-robust estimator that incorporates data-adaptive (e.g. machine learning) estimators of the conditional survival functions. We establish conditions on the nuisance estimators under which our estimator is consistent and asymptotically linear, both pointwise and uniformly in time. We also propose a novel ensemble learner for combining multiple candidate estimators of the conditional survival estimators. Notably, our methods and results accommodate events occurring in discrete or continuous time, or an arbitrary mix of the two. We investigate the practical performance of our methods using numerical studies and an application to the effect of a surgical treatment to prevent metastases of parotid carcinoma on mortality. 
    more » « less