Search for: All records

Creators/Authors contains: "Uhler, Caroline"

« Prev Next »

Total Resources

35

Resource Type
Conference Paper

0

Conference Proceeding

0

Dataset

0

Journal Article

35

Workshop Report

0

Availability
Full Text / Resource Available

35

Citation Only

0

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Transfer Learning with Kernel Methods

https://doi.org/10.1038/s41467-023-41215-8

Radhakrishnan, Adityanarayanan ; Ruiz Luyten, Max ; Prasad, Neha ; Uhler, Caroline ( September 2023 , Nature Communications)

Abstract
Transfer learning refers to the process of adapting a model trained on a source task to a target task. While kernel methods are conceptually and computationally simple models that are competitive on a variety of tasks, it has been unclear how to develop scalable kernel-based transfer learning methods across general source and target tasks with possibly differing label dimensions. In this work, we propose a transfer learning framework for kernel methods by projecting and translating the source model to the target task. We demonstrate the effectiveness of our framework in applications to image classification and virtual drug screening. For both applications, we identify simple scaling laws that characterize the performance of transfer-learned kernels as a function of the number of target examples. We explain this phenomenon in a simplified linear setting, where we are able to derive the exact scaling laws.

more » « less
The DeCAMFounder: nonlinear causal discovery in the presence of hidden variables

https://doi.org/10.1093/jrsssb/qkad071

Agrawal, Raj ; Squires, Chandler ; Prasad, Neha ; Uhler, Caroline ( July 2023 , Journal of the Royal Statistical Society Series B: Statistical Methodology)

Abstract
Many real-world decision-making tasks require learning causal relationships between a set of variables. Traditional causal discovery methods, however, require that all variables are observed, which is often not feasible in practical scenarios. Without additional assumptions about the unobserved variables, it is not possible to recover any causal relationships from observational data. Fortunately, in many applied settings, additional structure among the confounders can be expected. In particular, pervasive confounding is commonly encountered and has been utilised for consistent causal estimation in linear causal models. In this article, we present a provably consistent method to estimate causal relationships in the nonlinear, pervasive confounding setting. The core of our procedure relies on the ability to estimate the confounding variation through a simple spectral decomposition of the observed data matrix. We derive a DAG score function based on this insight, prove its consistency in recovering a correct ordering of the DAG, and empirically compare it to previous approaches. We demonstrate improved performance on both simulated and real datasets by explicitly accounting for both confounders and nonlinear effects.

more » « less
Wide and deep neural networks achieve consistency for classification

https://doi.org/10.1073/pnas.2208779120

Radhakrishnan, Adityanarayanan ; Belkin, Mikhail ; Uhler, Caroline ( April 2023 , Proceedings of the National Academy of Sciences)

While neural networks are used for classification tasks across domains, a long-standing open problem in machine learning is determining whether neural networks trained using standard procedures are consistent for classification, i.e., whether such models minimize the probability of misclassification for arbitrary data distributions. In this work, we identify and construct an explicit set of neural network classifiers that are consistent. Since effective neural networks in practice are typically both wide and deep, we analyze infinitely wide networks that are also infinitely deep. In particular, using the recent connection between infinitely wide neural networks and neural tangent kernels, we provide explicit activation functions that can be used to construct networks that achieve consistency. Interestingly, these activation functions are simple and easy to implement, yet differ from commonly used activations such as ReLU or sigmoid. More generally, we create a taxonomy of infinitely wide and deep networks and show that these models implement one of three well-known classifiers depending on the activation function used: 1) 1-nearest neighbor (model predictions are given by the label of the nearest training example); 2) majority vote (model predictions are given by the label of the class with the greatest representation in the training set); or 3) singular kernel classifiers (a set of classifiers containing those that achieve consistency). Our results highlight the benefit of using deep networks for classification tasks, in contrast to regression tasks, where excessive depth is harmful.
more » « less
Full Text Available
Graph-based autoencoder integrates spatial transcriptomics with chromatin images and identifies joint biomarkers for Alzheimer’s disease

https://doi.org/10.1038/s41467-022-35233-1

Zhang, Xinyi ; Wang, Xiao ; Shivashankar, G. V. ; Uhler, Caroline ( December 2022 , Nature Communications)

Abstract
Tissue development and disease lead to changes in cellular organization, nuclear morphology, and gene expression, which can be jointly measured by spatial transcriptomic technologies. However, methods for jointly analyzing the different spatial data modalities in 3D are still lacking. We present a computational framework to integrate Spatial Transcriptomic data using over-parameterized graph-based Autoencoders with Chromatin Imaging data (STACI) to identify molecular and functional alterations in tissues. STACI incorporates multiple modalities in a single representation for downstream tasks, enables the prediction of spatial transcriptomic data from nuclear images in unseen tissue sections, and provides built-in batch correction of gene expression and tissue morphology through over-parameterization. We apply STACI to analyze the spatio-temporal progression of Alzheimer’s disease and identify the associated nuclear morphometric and coupled gene expression features. Collectively, we demonstrate the importance of characterizing disease progression by integrating multiple data modalities and its potential for the discovery of disease biomarkers.

more » « less
Cross-modal autoencoder framework learns holistic representations of cardiovascular state

https://doi.org/10.1038/s41467-023-38125-0

Radhakrishnan, Adityanarayanan ; Friedman, Sam F. ; Khurshid, Shaan ; Ng, Kenney ; Batra, Puneet ; Lubitz, Steven A. ; Philippakis, Anthony A. ; Uhler, Caroline ( April 2023 , Nature Communications)

Abstract
A fundamental challenge in diagnostics is integrating multiple modalities to develop a joint characterization of physiological state. Using the heart as a model system, we develop a cross-modal autoencoder framework for integrating distinct data modalities and constructing a holistic representation of cardiovascular state. In particular, we use our framework to construct such cross-modal representations from cardiac magnetic resonance images (MRIs), containing structural information, and electrocardiograms (ECGs), containing myoelectric information. We leverage the learned cross-modal representation to (1) improve phenotype prediction from a single, accessible phenotype such as ECGs; (2) enable imputation of hard-to-acquire cardiac MRIs from easy-to-acquire ECGs; and (3) develop a framework for performing genome-wide association studies in an unsupervised manner. Our results systematically integrate distinct diagnostic modalities into a common representation that better characterizes physiologic state.

more » « less
Lateral confined growth of cells activates Lef1 dependent pathways to regulate cell-state transitions

https://doi.org/10.1038/s41598-022-21596-4

Yuan, Luezhen ; Roy, Bibhas ; Ratna, Prasuna ; Uhler, Caroline ; Shivashankar, G. V. ( October 2022 , Scientific Reports)

Abstract
Long-term sustained mechano-chemical signals in tissue microenvironment regulate cell-state transitions. In recent work, we showed that laterally confined growth of fibroblasts induce dedifferentiation programs. However, the molecular mechanisms underlying such mechanically induced cell-state transitions are poorly understood. In this paper, we identify Lef1 as a critical somatic transcription factor for the mechanical regulation of de-differentiation pathways. Network optimization methods applied to time-lapse RNA-seq data identify Lef1 dependent signaling as potential regulators of such cell-state transitions. We show that Lef1 knockdown results in the down-regulation of fibroblast de-differentiation and that Lef1 directly interacts with the promoter regions of downstream reprogramming factors. We also evaluate the potential upstream activation pathways of Lef1, including the Smad4, Atf2, NFkB and Beta-catenin pathways, thereby identifying that Smad4 and Atf2 may be critical for Lef1 activation. Collectively, we describe an important mechanotransduction pathway, including Lef1, which upon activation, through progressive lateral cell confinement, results in fibroblast de-differentiation.

more » « less
Simple, fast, and flexible framework for matrix completion with infinite width neural networks

https://doi.org/10.1073/pnas.2115064119

Radhakrishnan, Adityanarayanan ; Stefanakis, George ; Belkin, Mikhail ; Uhler, Caroline ( April 2022 , Proceedings of the National Academy of Sciences)

Matrix completion problems arise in many applications including recommendation systems, computer vision, and genomics. Increasingly larger neural networks have been successful in many of these applications but at considerable computational costs. Remarkably, taking the width of a neural network to infinity allows for improved computational performance. In this work, we develop an infinite width neural network framework for matrix completion that is simple, fast, and flexible. Simplicity and speed come from the connection between the infinite width limit of neural networks and kernels known as neural tangent kernels (NTK). In particular, we derive the NTK for fully connected and convolutional neural networks for matrix completion. The flexibility stems from a feature prior, which allows encoding relationships between coordinates of the target matrix, akin to semisupervised learning. The effectiveness of our framework is demonstrated through competitive results for virtual drug screening and image inpainting/reconstruction. We also provide an implementation in Python to make our framework accessible on standard hardware to a broad audience.
more » « less
Full Text Available
Multiscale simulations of complex systems by learning their effective dynamics

https://doi.org/10.1038/s42256-022-00464-w

Vlachas, Pantelis R. ; Arampatzis, Georgios ; Uhler, Caroline ; Koumoutsakos, Petros ( April 2022 , Nature Machine Intelligence)

Full Text Available
Causal Structure Learning: A Combinatorial Perspective

https://doi.org/10.1007/s10208-022-09581-9

Squires, Chandler ; Uhler, Caroline ( January 2022 , Foundations of Computational Mathematics)

Abstract In this review, we discuss approaches for learning causal structure from data, also called causal discovery . In particular, we focus on approaches for learning directed acyclic graphs and various generalizations which allow for some variables to be unobserved in the available data. We devote special attention to two fundamental combinatorial aspects of causal structure learning. First, we discuss the structure of the search space over causal graphs. Second, we discuss the structure of equivalence classes over causal graphs, i.e., sets of graphs which represent what can be learned from observational data alone, and how these equivalence classes can be refined by adding interventional data.
more » « less
Full Text Available
Identifying 3D Genome Organization in Diploid Organisms via Euclidean Distance Geometry

https://doi.org/10.1137/21M1390372

Belyaeva, Anastasiya ; Kubjas, Kaie ; Sun, Lawrence J. ; Uhler, Caroline ( March 2022 , SIAM Journal on Mathematics of Data Science)

Full Text Available

« Prev Next »