Search for: All records

Creators/Authors contains: "Vempala, Santosh S."

Total Resources: 19

Resource Type
Conference Paper: 16
Conference Proceeding: 0
Dataset: 0
Journal Article: 3
Workshop Report: 0

Availability
Full Text / Resource Available: 19
Citation Only: 0

Beyond Moments: Robustly Learning Affine Transformations with Asymptotically Optimal Error

https://doi.org/10.1109/FOCS57990.2023.00147

Jia, He ; Kothari, Pravesh K. ; Vempala, Santosh S. ( November 2023 , IEEE)
The k-cap Process on Geometric Random Graphs

Reid, Mirabel ; Vempala, Santosh S. ( January 2023 , Conference on Learning Theory)

Full Text Available
Is Planted Coloring Easier than Planted Clique?

Kothari, Pravesh ; Vempala, Santosh S. ; Wein, Alex ; Xu, Jeff ( January 2023 , Conference on Learning Theory)

Full Text Available
Condition-number-independent convergence rate of Riemannian Hamiltonian Monte Carlo with numerical integrators

Kook, Yunbum ; Lee, Yin Tat ; Shen, Ruoqi ; Vempala, Santosh S. ( January 2023 , Conference on Learning Theory)

Full Text Available
Socially fair network design via iterative rounding

https://doi.org/10.1016/j.orl.2022.07.011

Laddha, Aditi ; Singh, Mohit ; Vempala, Santosh S. ( September 2022 , Operations Research Letters)

Full Text Available
Assemblies of neurons learn to classify well-separated distributions

Dabagia, Max ; Papadimitriou, Christos ; Vempala, Santosh S. ( June 2022 , Conference on Learning Theory)

An assembly is a large population of neurons whose synchronous firing represents a memory, concept, word, or other cognitive category. Assemblies are believed to provide a bridge between high-level cognitive phenomena and low-level neural activity. Recently, a computational system called the Assembly Calculus (AC), with a repertoire of biologically plausible operations on assemblies, has been shown capable of simulating arbitrary space-bounded computation, and also of simulating complex cognitive phenomena such as language, reasoning, and planning. However, the mechanism whereby assemblies can mediate learning has not been known. Here we present such a mechanism, and prove rigorously that, for simple classification problems defined on distributions of labeled assemblies, a new assembly representing each class can be reliably formed in response to a few stimuli from the class; this assembly is henceforth reliably recalled in response to new stimuli from the same class. Furthermore, such class assemblies will be distinguishable as long as the respective classes are reasonably separated, for example when they are clusters of similar assemblies, or more generally separable with margin by a linear threshold function. To prove these results, we draw on random graph theory with dynamic edge weights to estimate sequences of activated vertices, yielding strong generalizations of previous calculations and theorems in this field over the past five years. These theorems are backed up by experiments demonstrating the successful formation of assemblies which represent concept classes on synthetic data drawn from such distributions, and also on MNIST, which lends itself to classification through one assembly per digit. Seen as a learning algorithm, this mechanism is entirely online, generalizes from very few samples, and requires only mild supervision: all key attributes of learning in a model of the brain.
We argue that this learning mechanism, supported by separate sensory pre-processing mechanisms for extracting attributes, such as edges or phonemes, from real world data, can be the basis of biological learning in cortex.
Full Text Available
Robustly learning mixtures of k arbitrary Gaussians

https://doi.org/10.1145/3519935.3519953

Bakshi, Ainesh ; Diakonikolas, Ilias ; Jia, He ; Kane, Daniel M. ; Kothari, Pravesh K. ; Vempala, Santosh S. ( June 2022 , Symposium on Theory of Computing)

Full Text Available
How and When Random Feedback Works: A Case Study of Low-Rank Matrix Factorization.

Garg, Shivam ; Vempala, Santosh S. ( January 2022 , AISTATS)

The success of gradient descent in ML and especially for learning neural networks is remarkable and robust. In the context of how the brain learns, one aspect of gradient descent that appears biologically difficult to realize (if not implausible) is that its updates rely on feedback from later layers to earlier layers through the same connections. Such bidirected links are relatively few in brain networks, and even when reciprocal connections exist, they may not be equi-weighted. Random Feedback Alignment (Lillicrap et al., 2016), where the backward weights are random and fixed, has been proposed as a bio-plausible alternative and found to be effective empirically. We investigate how and when feedback alignment (FA) works, focusing on one of the most basic problems with layered structure: low-rank matrix factorization. Given a matrix $Y_{n \times m}$, the goal is to find a low rank factorization $Z_{n \times r} W_{r \times m}$ that minimizes the error $\|ZW - Y\|_F$. Gradient descent solves this problem optimally. We show that FA finds the optimal solution when $r \ge \mathrm{rank}(Y)$. We also shed light on how FA works. It is observed empirically that the forward weight matrices and (random) feedback matrices come closer during FA updates. Our analysis rigorously derives this phenomenon and shows how it facilitates convergence of FA*, a closely related variant of FA. We also show that FA can be far from optimal when r …
Full Text Available
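
The difference between gradient descent and feedback alignment on the factorization problem in the abstract above can be illustrated with a minimal NumPy sketch. It is not the authors' code. The function names (`loss`, `factorize`, `make_low_rank`), the step size, the initialization scale, and the choice to treat W as the earlier layer (so that the fixed random matrix `B` stands in for the transpose of Z in the W update) are all assumptions made for illustration, and no convergence claim is made for these particular settings.

```python
import numpy as np


def loss(Z, W, Y):
    """Squared Frobenius error 0.5 * ||ZW - Y||_F^2."""
    return 0.5 * np.linalg.norm(Z @ W - Y, "fro") ** 2


def make_low_rank(n, m, rank, seed=1):
    """Random n x m matrix of the given rank, scaled to spectral norm 1."""
    rng = np.random.default_rng(seed)
    Y = rng.standard_normal((n, rank)) @ rng.standard_normal((rank, m))
    return Y / np.linalg.norm(Y, 2)


def factorize(Y, r, use_feedback_alignment, steps=2000, lr=0.05, seed=0):
    """Fit Y ~ ZW with gradient descent or (assumed form of) feedback alignment.

    Gradient descent propagates the error E = ZW - Y back to W through Z.T.
    Feedback alignment replaces Z.T with a fixed random matrix B.
    """
    rng = np.random.default_rng(seed)
    n, m = Y.shape
    Z = 0.1 * rng.standard_normal((n, r))
    W = 0.1 * rng.standard_normal((r, m))
    # Hypothetical feedback matrix: drawn once, never updated.
    B = rng.standard_normal((r, n)) / np.sqrt(n)

    for _ in range(steps):
        E = Z @ W - Y                      # residual, shape (n, m)
        backward = B if use_feedback_alignment else Z.T
        grad_W = backward @ E              # shape (r, m)
        grad_Z = E @ W.T                   # shape (n, r), same in both modes
        W = W - lr * grad_W
        Z = Z - lr * grad_Z
    return Z, W
```

Calling `factorize(Y, r, False)` and `factorize(Y, r, True)` on the same `Y = make_low_rank(...)` and comparing `loss(Z, W, Y)` gives a quick way to explore how the two update rules behave as `r` varies relative to the rank of `Y`.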
Provable Lifelong Learning of Representations

Cao, Xinyuan ; Liu, Weiyang ; Vempala, Santosh S. ( January 2022 , AISTATS)

In lifelong learning, tasks (or classes) to be learned arrive sequentially over time in arbitrary order. During training, knowledge from previous tasks can be captured and transferred to subsequent ones to improve sample efficiency. We consider the setting where all target tasks can be represented in the span of a small number of unknown linear or nonlinear features of the input data. We propose a lifelong learning algorithm that maintains and refines the internal feature representation. We prove that for any desired accuracy on all tasks, the dimension of the representation remains close to that of the underlying representation. The resulting sample complexity improves significantly on existing bounds. In the setting of linear features, our algorithm is provably efficient and the sample complexity for input dimension d, m tasks with k features up to error ϵ is $\tilde{O}(dk^{1.5}/\epsilon + km/\epsilon)$. We also prove a matching lower bound for any lifelong learning algorithm that uses a single task learner as a black box. We complement our analysis with an empirical study, including a heuristic lifelong learning algorithm for deep neural networks. Our method performs favorably on challenging realistic image datasets compared to state-of-the-art continual learning methods.
Full Text Available
A Unified Approach to Discrepancy Minimization

Bansal, Nikhil ; Laddha, Aditi ; Vempala, Santosh S. ( January 2022 , RANDOM-APPROX)

We study a unified approach and algorithm for constructive discrepancy minimization based on a stochastic process. By varying the parameters of the process, one can recover various state-of-the-art results. We demonstrate the flexibility of the method by deriving a discrepancy bound for smoothed instances, which interpolates between known bounds for worst-case and random instances.
Full Text Available
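
For readers unfamiliar with the quantity being minimized in the abstract above, here is a minimal sketch of discrepancy for a 0/1 incidence matrix under a ±1 coloring. It uses exhaustive search on a tiny instance and is not the stochastic-process algorithm the paper studies; the function names `discrepancy` and `brute_force_min_discrepancy` are hypothetical.

```python
from itertools import product

import numpy as np


def discrepancy(A, x):
    """Largest absolute row sum of A @ x for a +/-1 coloring x."""
    return int(np.max(np.abs(A @ x)))


def brute_force_min_discrepancy(A):
    """Try every +/-1 coloring of the columns; feasible only for tiny n."""
    n = A.shape[1]
    best_x, best = None, None
    for signs in product((-1, 1), repeat=n):
        x = np.array(signs)
        d = discrepancy(A, x)
        if best is None or d < best:
            best_x, best = x, d
    return best_x, best


# Rows are sets over 4 elements; entry 1 means the element is in the set.
A = np.array([
    [1, 1, 0, 0],
    [0, 1, 1, 0],
    [0, 0, 1, 1],
    [1, 1, 1, 1],
])
```

The search is exponential in the number of columns, which is why constructive methods such as the one described in the abstract matter for instances of realistic size.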