NSF PAR Search | NSF Public Access Repository

Deep Companion Learning: Enhancing Generalization Through Historical Consistency

https://doi.org/10.1007/978-3-031-72913-3_22

Zhu, Ruizhao; Saligrama, Venkatesh (December 2024, Springer Nature Switzerland)

Free, publicly-accessible full text available December 2, 2025

Safe Linear Bandits over Unknown Polytopes

Gangrade, Aditya; Chen, Tianrui; Saligrama, Venkatesh (July 2024, Conference on Learning Theory)

Full Text Available

Testing the Feasibility of Linear Programs with Bandit Feedback

Gangrade, Aditya; Gopalan, Aditya; Saligrama, Venkatesh; Scott, Clay (July 2024, International Conference on Machine Learning)

While the recent literature has seen a surge in the study of constrained bandit problems, all existing methods for these begin by assuming the feasibility of the underlying problem. We initiate the study of testing such feasibility assumptions, and in particular address the problem in the linear bandit setting, thus characterising the costs of feasibility testing for an unknown linear program using bandit feedback. Concretely, we test if $\exists x : Ax \ge 0$ for an unknown $A \in \mathbb{R}^{m \times d}$, by playing a sequence of actions $x_t \in \mathbb{R}^d$, and observing $Ax_t + \text{noise}$ in response. By identifying the hypothesis as determining the sign of the value of a minimax game, we construct a novel test based on low-regret algorithms and a nonasymptotic law of iterated logarithms. We prove that this test is reliable, and adapts to the ‘signal level,’ $T$, of any instance, with mean sample costs scaling as $O(d^2/T^2)$. We complement this by a minimax lower bound of $\Omega(d/T^2)$ for sample costs of reliable tests, dominating prior asymptotic lower bounds by capturing the dependence on $d$, and thus elucidating a basic insight missing in the extant literature on such problems.
Full Text Available

Learning to Drive Anywhere

Zhu, Ruizhao; Huang, Peng; Ohn-Bar, Eshed; Saligrama, Venkatesh (November 2023, Conference on Robot Learning)

Human drivers can seamlessly adapt their driving decisions across geographical locations with diverse conditions and rules of the road, e.g., left vs. right-hand traffic. In contrast, existing models for autonomous driving have been thus far only deployed within restricted operational domains, i.e., without accounting for varying driving behaviors across locations or model scalability. In this work, we propose AnyD, a single geographically-aware conditional imitation learning (CIL) model that can efficiently learn from heterogeneous and globally distributed data with dynamic environmental, traffic, and social characteristics. Our key insight is to introduce a high-capacity geo-location-based channel attention mechanism that effectively adapts to local nuances while also flexibly modeling similarities among regions in a data-driven manner. By optimizing a contrastive imitation objective, our proposed approach can efficiently scale across the inherently imbalanced data distributions and location-dependent events. We demonstrate the benefits of our AnyD agent across multiple datasets, cities, and scalable deployment paradigms, i.e., centralized, semi-supervised, and distributed agent training. Specifically, AnyD outperforms CIL baselines by over 14% in open-loop evaluation and 30% in closed-loop testing on CARLA.
Full Text Available

Ideology Prediction from Scarce and Biased Supervision: Learn to Disregard the “What” and Focus on the “How”!

https://doi.org/10.18653/v1/2023.acl-long.530

Chen, Chen; Walker, Dylan; Saligrama, Venkatesh (January 2023, Association for Computational Linguistics)

We propose a novel supervised learning approach for political ideology prediction (PIP) that is capable of making predictions on out-of-distribution inputs. This problem is motivated by the fact that manual data-labeling is expensive, while self-reported labels are often scarce and exhibit significant selection bias. We propose a novel statistical model that decomposes the document embeddings into a linear superposition of two vectors: a latent neutral context vector independent of ideology, and a latent position vector aligned with ideology. We train an end-to-end model that has intermediate contextual and positional vectors as outputs. At deployment time, our model predicts labels for input documents by exclusively leveraging the predicted positional vectors. On two benchmark datasets we show that our model is capable of outputting predictions even when trained with as little as 5% biased data, and is significantly more accurate than the state-of-the-art. Through crowd-sourcing we validate the neutrality of contextual vectors, and show that context filtering results in ideological concentration, allowing for prediction on out-of-distribution examples.
Full Text Available

Interpretable Compositional Representations for Robust Few-Shot Generalization

https://doi.org/10.1109/TPAMI.2022.3212633

Mishra, Samarth; Zhu, Pengkai; Saligrama, Venkatesh (October 2022, IEEE Transactions on Pattern Analysis and Machine Intelligence)

Full Text Available

Condensing CNNs with Partial Differential Equations

https://doi.org/10.1109/CVPR52688.2022.00069

Kag, Anil; Saligrama, Venkatesh (June 2022, CVPR)

Convolutional neural networks (CNNs) rely on the depth of the architecture to obtain complex features, which results in models that are computationally expensive for low-resource IoT devices. Convolutional operators are local and restricted in the receptive field, which increases with depth. We explore partial differential equations (PDEs) that offer a global receptive field without the added overhead of maintaining large kernel convolutional filters. We propose a new feature layer, called the Global layer, that enforces PDE constraints on the feature maps, resulting in rich features. These constraints are solved by embedding iterative schemes in the network. The proposed layer can be embedded in any deep CNN to transform it into a shallower network, resulting in compact and computationally efficient architectures that achieve performance similar to the original network. Our experimental evaluation demonstrates that architectures with global layers require 2-5× less computational and storage budget without any significant loss in performance.
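As a rough illustration of how an iterative PDE scheme can widen the receptive field beyond a local kernel, the sketch below runs Jacobi sweeps for a 1D implicit diffusion step. This is not the paper's Global layer: the choice of PDE (plain diffusion), the 1D setting, the zero boundary, and the names `diffusion_sweeps_1d`, `alpha`, and `n_iters` are all assumptions made for the example.

```python
def diffusion_sweeps_1d(features, alpha=1.0, n_iters=5):
    """Approximately solve (1 + 2*alpha) * u[i] - alpha * (u[i-1] + u[i+1]) = f[i].

    This is one implicit diffusion step u - alpha * Laplacian(u) = f on a 1D grid
    with zero boundary values, solved with a fixed number of Jacobi sweeps.
    Hypothetical stand-in for a PDE-constrained feature layer.
    """
    n = len(features)
    u = list(features)  # start the iteration from the input features
    for _ in range(n_iters):
        nxt = []
        for i in range(n):
            left = u[i - 1] if i > 0 else 0.0
            right = u[i + 1] if i < n - 1 else 0.0
            # Jacobi update: solve the i-th equation for u[i] using old neighbours
            nxt.append((features[i] + alpha * (left + right)) / (1.0 + 2.0 * alpha))
        u = nxt
    return u


# A single active position in a 21-element feature map
impulse = [0.0] * 21
impulse[10] = 1.0

out = diffusion_sweeps_1d(impulse, n_iters=5)
# Positions influenced by the impulse after 5 sweeps
support = [i for i, v in enumerate(out) if v > 0.0]
```

Each sweep only touches immediate neighbours, yet the influence of one position spreads by one step per sweep, so the effective receptive field grows with the number of iterations rather than with kernel size or network depth.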
Full Text Available

Bandit Quickest Changepoint Detection

Gopalan, Aditya; Lakshminarayanan, Braghadeesh; Saligrama, Venkatesh (December 2021, Advances in Neural Information Processing Systems 34 (NeurIPS 2021))

Full Text Available

Time Adaptive Recurrent Neural Network

https://doi.org/10.1109/CVPR46437.2021.01490

Kag, Anil; Saligrama, Venkatesh (June 2021, CVPR)

Full Text Available

Gradient Descent for Sparse Rank-One Matrix Completion for Crowd-Sourced Aggregation of Sparsely Interacting Workers

Ma, Yao; Olshevsky, Alex; Saligrama, Venkatesh; Szepesvári, Csaba (June 2021, Journal of Machine Learning Research)

We consider worker skill estimation for the single-coin Dawid-Skene crowdsourcing model. In practice, skill estimation is challenging because worker assignments are sparse and irregular due to the arbitrary and uncontrolled availability of workers. We formulate skill estimation as a rank-one correlation-matrix completion problem, where the observed components correspond to observed label correlation between workers. We show that the correlation matrix can be successfully recovered and skills are identifiable if and only if the sampling matrix (observed components) does not have a bipartite connected component. We then propose a projected gradient descent scheme and show that skill estimates converge to the desired global optima for such sampling matrices. Our proof is original and the results are surprising in light of the fact that even the weighted rank-one matrix factorization problem is NP-hard in general. Next, we derive sample complexity bounds in terms of spectral properties of the signless Laplacian of the sampling matrix. Our proposed scheme achieves state-of-the-art performance on a number of real-world datasets.
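The rank-one completion idea can be sketched in a few lines. With binary ±1 labels and independent workers, the expected label correlation between workers i and j is the product of their skills, so skills can be fit to the observed correlations by projected gradient descent. The code below is a minimal sketch under stated assumptions, not the authors' implementation: the box projection onto [-1, 1], the step size, the all-positive initialization, the noise-free correlations, and the name `estimate_skills` are all choices made for the example.

```python
def estimate_skills(corr, n_workers, step=0.05, iters=20000, init=0.5):
    """Projected gradient descent on the sum over observed pairs (i, j)
    of (x[i] * x[j] - corr[(i, j)]) ** 2, with each x[i] clipped to [-1, 1].

    corr maps observed worker pairs (i, j) to their label correlation.
    Hypothetical helper; step, iters, and init are illustrative defaults.
    """
    x = [init] * n_workers
    for _ in range(iters):
        grad = [0.0] * n_workers
        for (i, j), c in corr.items():
            residual = x[i] * x[j] - c
            grad[i] += 2.0 * residual * x[j]
            grad[j] += 2.0 * residual * x[i]
        # Gradient step followed by projection onto the box [-1, 1]^n
        x = [min(1.0, max(-1.0, xi - step * g)) for xi, g in zip(x, grad)]
    return x


# Assumed ground-truth skills, used only to synthesize noise-free correlations
true_skills = [0.8, 0.6, 0.4, 0.2]

# Sparse sampling pattern: a triangle (0, 1, 2) plus one extra edge (2, 3).
# The triangle is an odd cycle, so the single connected component is not bipartite.
observed_pairs = [(0, 1), (0, 2), (1, 2), (2, 3)]
corr = {(i, j): true_skills[i] * true_skills[j] for i, j in observed_pairs}

estimates = estimate_skills(corr, n_workers=4)
```

Only four of the six worker pairs are observed here. The negated skill vector fits the same correlations equally well, so the positive initialization is what selects the solution in which workers are better than chance.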
Full Text Available
