Search for: All records

Award ID contains: 1955981

« Prev Next »

Total Resources

13

Resource Type
Conference Paper

10

Conference Proceeding

0

Dataset

0

Journal Article

3

Workshop Report

0

Availability
Full Text / Resource Available

11

Citation Only

2

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Scaffolding a Student to Instill Knowledge

Anil Kag, Durmus Alp ( May 2023 , International Conference on Learning Representations)

We propose a novel knowledge distillation (KD) method to selectively instill teacher knowledge into a student model motivated by situations where the student’s capacity is significantly smaller than that of the teachers. In vanilla KD, the teacher primarily sets a predictive target for the student to follow, and we posit that this target is overly optimistic due to the student’s lack of capacity. We develop a novel scaffolding scheme where the teacher, in addition to setting a predictive target, also scaffolds the student’s prediction by censoring hard-to-learn examples. The student model utilizes the same information as the teacher’s soft-max predictions as inputs, and in this sense, our proposal can be viewed as a natural variant of vanilla KD. We show on synthetic examples that censoring hard-examples leads to smoothening the student’s loss landscape so that the student encounters fewer local minima. As a result, it has good generalization properties. Against vanilla KD, we achieve improved performance and are comparable to more intrusive techniques that leverage feature matching on benchmark datasets.
more » « less
Free, publicly-accessible full text available May 1, 2024
Efficient Edge Inference by Selective Query

Anil Kag, Igor Fedorov ( May 2023 , International Conference on Learning Representations)

Edge devices provide inference on predictive tasks to many end-users. However, deploying deep neural networks that achieve state-of-the-art accuracy on these devices is infeasible due to edge resource constraints. Nevertheless, cloud-only processing, the de-facto standard, is also problematic, since uploading large amounts of data imposes severe communication bottlenecks. We propose a novel end-to-end hybrid learning framework that allows the edge to selectively query only those hard examples that the cloud can classify correctly. Our framework optimizes over neural architectures and trains edge predictors and routing models so that the overall accuracy remains high while minimizing the overall latency. Training a hybrid learner is difficult since we lack annotations of hard edge-examples. We introduce a novel proxy supervision in this context and show that our method adapts seamlessly and near optimally across different latency regimes. On the ImageNet dataset, our proposed method deployed on a micro-controller unit exhibits 25% reduction in latency compared to cloud-only processing while suffering no excess loss.
more » « less
Free, publicly-accessible full text available May 1, 2024
Ideology Prediction from Scarce and Biased Supervision: Learn to Disregard the “What” and Focus on the “How”!

https://doi.org/10.18653/v1/2023.acl-long.530

Chen, Chen ; Walker, Dylan ; Saligrama, Venkatesh ( January 2023 , Association for Computational Linguistics)

We propose a novel supervised learning approach for political ideology prediction (PIP) that is capable of predicting out-of-distribution inputs. This problem is motivated by the fact that manual data-labeling is expensive, while self-reported labels are often scarce and exhibit significant selection bias. We propose a novel statistical model that decomposes the document embeddings into a linear superposition of two vectors; a latent neutral context vector independent of ideology, and a latent position vector aligned with ideology. We train an end-to-end model that has intermediate contextual and positional vectors as outputs. At deployment time, our model predicts labels for input documents by exclusively leveraging the predicted positional vectors. On two benchmark datasets we show that our model is capable of outputting predictions even when trained with as little as 5% biased data, and is significantly more accurate than the state-of-the-art. Through crowd-sourcing we validate the neutrality of contextual vectors, and show that context filtering results in ideological concentration, allowing for prediction on out-of-distribution examples.
more » « less
Full Text Available
ActiveHedge: Hedge meets Active Learning. ICML 2022: 11694-11709

Bhuvesh Kumar, Jacob D. ( August 2022 , ICML)

We consider the classical problem of multi-class prediction with expert advice, but with an active learning twist. In this new setting the learner will only query the labels of a small number of examples, but still aims to minimize regret to the best expert as usual; the learner is also allowed a very short "burn-in" phase where it can fast-forward and query certain highly-informative examples. We design an algorithm that utilizes Hedge (aka Exponential Weights) as a subroutine, and we show that under a very particular combinatorial constraint on the matrix of expert predictions we can obtain a very strong regret guarantee while querying very few labels. This constraint, which we refer to as ζ -compactness, or just compactness, can be viewed as a non-stochastic variant of the disagreement coefficient, another popular parameter used to reason about the sample complexity of active learning in the IID setting. We also give a polynomial-time algorithm to calculate the ζ -compactness of a matrix up to an approximation factor of 3.
more » « less
Full Text Available
Detecting Correlated Gaussian Databases

Kahraman, Zeynep ; Nazer, Bobak ( July 2022 , IEEE International Symposium on Information Theory)

Full Text Available
Strategies for Safe Multi-Armed Bandits with Logarithmic Regret and Risk

Tianrui Chen, Aditya Gangrade ( July 2022 , International Conference on Machine Learning)

Full Text Available
Condensing CNNs with Partial Differential Equations

https://doi.org/10.1109/CVPR52688.2022.00069

Kag, Anil ; Saligrama, Venkatesh ( June 2022 , CVPR)

Convolutional neural networks (CNNs) rely on the depth of the architecture to obtain complex features. It results in computationally expensive models for low-resource IoT devices. Convolutional operators are local and restricted in the receptive field, which increases with depth. We explore partial differential equations (PDEs) that offer a global receptive field without the added overhead of maintaining large kernel convolutional filters. We propose a new feature layer, called the Global layer, that enforces PDE constraints on the feature maps, resulting in rich features. These constraints are solved by embedding iterative schemes in the network. The proposed layer can be embedded in any deep CNN to transform it into a shallower network. Thus, resulting in compact and computationally efficient architectures achieving similar performance as the original network. Our experimental evaluation demonstrates that architectures with global layers require 2 − 5× less computational and storage budget without any significant loss in performance
more » « less
Full Text Available
Task2Sim : Towards Effective Pre-training and Transfer from Synthetic Data

Samarth Mishra, Rameswar Panda ( May 2022 , IEEE Conference on Computer Vision and Pattern Recognition)

Full Text Available
Dopamine depletion selectively disrupts interactions between striatal neuron subtypes and LFP oscillations

https://doi.org/10.1016/j.celrep.2021.110265

Zemel, Dana ; Gritton, Howard ; Cheung, Cyrus ; Shankar, Sneha ; Kramer, Mark ; Han, Xue ( January 2022 , Cell Reports)

Full Text Available
Bandit Quickest Changepoint Detection

Gopalan, Aditya ; Lakshminarayanan, Braghadeesh ; Saligrama, Venkatesh ( December 2021 , Advances in Neural Information Processing Systems 34 (NeurIPS 2021))

Full Text Available

« Prev Next »