NSF PAR Search | NSF Public Access Repository

Robust and differentially private mean estimation

Liu, Xiyang; Kong, Weihao; Kakade, Sham; Oh, Sewoong (January 2021, Advances in neural information processing systems)

Full Text Available
Gradient Inversion with Generative Image Prior

Jeon, Jinwoo; Kim, Jaechang; Lee, Kangwook; Oh, Sewoong; Ok, Jungseul (January 2021, Advances in neural information processing systems)

Full Text Available
Physical Layer Communication via Deep Learning

https://doi.org/10.1109/JSAIT.2020.2991562

Kim, Hyeji; Oh, Sewoong; Viswanath, Pramod (May 2020, IEEE Journal on Selected Areas in Information Theory)

Full Text Available
LEARN Codes: Inventing Low-Latency Codes via Recurrent Neural Networks

https://doi.org/10.1109/JSAIT.2020.2988577

Jiang, Yihan; Kim, Hyeji; Asnani, Himanshu; Kannan, Sreeram; Oh, Sewoong; Viswanath, Pramod (May 2020, IEEE Journal on Selected Areas in Information Theory)

Full Text Available
Deepcode: Feedback Codes via Deep Learning

https://doi.org/10.1109/JSAIT.2020.2986752

Kim, Hyeji; Jiang, Yihan; Kannan, Sreeram; Oh, Sewoong; Viswanath, Pramod (May 2020, IEEE Journal on Selected Areas in Information Theory)

Full Text Available
PacGAN: The Power of Two Samples in Generative Adversarial Networks

https://doi.org/10.1109/JSAIT.2020.2983071

Lin, Zinan; Khetan, Ashish; Fanti, Giulia; Oh, Sewoong (May 2020, IEEE Journal on Selected Areas in Information Theory)

Full Text Available
Meta-learning for Mixed Linear Regression

Weihao Kong, Raghav Somani (January 2020, International Conference on Machine Learning)
In modern supervised learning, there are a large number of tasks, but many of them are associated with only a small amount of labelled data. These include data from medical image processing and robotic interaction. Even though each individual task cannot be meaningfully trained in isolation, one seeks to meta-learn across the tasks from past experiences by exploiting some similarities. We study a fundamental question of interest: When can abundant tasks with small data compensate for lack of tasks with big data? We focus on a canonical scenario where each task is drawn from a mixture of $k$ linear regressions, and identify sufficient conditions for such a graceful exchange to hold; there is little loss in sample complexity even when we only have access to small data tasks. To this end, we introduce a novel spectral approach and show that we can efficiently utilize small data tasks with the help of $\tilde{\Omega}(k^{3/2})$ medium data tasks each with $\tilde{\Omega}(k^{1/2})$ examples.
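As a rough illustration of the setting described in this abstract, the sketch below draws one task from a mixture of linear regressions: a regressor is picked at random, then a handful of labelled examples are generated from it. The function name `sample_task`, the Gaussian covariates, and the Gaussian label noise are assumptions made for illustration; the abstract does not specify them.

```python
import random


def sample_task(regressors, weights, num_examples, noise_std=0.1, rng=random):
    """Draw one task from a mixture of linear regressions (illustrative only).

    regressors: list of coefficient vectors, one per mixture component.
    weights: mixing weights used to pick the component for this task.
    Covariates and noise are Gaussian here purely as an assumption.
    """
    component = rng.choices(range(len(regressors)), weights=weights, k=1)[0]
    beta = regressors[component]
    examples = []
    for _ in range(num_examples):
        x = [rng.gauss(0.0, 1.0) for _ in beta]
        y = sum(b * xi for b, xi in zip(beta, x)) + rng.gauss(0.0, noise_std)
        examples.append((x, y))
    return component, examples


# A "small data" task: only three labelled examples from one hidden component.
component, examples = sample_task(
    regressors=[[1.0, 0.0], [0.0, 1.0]],
    weights=[0.5, 0.5],
    num_examples=3,
    rng=random.Random(0),
)
```

The learner sees only `examples`, not `component`, which is why a single small task is hard to fit in isolation.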
Full Text Available
Projection Efficient Subgradient Method and Optimal Nonsmooth Frank-Wolfe Method

Kiran K. Thekumparampil, Prateek Jain (January 2020, Advances in Neural Information Processing Systems 33 (NeurIPS 2020))
Full Text Available
SPECTRE: Defending against backdoor attacks using robust covariance estimation

Jonathan Hayase, Weihao Kong (January 2020, International Conference on Machine Learning)
Full Text Available
InfoGAN-CR and ModelCentrality: Self-supervised Model Training and Selection for Disentangling GANs

Zinan Lin, Kiran Thekumparampil (January 2020, International Conference on Machine Learning)
Disentangled generative models map a latent code vector to a target space, while enforcing that a subset of the learned latent codes are interpretable and associated with distinct properties of the target distribution. Recent advances have been dominated by Variational AutoEncoder (VAE)-based methods, while training disentangled generative adversarial networks (GANs) remains challenging. In this work, we show that the dominant challenges facing disentangled GANs can be mitigated through the use of self-supervision. We make two main contributions. First, we design a novel approach for training disentangled GANs with self-supervision. We propose a contrastive regularizer, which is inspired by a natural notion of disentanglement: latent traversal. This achieves higher disentanglement scores than state-of-the-art VAE- and GAN-based approaches. Second, we propose an unsupervised model selection scheme called ModelCentrality, which uses generated synthetic samples to compute the medoid (multi-dimensional generalization of median) of a collection of models. The current common practice of hyper-parameter tuning requires using ground-truth samples, each labelled with known, perfectly disentangled latent codes. As real datasets are not equipped with such labels, we propose an unsupervised model selection scheme and show that it finds a model close to the best one, for both VAEs and GANs. Combining contrastive regularization with ModelCentrality, we improve upon the state-of-the-art disentanglement scores significantly, without accessing the supervised data.
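The medoid mentioned in this abstract can be illustrated with a small sketch: given pairwise distances between items, the medoid is the item whose total distance to all the others is smallest. The helper `medoid_index` and the distance values are hypothetical; the abstract does not say how distances between models are computed.

```python
from typing import Sequence


def medoid_index(distances: Sequence[Sequence[float]]) -> int:
    """Return the index of the item with the smallest summed distance to all items."""
    totals = [sum(row) for row in distances]
    return min(range(len(totals)), key=totals.__getitem__)


# Hypothetical symmetric pairwise distances between four trained models.
pairwise = [
    [0.0, 1.0, 2.0, 6.0],
    [1.0, 0.0, 1.5, 5.0],
    [2.0, 1.5, 0.0, 5.5],
    [6.0, 5.0, 5.5, 0.0],
]
selected = medoid_index(pairwise)  # row sums are 9.0, 7.5, 9.0, 16.5
```

Unlike a mean, the medoid is always one of the original items, so it can be used to select an existing model from a collection.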
Full Text Available
