Title: On Convergence and Generalization of Dropout Training
We study dropout in two-layer neural networks with rectified linear unit (ReLU) activations. Under mild overparametrization and assuming that the limiting kernel can separate the data distribution with a positive margin, we show that dropout training with logistic loss achieves $$\epsilon$$-suboptimality in test error in $$O(1/\epsilon)$$ iterations.
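To make the object of study concrete, the following is a minimal sketch of dropout training for a two-layer ReLU network with logistic loss: at each SGD step a Bernoulli mask is applied to the hidden layer (inverted dropout). This is not the authors' code; the data, hidden width, dropout rate, learning rate, and the choice to keep the second layer fixed are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative setup: samples, input dimension, hidden width (overparametrized).
n, d, m = 200, 10, 512
p_drop, lr, steps = 0.5, 0.1, 2000

X = rng.standard_normal((n, d))
y = np.sign(X[:, 0] + 0.5 * X[:, 1])               # labels in {-1, +1} with a margin direction

W = rng.standard_normal((m, d)) / np.sqrt(d)        # trained first layer
a = rng.choice([-1.0, 1.0], size=m) / np.sqrt(m)    # fixed second layer (a common simplification)

def logistic_loss(margin):
    # numerically stable log(1 + exp(-margin))
    return np.logaddexp(0.0, -margin)

for t in range(steps):
    i = rng.integers(n)
    x, yi = X[i], y[i]
    mask = rng.binomial(1, 1.0 - p_drop, size=m) / (1.0 - p_drop)  # inverted dropout mask
    z = W @ x                                       # pre-activations
    h = np.maximum(z, 0.0) * mask                   # ReLU features with dropout
    out = a @ h
    # gradient of the logistic loss with respect to W (second layer held fixed)
    g_out = -yi / (1.0 + np.exp(yi * out))
    g_z = g_out * a * mask * (z > 0)
    W -= lr * np.outer(g_z, x)
    if (t + 1) % 500 == 0:
        margins = y * (np.maximum(X @ W.T, 0.0) @ a)
        print(f"step {t+1}: avg logistic loss {logistic_loss(margins).mean():.3f}")

# evaluation with dropout turned off
pred = np.sign(np.maximum(X @ W.T, 0.0) @ a)
print("classification error:", np.mean(pred != y))
```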
Award ID(s): 1943251
PAR ID: 10213691
Author(s) / Creator(s):
Date Published:
Journal Name: Advances in Neural Information Processing Systems
ISSN: 1049-5258
Format(s): Medium: X
Sponsoring Org: National Science Foundation
More Like this
  1. We investigate the capacity control provided by dropout in various machine learning problems. First, we study dropout for matrix completion, where it induces a distribution-dependent regularizer that equals the weighted trace-norm of the product of the factors. In deep learning, we show that the distribution-dependent regularizer due to dropout directly controls the Rademacher complexity of the underlying class of deep neural networks. These developments enable us to give concrete generalization error bounds for the dropout algorithm both in matrix completion and in training deep neural networks. (A toy sketch of dropout for matrix factorization appears after this list.)
  2. We consider the problem of estimating the spectral density of the normalized adjacency matrix of an $$n$$-node undirected graph. We provide a randomized algorithm that, with $$O(n\epsilon^{-2})$$ queries to a degree and neighbor oracle and in $$O(n\epsilon^{-3})$$ time, estimates the spectrum up to $$\epsilon$$ accuracy in the Wasserstein-1 metric. This improves on previous state-of-the-art methods, including an $$O(n\epsilon^{-7})$$-time algorithm from [Braverman et al., STOC 2022] and, for sufficiently small $$\epsilon$$, a $$2^{O(\epsilon^{-1})}$$-time method from [Cohen-Steiner et al., KDD 2018]. To achieve this result, we introduce a new notion of graph sparsification, which we call \emph{nuclear sparsification}. We provide an $$O(n\epsilon^{-2})$$-query and $$O(n\epsilon^{-2})$$-time algorithm for computing $$O(n\epsilon^{-2})$$-sparse nuclear sparsifiers. We show that this bound is optimal in both its sparsity and query complexity, and we separate our results from the related notion of additive spectral sparsification. Of independent interest, we show that our sparsification method also yields the first \emph{deterministic} algorithm for spectral density estimation that scales linearly with $$n$$ (sublinear in the representation size of the graph). (A small worked example of the quantity being estimated appears after this list.)
  3. We determine the exact minimax rate of a Gaussian sequence model under bounded convex constraints, purely in terms of the local geometry of the given constraint set $$K$$. Our main result shows that the minimax risk (up to constant factors) under the squared $$L_2$$ loss is given by $$\epsilon^{*2} \wedge \operatorname{diam}(K)^2$$ with \begin{align*} \epsilon^* = \sup \bigg\{\epsilon : \frac{\epsilon^2}{\sigma^2} \leq \log M^{\operatorname{loc}}(\epsilon)\bigg\}, \end{align*} where $$\log M^{\operatorname{loc}}(\epsilon)$$ denotes the local entropy of the set $$K$$, and $$\sigma^2$$ is the variance of the noise. We utilize our abstract result to re-derive known minimax rates for some special sets $$K$$ such as hyperrectangles, ellipses, and more generally quadratically convex orthosymmetric sets. Finally, we extend our results to the unbounded case with known $$\sigma^2$$ to show that the minimax rate in that case is $$\epsilon^{*2}$$. (A one-dimensional sanity check of this rate appears after this list.)
  4. Random dropout has become a standard regularization technique in artificial neural networks (ANNs), but it is currently unknown whether an analogous mechanism exists in biological neural networks (BioNNs). If it does, its structure is likely to be optimized by hundreds of millions of years of evolution, which may suggest novel dropout strategies in large-scale ANNs. We propose that the brain's serotonergic fibers (axons) meet some of the expected criteria because of their ubiquitous presence, stochastic structure, and ability to grow throughout the individual's lifespan. Since the trajectories of serotonergic fibers can be modeled as paths of anomalous diffusion processes, in this proof-of-concept study we investigated a dropout algorithm based on the superdiffusive fractional Brownian motion (FBM). The results demonstrate that serotonergic fibers can potentially implement a dropout-like mechanism in brain tissue, supporting neuroplasticity. They also suggest that mathematical theories of the structure and dynamics of serotonergic fibers can contribute to the design of dropout algorithms in ANNs. (A toy sketch of an FBM-driven dropout mask appears after this list.)
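For item 1, here is a minimal sketch, not the authors' algorithm, of dropout applied to a rank-$$r$$ matrix factorization for matrix completion: at each step the $$r$$ latent dimensions are dropped independently, which is the mechanism whose induced regularizer the paper relates to a weighted trace norm. The matrix sizes, rank, observation rate, dropout rate, and step size are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy matrix-completion setup; sizes, rank, and hyperparameters are illustrative.
n1, n2, r_true, r = 60, 50, 3, 10
M = rng.standard_normal((n1, r_true)) @ rng.standard_normal((r_true, n2))
observed = rng.random((n1, n2)) < 0.3            # Bernoulli observation mask

U = 0.1 * rng.standard_normal((n1, r))
V = 0.1 * rng.standard_normal((n2, r))
p_drop, lr, steps = 0.5, 0.02, 3000

rows, cols = np.nonzero(observed)
for t in range(steps):
    k = rng.integers(len(rows))
    i, j = rows[k], cols[k]
    b = rng.binomial(1, 1.0 - p_drop, size=r) / (1.0 - p_drop)  # drop latent dimensions
    err = (U[i] * b) @ V[j] - M[i, j]
    U[i] -= lr * err * b * V[j]                  # SGD on the squared loss of the masked prediction
    V[j] -= lr * err * b * U[i]

rmse = np.sqrt(np.mean((U @ V.T - M)[~observed] ** 2))
print("held-out RMSE:", rmse)
```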
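For item 2, the following small example computes the quantity being estimated exactly on a toy graph: the empirical distribution of the eigenvalues of the normalized adjacency matrix $$D^{-1/2} A D^{-1/2}$$. It only pins down the object of study; it does not implement the paper's sublinear-time estimator or nuclear sparsification, and the Erdos-Renyi graph is an arbitrary choice for illustration.

```python
import numpy as np

rng = np.random.default_rng(2)

# Toy undirected graph: Erdos-Renyi adjacency matrix.
n, p_edge = 300, 0.05
A = np.triu(rng.random((n, n)) < p_edge, k=1).astype(float)
A = A + A.T

deg = A.sum(axis=1)
deg[deg == 0] = 1.0                               # guard against isolated vertices
D_inv_sqrt = np.diag(1.0 / np.sqrt(deg))
A_norm = D_inv_sqrt @ A @ D_inv_sqrt              # normalized adjacency; eigenvalues lie in [-1, 1]

# Exact spectral density (eigenvalue histogram); the paper approximates this
# up to epsilon in Wasserstein-1 distance without full eigendecomposition.
eigvals = np.linalg.eigvalsh(A_norm)
hist, edges = np.histogram(eigvals, bins=20, range=(-1.0, 1.0), density=True)
for lo, hi, h in zip(edges[:-1], edges[1:], hist):
    print(f"[{lo:+.2f}, {hi:+.2f}): {h:.3f}")
```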
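For item 3, a quick one-dimensional sanity check may help to read the rate. This heuristic is added here and is not taken from the paper; it treats the constant in the local-entropy definition loosely and only checks that the formula recovers the classical bounded-normal-mean rate up to constants.

```latex
\documentclass{article}
\usepackage{amsmath,amssymb}
\begin{document}
% Heuristic check of $\epsilon^{*2} \wedge \operatorname{diam}(K)^2$ for
% $K = [-\tau,\tau] \subset \mathbb{R}$ (bounded normal mean).
Take $K = [-\tau, \tau]$, so $\operatorname{diam}(K) = 2\tau$. For $\epsilon \le \tau$,
the set $K \cap B(x, \epsilon)$ is an interval of length between $\epsilon$ and $2\epsilon$,
so the number of points in it separated by a constant fraction of $\epsilon$ is bounded
above and below by absolute constants; hence $\log M^{\operatorname{loc}}(\epsilon) \asymp 1$.
The condition $\epsilon^2/\sigma^2 \le \log M^{\operatorname{loc}}(\epsilon)$ therefore holds
up to $\epsilon$ of order $\sigma$ (or until $\epsilon$ exceeds the scale of $K$), so
\begin{align*}
\epsilon^{*2} \wedge \operatorname{diam}(K)^2 \;\asymp\; \sigma^2 \wedge \tau^2,
\end{align*}
which is the classical minimax rate for estimating a bounded normal mean.
\end{document}
```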
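For item 4, here is a toy sketch of a dropout-like mask driven by superdiffusive fractional Brownian motion; the construction is my own illustration, not the authors' model. A 2-D FBM path with Hurst exponent $$H > 1/2$$ is sampled exactly via the Cholesky factor of the FBM covariance, and hidden units whose grid cells the path visits are dropped for that step. The grid size, Hurst exponent, and path length are assumptions.

```python
import numpy as np

rng = np.random.default_rng(3)

def fbm_path(n_steps, hurst):
    """Exact 1-D fractional Brownian motion at t = 1..n_steps via Cholesky of its covariance."""
    t = np.arange(1, n_steps + 1, dtype=float)
    cov = 0.5 * (t[:, None] ** (2 * hurst) + t[None, :] ** (2 * hurst)
                 - np.abs(t[:, None] - t[None, :]) ** (2 * hurst))
    L = np.linalg.cholesky(cov + 1e-10 * np.eye(n_steps))
    return L @ rng.standard_normal(n_steps)

def to_grid(path, grid):
    """Map continuous path coordinates onto grid indices 0..grid-1."""
    lo, hi = path.min(), path.max()
    return np.clip(((path - lo) / (hi - lo + 1e-12) * (grid - 1)).round().astype(int),
                   0, grid - 1)

def fbm_dropout_mask(grid=24, hurst=0.8, n_steps=300):
    """Drop the units of a grid x grid layer whose cells a 2-D superdiffusive FBM path visits."""
    x = to_grid(fbm_path(n_steps, hurst), grid)   # independent FBM in each coordinate
    y = to_grid(fbm_path(n_steps, hurst), grid)
    mask = np.ones((grid, grid))
    mask[x, y] = 0.0                              # visited cells are dropped
    return mask

mask = fbm_dropout_mask()
print("fraction of units dropped:", 1.0 - mask.mean())
```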