When does gradient descent with logistic loss interpolate using deep networks with smoothed ReLU activations?
- Award ID(s):
- 2031883
- Publication Date:
- NSF-PAR ID:
- 10248696
- Journal Name:
- Proceedings of the 34th Conference on Learning Theory (COLT2021)
- Sponsoring Org:
- National Science Foundation
More Like this
No document suggestions found