Ordalia: Deep Learning Hyperparameter Search via Generalization Error Bounds Extrapolation

Buratti, Benedetto J.; Upfal, Eli

doi:10.1109/BigData47090.2019.9006144

Citation Details

Ordalia: Deep Learning Hyperparameter Search via Generalization Error Bounds Extrapolation

We introduce Ordalia, a novel approach for speeding up deep learning hyperparameter optimization search through early-pruning of less promising configurations. Our method leverages empirical and theoretical results characterizing the shape of the generalization error curve for increasing training data size and number of epochs. We show that with relatively small computational resources one can estimate the dominant parameters of neural networks' learning curves to obtain consistently good evaluations of their learning process to reliably early-eliminate non-promising configurations. By iterating this process with increasing training resources Ordalia rapidly converges to a small candidate set that includes many of the most promising configurations. We compare the performance of Ordalia with Hyperband, the state-of-the-art model-free hyperparameter optimization algorithm, and show that Ordalia consistently outperforms it on a variety of deep learning tasks. Ordalia conservative use of computational resources and ability to evaluate neural networks learning progress leads to a much better exploration and coverage of the search space, which ultimately produces superior neural network configurations. more »

Award ID(s):: 1813444 1740741

PAR ID:: 10183275

Author(s) / Creator(s):: Buratti, Benedetto J.; Upfal, Eli

Date Published:: 2019-12-01

Journal Name:: 2019 IEEE International Conference on Big Data (Big Data)

Page Range / eLocation ID:: 180 to 187

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/BigData47090.2019.9006144

More Like this