How much data is sufficient to learn high-performing algorithms? Generalization guarantees for data-driven algorithm design

Balcan, M.-F.; DeBlasio, D.; Dick, T.; Kingsford, C.; Sandholm, T.; Viterick, E.

Citation Details

Algorithms often have tunable parameters that impact performance metrics such as runtime and solution quality. For many algorithms used in practice, no parameter settings admit meaningful worst-case bounds, so the parameters are made available for the user to tune. Alternatively, parameters may be tuned implicitly within the proof of a worst-case guarantee. Worst-case instances, however, may be rare or nonexistent in practice. A growing body of research has demonstrated that data-driven algorithm design can lead to significant improvements in performance. This approach uses a training set of problem instances sampled from an unknown, application-specific distribution and returns a parameter setting with strong average performance on the training set. more »

Award ID(s):: 1901403 1919453 1910321

PAR ID:: 10288821

Author(s) / Creator(s):: Balcan, M.-F.; DeBlasio, D.; Dick, T.; Kingsford, C.; Sandholm, T.; Viterick, E.

Date Published:: 2021-01-01

Journal Name:: STOC 2021: Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing

Page Range / eLocation ID:: 919–932

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this