Distributing Frank–Wolfe via Map-Reduce

Moharrer, Armin; Ioannidis, Stratis

doi:10.1007/s10115-018-1294-7

Citation Details

Distributing Frank–Wolfe via Map-Reduce

Large-scale optimization problems abound in data mining and machine learning applications, and the computational challenges they pose are often addressed through parallelization. We identify structural properties under which a convex optimization problem can be massively parallelized via map-reduce operations using the Frank–Wolfe (FW) algorithm. The class of problems that can be tackled this way is quite broad and includes experimental design, AdaBoost, and projection to a convex hull. Implementing FW via map-reduce eases parallelization and deployment via commercial distributed computing frameworks. We demonstrate this by implementing FW over Spark, an engine for parallel data processing, and establish that parallelization through map-reduce yields significant performance improvements: We solve problems with 20 million variables using 350 cores in 79 min; the same operation takes 48 h when executed serially. more »

Award ID(s):: 1750539

PAR ID:: 10083966

Author(s) / Creator(s):: Moharrer, Armin; Ioannidis, Stratis

Date Published:: 2018-12-18

Journal Name:: Knowledge and Information Systems

ISSN:: 0219-1377

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1007/s10115-018-1294-7

More Like this