GreedyML: A Parallel Algorithm for Maximizing Constrained Submodular Functions

Gopal, Shivaram; Ferdous, S M; Pothen, Alex; Maji, Hemanta

doi:10.4230/LIPICS.SEA.2025.19

Citation Details

GreedyML: A Parallel Algorithm for Maximizing Constrained Submodular Functions

We describe a parallel approximation algorithm for maximizing monotone submodular functions subject to hereditary constraints on distributed memory multiprocessors. Our work is motivated by the need to solve submodular optimization problems on massive data sets, for practical contexts such as data summarization, machine learning, and graph sparsification. Our work builds on the randomized distributed RandGreeDI algorithm, proposed by Barbosa, Ene, Nguyen, and Ward (2015). This algorithm computes a distributed solution by randomly partitioning the data among all the processors and then employing a single accumulation step in which all processors send their partial solutions to one processor. However, for large problems, the accumulation step exceeds the memory available on a processor, and the processor which performs the accumulation becomes a computational bottleneck. Hence we propose a generalization of the RandGreeDI algorithm that employs multiple accumulation steps to reduce the memory required. We analyze the approximation ratio and the time complexity of the algorithm (in the BSP model). We evaluate the new GreedyML algorithm on three classes of problems, and report results from large-scale data sets with millions of elements. The results show that the GreedyML algorithm can solve problems where the sequential Greedy and distributed RandGreeDI algorithms fail due to memory constraints. For certain computationally intensive problems, the GreedyML algorithm is faster than the RandGreeDI algorithm. The observed approximation quality of the solutions computed by the GreedyML algorithm closely matches those obtained by the RandGreeDI algorithm on these problems. more »

Award ID(s):: 2055605 2327981

PAR ID:: 10630669

Author(s) / Creator(s):: Gopal, Shivaram; Ferdous, S M; Pothen, Alex; Maji, Hemanta

Editor(s):: Mutzel, Petra; Prezza, Nicola

Publisher / Repository:: Schloss Dagstuhl – Leibniz-Zentrum für Informatik

Date Published:: 2025-01-01

Volume:: 338

ISSN:: 1868-8969

ISBN:: 978-3-95977-375-1

Page Range / eLocation ID:: 19:1-19:21

Subject(s) / Keyword(s):: Combinatorial optimization submodular functions distributed algorithms approximation algorithms data summarization Computing methodologies → Distributed algorithms Theory of computation → Distributed algorithms Theory of computation → Facility location and clustering Theory of computation → Packing and covering problems Theory of computation → Nearest neighbor algorithms Theory of computation → Divide and conquer Theory of computation → Sparsification and spanners Theory of computation → Discrete optimization Computing methodologies → Feature selection

Format(s):: Medium: X Size: 21 pages; 1597453 bytes Other: application/pdf

Size(s):: 21 pages 1597453 bytes

Right(s):: Creative Commons Attribution 4.0 International license; info:eu-repo/semantics/openAccess

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.4230/LIPICS.SEA.2025.19

More Like this