Solving All-Pairs Shortest-Paths Problem in Large Graphs Using Apache Spark

Schoeneman, Frank; Zola, Jaroslaw

doi:10.1145/3337821.3337852

Algorithms for computing All-Pairs Shortest-Paths (APSP) are critical building blocks underlying many practical applications. The standard sequential algorithms, such as Floyd-Warshall and Johnson, quickly become infeasible for large input graphs, necessitating parallel approaches. In this work, we propose, implement and thoroughly analyse different strategies for APSP on distributed memory clusters with Apache Spark. Our solvers are designed for large undirected weighted graphs, and differ in complexity and degree of reliance on techniques outside of pure Spark API. We demonstrate that the best performing solver is able to handle APSP problems with over 200,000 vertices on a 1024-core cluster. However, it requires auxiliary shared persistent storage to compensate for missing Spark functionality.
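For orientation, the sequential baseline named in the abstract can be sketched as follows. This is a minimal textbook Floyd-Warshall in plain Python, not one of the Spark solvers from the paper; the function name `floyd_warshall`, the `(u, v, w)` edge format, and the assumption of non-negative weights are illustrative choices that do not come from the source.

```python
import math


def floyd_warshall(n, edges):
    """Return the n x n matrix of shortest-path distances.

    n     -- number of vertices, labelled 0..n-1 (assumed labelling)
    edges -- iterable of (u, v, w) for an undirected graph with
             non-negative weights (assumption for this sketch)
    """
    dist = [[math.inf] * n for _ in range(n)]
    for i in range(n):
        dist[i][i] = 0.0
    for u, v, w in edges:
        # Undirected: store both directions; keep the lighter parallel edge.
        if w < dist[u][v]:
            dist[u][v] = dist[v][u] = w

    # Allow each vertex k in turn as an intermediate hop: O(n^3) time.
    for k in range(n):
        row_k = dist[k]
        for i in range(n):
            d_ik = dist[i][k]
            if d_ik == math.inf:
                continue  # no path i -> k, so k cannot improve row i
            row_i = dist[i]
            for j in range(n):
                candidate = d_ik + row_k[j]
                if candidate < row_i[j]:
                    row_i[j] = candidate
    return dist


if __name__ == "__main__":
    sample_edges = [(0, 1, 4.0), (1, 2, 1.0), (0, 2, 7.0), (2, 3, 2.0)]
    for row in floyd_warshall(4, sample_edges):
        print(row)
```

The triple loop costs O(n^3) time and the dense distance matrix O(n^2) memory. At 200,000 vertices the matrix alone has 4 × 10^10 entries, or roughly 320 GB at 8 bytes per entry before exploiting symmetry, which illustrates why the abstract describes the sequential algorithms as infeasible at that scale.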

Award ID(s): 1910539

NSF-PAR ID: 10145329

Author(s) / Creator(s): Schoeneman, Frank; Zola, Jaroslaw

Date Published: 2019-01-01

Journal Name: Proceedings of the 48th International Conference on Parallel Processing

Page Range / eLocation ID: 1 to 10

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper: https://doi.org/10.1145/3337821.3337852
