The regional extent and spatiotemporal dynamics of Arctic permafrost disturbances remain poorly quantified. High-spatial-resolution commercial satellite imagery enables transformational opportunities to observe, map, and document the micro-topographic transitions occurring in Arctic polygonal tundra at multiple spatial and temporal frequencies. The entire Arctic has been imaged at 0.5 m or finer resolution by commercial satellite sensors, yet this imagery is still largely underutilized, and value-added Arctic science products are rare. Knowledge discovery through artificial intelligence (AI), big imagery, and high-performance computing (HPC) resources is only beginning to be realized in Arctic science. Large-scale deployment of petabyte-scale imagery resources requires sophisticated computational approaches to automated image interpretation coupled with efficient use of HPC resources. In addition to semantic complexities, a multitude of factors inherent to sub-meter-resolution satellite imagery, such as file size, dimensions, spectral channels, overlaps, spatial references, and imaging conditions, challenge the direct translation of AI-based approaches from computer vision applications. Memory limitations of graphics processing units (GPUs) necessitate partitioning an input satellite image into manageable sub-arrays, followed by parallel predictions and post-processing to reconstruct results matching the input image dimensions and spatial reference. We have developed a novel high-performance image analysis framework, the Mapping application for Arctic Permafrost Land Environment (MAPLE), that enables the integration of operational-scale GeoAI capabilities into Arctic science applications. We have designed the MAPLE workflow to be interoperable across HPC architectures while making optimal use of computing resources.
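The partition-predict-reassemble pattern described above can be illustrated with a short sketch. The snippet below is not MAPLE's actual code; it is a minimal NumPy illustration of splitting a large raster into GPU-sized tiles and stitching per-tile predictions back to the input dimensions. The tile size and overlap are hypothetical parameters, and handling of the geospatial reference (geotransform and CRS) is omitted.

```python
import numpy as np

def tile_image(image, tile_size=1024, overlap=0):
    """Split an (H, W, C) array into fixed-size tiles, recording each tile's
    origin so per-tile predictions can later be stitched back together."""
    height, width = image.shape[:2]
    step = tile_size - overlap
    tiles = []
    for y in range(0, height, step):
        for x in range(0, width, step):
            tiles.append(((y, x), image[y:y + tile_size, x:x + tile_size]))
    return tiles

def stitch_predictions(pred_tiles, out_shape):
    """Reassemble per-tile predictions into an array matching the input
    image dimensions (later tiles overwrite any overlapping pixels)."""
    output = np.zeros(out_shape, dtype=np.float32)
    for (y, x), pred in pred_tiles:
        output[y:y + pred.shape[0], x:x + pred.shape[1]] = pred
    return output

# Hypothetical usage: run a model tile-by-tile and rebuild a full-size mask.
# preds = [(origin, model.predict(patch)) for origin, patch in tile_image(img)]
# mask = stitch_predictions(preds, img.shape[:2])
```

In a real workflow the stitched array would then be written back with the source image's geotransform and coordinate reference system so the output aligns with the input scene.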
GeoMatch: Efficient Large-scale Map Matching on Apache Spark
We develop GeoMatch as a novel, scalable, and efficient big-data pipeline for large-scale map matching on Apache Spark. GeoMatch improves existing spatial big-data solutions by utilizing a novel spatial partitioning scheme inspired by Hilbert space-filling curves. Thanks to its partitioning scheme, GeoMatch can effectively balance operations across different processing units and achieve significant performance gains. GeoMatch also incorporates a dynamically adjustable error-correction technique that provides robustness against positioning errors. We demonstrate the effectiveness of GeoMatch through rigorous and extensive empirical benchmarks that consider large-scale urban spatial datasets ranging from 166,253 to 3.78B location measurements. We separately assess execution performance and accuracy of map matching and develop a benchmark framework for evaluating large-scale map matching. Results of our evaluation show up to 27.25-fold performance improvements compared to previous works while achieving better processing accuracy than current solutions. We also showcase the practical potential of GeoMatch with two urban management applications. GeoMatch and our benchmark framework are open-source.
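As a rough illustration of Hilbert-curve-based spatial partitioning (not GeoMatch's actual implementation), the sketch below maps points to cells of a uniform grid, computes their position along a Hilbert curve with the standard iterative conversion, and assigns contiguous index ranges to partitions so that nearby points tend to land in the same partition. The grid order and partition count are hypothetical parameters; GeoMatch additionally adapts partition boundaries to the observed data distribution to balance load.

```python
def hilbert_index(order, x, y):
    """Position of grid cell (x, y) along a Hilbert curve over a
    2**order x 2**order grid (standard iterative xy-to-d conversion)."""
    n = 1 << order
    d = 0
    s = n >> 1
    while s > 0:
        rx = 1 if (x & s) > 0 else 0
        ry = 1 if (y & s) > 0 else 0
        d += s * s * ((3 * rx) ^ ry)
        # Rotate/reflect the coordinates so the curve stays continuous.
        if ry == 0:
            if rx == 1:
                x, y = n - 1 - x, n - 1 - y
            x, y = y, x
        s >>= 1
    return d

def partition_of(lon, lat, bbox, order=10, num_partitions=64):
    """Assign a point to a partition by bucketing contiguous ranges of the
    Hilbert index; spatially close points usually share a bucket."""
    min_lon, min_lat, max_lon, max_lat = bbox
    n = 1 << order
    cell_x = min(int((lon - min_lon) / (max_lon - min_lon) * n), n - 1)
    cell_y = min(int((lat - min_lat) / (max_lat - min_lat) * n), n - 1)
    cells_per_partition = (n * n) // num_partitions
    return hilbert_index(order, cell_x, cell_y) // cells_per_partition

# Example: nearby GPS fixes map to the same (or adjacent) partition ids.
print(partition_of(-122.41, 37.77, (-123.0, 37.0, -122.0, 38.0)))
```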
- Award ID(s): 1827505
- NSF-PAR ID: 10286816
- Date Published:
- Journal Name: ACM/IMS Transactions on Data Science
- Volume: 1
- Issue: 3
- ISSN: 2691-1922
- Page Range / eLocation ID: 1 to 30
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
Big spatial data has become ubiquitous, from mobile applications to satellite data. In most of these applications, data is continuously growing to huge volumes. Existing systems for big spatial data organize records at either the record level or the block level. Systems that use record-level structures include key-value stores and LSM-tree stores, which support insert and delete operations and are optimized for highly selective queries. On the other hand, systems like GeoSpark that use block-level structures (e.g., 128 MB each) are more efficient for analytical queries, but they cannot incrementally maintain the partitioned data and do not support delete operations. This paper proposes a general framework that enables block-level systems to incrementally maintain spatial partitions, in the presence of bulk insertions and deletions, in distributed file system (DFS) blocks. We first formally study the incremental spatial partitioning problem for big data and demonstrate its NP-hardness. Then, we propose a cost model to estimate the performance of queries on the partitioned data and the effect of modifying it as the data grows. After that, we provide three different implementations of the incremental partitioning framework. Comprehensive experiments on large real datasets show that our proposed partitioning algorithms outperform state-of-the-art spatial partitioning methods.
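As a toy illustration of the incremental-maintenance idea (not the paper's cost-model-driven algorithms), the sketch below keeps rectangular partitions sized to a hypothetical DFS block capacity and splits any partition that overflows after a bulk insertion.

```python
BLOCK_CAPACITY = 4096  # hypothetical number of records per DFS block

class SpatialPartition:
    def __init__(self, xmin, ymin, xmax, ymax):
        self.bounds = (xmin, ymin, xmax, ymax)
        self.points = []

    def contains(self, p):
        xmin, ymin, xmax, ymax = self.bounds
        return xmin <= p[0] <= xmax and ymin <= p[1] <= ymax

    def split(self):
        """Split along the longer axis at the median point, redistributing records."""
        xmin, ymin, xmax, ymax = self.bounds
        axis = 0 if (xmax - xmin) >= (ymax - ymin) else 1
        self.points.sort(key=lambda p: p[axis])
        mid = self.points[len(self.points) // 2][axis]
        if axis == 0:
            low = SpatialPartition(xmin, ymin, mid, ymax)
            high = SpatialPartition(mid, ymin, xmax, ymax)
        else:
            low = SpatialPartition(xmin, ymin, xmax, mid)
            high = SpatialPartition(xmin, mid, xmax, ymax)
        for p in self.points:
            (low if p[axis] <= mid else high).points.append(p)
        return low, high

def bulk_insert(partitions, batch):
    """Route new records to their partitions, then split any partition that
    would no longer fit into a single DFS block."""
    for p in batch:
        next(part for part in partitions if part.contains(p)).points.append(p)
    updated = []
    for part in partitions:
        updated.extend(part.split() if len(part.points) > BLOCK_CAPACITY else [part])
    return updated
```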
Urban and environmental researchers seek to obtain building features (e.g., building shapes, counts, and areas) at large scales. However, blurriness, occlusions, and noise in prevailing satellite images severely hinder the performance of image segmentation, super-resolution, and deep-learning-based translation networks. In this article, we combine globally available satellite images with spatial geometric feature datasets to create a generative modeling framework that achieves significantly improved accuracy in per-building feature estimation and generates visually plausible building footprints. Our approach compensates for the degradation present in satellite images through a novel deep network design that combines segmentation, generative modeling, and adversarial learning for instance-level building features. Our method has proven its robustness through large-scale prototypical experiments covering heterogeneous scenarios from dense urban to sparse rural areas. Results show better quality than advanced segmentation networks for urban and environmental planning, and show promise for future continental-scale urban applications.
We develop a new framework for designing online policies given access to an oracle providing statistical information about an offline benchmark. Having access to such prediction oracles enables simple and natural Bayesian selection policies and raises the question of how these policies perform in different settings. Our work makes two important contributions toward this question. First, we develop a general technique we call compensated coupling, which can be used to derive bounds on the expected regret (i.e., additive loss with respect to a benchmark) for any online policy and offline benchmark. Second, using this technique, we show that a natural greedy policy, which we call the Bayes selector, has constant expected regret (i.e., independent of the number of arrivals and resource levels) for a large class of problems we refer to as “online allocation with finite types,” which includes widely studied online packing and online matching problems. Our results generalize and simplify several existing results for online packing and online matching and suggest a promising pathway for obtaining oracle-driven policies for other online decision-making settings. This paper was accepted by George Shanthikumar, big data analytics.
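A minimal sketch of the greedy idea, for the special case of a single resource with unit-size requests (the parameters are hypothetical and this is not the paper's general algorithm): accept an arrival only if the remaining capacity still exceeds the expected number of strictly higher-reward requests yet to come.

```python
import random

def bayes_selector_accept(reward, budget, arrivals_left, type_rewards, type_probs):
    """Greedy acceptance rule: compare remaining capacity against the expected
    number of strictly better requests remaining in the horizon."""
    expected_better = arrivals_left * sum(
        p for r, p in zip(type_rewards, type_probs) if r > reward)
    return budget > expected_better

# Hypothetical instance: 3 request types, 100 arrivals, capacity 30.
type_rewards, type_probs = [1.0, 2.0, 5.0], [0.6, 0.3, 0.1]
budget, horizon, collected = 30, 100, 0.0
for t in range(horizon):
    j = random.choices(range(3), weights=type_probs)[0]
    if budget > 0 and bayes_selector_accept(
            type_rewards[j], budget, horizon - t - 1, type_rewards, type_probs):
        budget -= 1
        collected += type_rewards[j]
print("total reward:", collected)
```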
A skyline query searches for the data points that are not dominated by any others in the dataset. It is widely adopted for applications that require multi-criteria decision making. However, skyline query processing is considerably time-consuming for high-dimensional, large-scale datasets. Parallel computing techniques are therefore needed to address this challenge, and MapReduce is one of the most popular frameworks for processing big data. A great number of efficient MapReduce skyline algorithms have been proposed in the literature, and most of their designs focus on partitioning and pruning the given dataset. However, there are still opportunities for further parallelism. In this study, we propose two parallel skyline processing algorithms using a novel LShape partitioning strategy and an effective Propagation Filtering method. These two algorithms are 2Phase LShape and 1Phase LShape, used for multiple reducers and a single reducer, respectively. Through extensive experiments, we verify that our algorithms outperform state-of-the-art approaches, especially for high-dimensional, large-scale datasets.
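The sketch below is not the LShape algorithm; it only illustrates the generic partition-then-merge structure that parallel skyline methods build on: compute a local skyline for each partition, then take the skyline of the union of local skylines, which equals the skyline of the whole dataset.

```python
def dominates(p, q):
    """p dominates q if p is at least as good in every dimension and strictly
    better in at least one (assuming smaller values are better)."""
    return all(a <= b for a, b in zip(p, q)) and any(a < b for a, b in zip(p, q))

def skyline(points):
    """Keep the points that no other point dominates (block-nested-loop style)."""
    return [p for p in points if not any(dominates(q, p) for q in points)]

def parallel_skyline(partitions):
    """Partition-then-merge: local skylines per partition, then a final skyline
    over their union; the result equals the global skyline."""
    candidates = [p for part in partitions for p in skyline(part)]
    return skyline(candidates)

# Hypothetical 2-D example split into two partitions.
part_a = [(1, 9), (3, 3), (7, 7)]
part_b = [(2, 4), (6, 1), (8, 8)]
print(parallel_skyline([part_a, part_b]))  # [(1, 9), (3, 3), (2, 4), (6, 1)]
```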