Partitioning Communication Streams Into Graph Snapshots

Wendt, Jeremy D.; Field, Richard V.; Phillips, Cynthia A.; Prasadan, Arvind; Wilson, Tegan; Soundarajan, Sucheta; Bhowmick, Sanjukta

doi:10.1109/TNSE.2022.3223614

Citation Details

Partitioning Communication Streams Into Graph Snapshots

We present EASEE (Edge Advertisements into Snapshots using Evolving Expectations) for partitioning streaming communication data into static graph snapshots. Given streaming communication events (A talks to B), EASEE identifies when events suffice for a static graph (a snapshot ). EASEE uses combinatorial statistical models to adaptively find when a snapshot is stable, while watching for significant data shifts – indicating a new snapshot should begin. If snapshots are not found carefully, they poorly represent the underlying data – and downstream graph analytics fail: We show a community detection example. We demonstrate EASEE's strengths against several real-world datasets, and its accuracy against known-answer synthetic datasets. Synthetic datasets' results show that (1) EASEE finds known-answer data shifts very quickly; and (2) ignoring these shifts drastically affects analytics on resulting snapshots. We show that previous work misses these shifts. Further, we evaluate EASEE against seven real-world datasets (330 K to 2.5B events), and find snapshot-over-time behaviors missed by previous works. Finally, we show that the resulting snapshots' measured properties (e.g., graph density) are altered by how snapshots are identified from the communication event stream. In particular, EASEE's snapshots do not generally “densify” over time, contradicting previous influential results that used simpler partitioning methods. more »

Award ID(s):: 1916084

PAR ID:: 10388359

Author(s) / Creator(s):: Wendt, Jeremy D.; Field, Richard V.; Phillips, Cynthia A.; Prasadan, Arvind; Wilson, Tegan; Soundarajan, Sucheta; Bhowmick, Sanjukta

Date Published:: 2022-12-01

Journal Name:: IEEE Transactions on Network Science and Engineering

ISSN:: 2334-329X

Page Range / eLocation ID:: 1 to 18

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1109/TNSE.2022.3223614

More Like this