

Title: Spatial Temporal Analysis of 40,000,000,000,000 Internet Darkspace Packets
The Internet has never been more important to our society, and understanding its behavior is essential. The Center for Applied Internet Data Analysis (CAIDA) Telescope observes a continuous stream of packets from an unsolicited darkspace representing 1/256 of the Internet. During 2019 and 2020, over 40,000,000,000,000 unique packets were collected, representing the largest publicly available corpus of Internet traffic ever assembled. Using the combined resources of the Supercomputing Centers at UC San Diego, Lawrence Berkeley National Laboratory, and MIT, the spatial temporal structure of anonymized source-destination pairs from the CAIDA Telescope data has been analyzed with GraphBLAS hierarchical hypersparse matrices. These analyses provide unique insight into this unsolicited Internet darkspace traffic, including the discovery of many previously unseen scaling relations. The data show a significant sustained increase in unsolicited traffic corresponding to the start of the COVID-19 pandemic, but relatively little change in the underlying scaling relations associated with unique sources, source fan-outs, unique links, destination fan-ins, and unique destinations. This work demonstrates the practical feasibility and benefit of the safe collection and analysis of significant quantities of anonymized Internet traffic.
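The scaling quantities listed in the abstract (unique sources, source fan-outs, unique links, destination fan-ins, and unique destinations) can all be read off a sparse source-by-destination packet-count matrix built per sample window. The sketch below illustrates this with scipy.sparse standing in for the GraphBLAS hierarchical hypersparse matrices actually used in the paper; the function name, toy packet window, and integer address encoding are illustrative assumptions, not the authors' pipeline.

```python
# Minimal sketch, not the authors' pipeline: scipy.sparse stands in for the
# GraphBLAS hierarchical hypersparse matrices used in the paper.
import numpy as np
import scipy.sparse as sp

def window_statistics(src_ids, dst_ids, n_ids):
    """Summarize one packet window of anonymized, integer-encoded addresses."""
    # Accumulate packet counts into a sparse source-by-destination matrix;
    # duplicate (src, dst) pairs are summed when converting to CSR.
    A = sp.coo_matrix(
        (np.ones(len(src_ids)), (src_ids, dst_ids)), shape=(n_ids, n_ids)
    ).tocsr()

    fan_out = np.diff(A.indptr)           # distinct destinations per source
    fan_in = np.diff(A.tocsc().indptr)    # distinct sources per destination

    return {
        "unique sources": int(np.count_nonzero(fan_out)),
        "max source fan-out": int(fan_out.max()),
        "unique links": int(A.nnz),
        "max destination fan-in": int(fan_in.max()),
        "unique destinations": int(np.count_nonzero(fan_in)),
    }

# Toy window of 6 packets over a hypothetical 8-address anonymized index space.
print(window_statistics([0, 0, 0, 2, 2, 5], [1, 3, 4, 1, 1, 1], n_ids=8))
```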
Award ID(s):
1730661 1724853
NSF-PAR ID:
10318574
Author(s) / Creator(s):
Date Published:
Journal Name:
2021 IEEE High Performance Extreme Computing Conference (HPEC)
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. The Internet has become a critical component of modern civilization, requiring scientific exploration akin to endeavors to understand the land, sea, air, and space environments. Understanding the baseline statistical distributions of traffic is essential to the scientific understanding of the Internet. Correlating data from different Internet observatories and outposts can be a useful tool for gaining insights into these distributions. This work compares observed sources from the largest Internet telescope (the CAIDA darknet telescope) with those from a commercial outpost (the GreyNoise honeyfarm). Neither of these locations actively emits Internet traffic, and both provide distinct observations of unsolicited Internet traffic (primarily botnets and scanners). Newly developed GraphBLAS hypersparse matrices and D4M associative array technologies enable the efficient analysis of these data at significant scale. The CAIDA sources are well approximated by a Zipf-Mandelbrot distribution (a minimal fitting sketch appears after this list). Over a 6-month period, 70% of the brightest (highest frequency) sources in the CAIDA telescope are consistently detected by coeval observations in the GreyNoise honeyfarm. This overlap drops as the sources dim (reduce frequency) and as the time difference between the observations grows. The probability of seeing a CAIDA source is proportional to the logarithm of its brightness. The temporal correlations are well described by a modified Cauchy distribution. These observations are consistent with a correlated high-frequency beam of sources that drifts on a time scale of a month.
  2. The Internet is transforming our society, necessitating a quantitative understanding of Internet traffic. Our team collects and curates the largest publicly available Internet traffic data, containing 50 billion packets. Utilizing a novel hypersparse neural network analysis of “video” streams of this traffic using 10,000 processors in the MIT SuperCloud reveals a new phenomenon: the importance of otherwise unseen leaf nodes and isolated links in Internet traffic. Our neural network approach further shows that a two-parameter modified Zipf-Mandelbrot distribution accurately describes a wide variety of source/destination statistics on moving sample windows ranging from 100,000 to 100,000,000 packets over collections that span years and continents. The inferred model parameters distinguish different network streams, and the model leaf parameter strongly correlates with the fraction of the traffic in different underlying network topologies. The hypersparse neural network pipeline is highly adaptable, and different network statistics and training models can be incorporated with simple changes to the image filter functions.
  3. Most privacy-conscious users utilize HTTPS and an anonymity network such as Tor to mask source and destination IP addresses. It has been shown that encrypted and anonymized network traffic traces can still leak information through a type of attack called a website fingerprinting (WF) attack. The adversary records the network traffic and is only able to observe the number of incoming and outgoing messages, the size of each message, and the time difference between messages. Previous work has shown that website fingerprinting can achieve an accuracy of over 90% when using Tor as the anonymity network. Thus, an Internet Service Provider can successfully identify the websites its users are visiting. One main concern about website fingerprinting is its practicality. The common assumption in most previous work is that a victim is visiting one website at a time and that the complete network trace of that website is available. However, this is not realistic. We propose two new algorithms to deal with situations when the victim visits one website after another (continuous visits) and visits another website in the middle of visiting one website (overlapping visits). We show that our algorithm gives an accuracy of 80% (compared to 63% in previous work [24]) in finding the split point, which is the start point of the second website in a trace. Using our proposed “splitting” algorithm, websites can be predicted with an accuracy of 70%. When two website visits are overlapping, the website fingerprinting accuracy falls dramatically. Using our proposed “sectioning” algorithm, the accuracy of predicting the website in overlapping visits improves from 22.80% to 70%. When part of the network trace is missing (either the beginning or the end), the accuracy when using our sectioning algorithm increases from 20% to over 60%.
  4. Website Fingerprinting (WF) attacks are used by local passive attackers to determine the destination of encrypted internet traffic by comparing the sequences of packets sent to and received by the user to a previously recorded data set. As a result, WF attacks are of particular concern to privacy-enhancing technologies such as Tor. In response, a variety of WF defenses have been developed, though they tend to incur high bandwidth and latency overhead or require additional infrastructure, thus making them difficult to implement in practice. Some lighter-weight defenses have been presented as well; still, they attain only moderate effectiveness against recently published WF attacks. In this paper, we aim to present a realistic and novel defense, RegulaTor, which takes advantage of common patterns in web browsing traffic to reduce both defense overhead and the accuracy of current WF attacks. In the closed-world setting, RegulaTor reduces the accuracy of the state-of-the-art attack, Tik-Tok, against comparable defenses from 66% to 25.4%. To achieve this performance, it requires 6.6% latency overhead and a bandwidth overhead 39.3% less than the leading moderate-overhead defense. In the open-world setting, RegulaTor limits a precision-tuned Tik-Tok attack to an F1-score of 0.135, compared to 0.625 for the best comparable defense.
  5. Data from Internet telescopes that monitor routed but unused IP address space has been the basis for myriad insights on malicious, unwanted, and unexpected behavior. However, service migration to cloud infrastructure and the increasing scarcity of IPv4 address space present serious challenges to traditional Internet telescopes. This paper describes DSCOPE, a cloud-based Internet telescope designed to be scalable and interactive. We describe the design and implementation of DSCOPE, which includes two major components. Collectors are deployed on cloud VMs, interact with incoming connection requests, and capture pcap traces. The data processing pipeline organizes, transforms, and archives the pcaps from deployed collectors for post-facto analysis (a minimal pcap-processing sketch appears after this list). In comparing a sampling of DSCOPE’s collected traffic with that of a traditional telescope, we see a striking difference in both the quantity and the phenomena of behavior targeting cloud systems, with up to 450× as much cloud-targeting as expected under random scanning. We also show that DSCOPE’s adaptive approach achieves impressive price-performance: the optimal yield of scanners on a given IP address is achieved in under 8 minutes of observation. Our results demonstrate that cloud-based telescopes achieve a significantly broader and more comprehensive perspective than traditional techniques.
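For the Zipf-Mandelbrot distribution referenced from item 1 above (item 2 uses a two-parameter modified form), the model is f(k) ∝ 1/(k + q)^s over source rank k. Below is a minimal fitting sketch; the toy counts, the log-space least-squares fit via scipy.optimize.curve_fit, and the parameter initialization are assumptions for illustration, not the estimation procedure used in the papers.

```python
# Minimal sketch, assuming a log-space least-squares fit; not the papers' method.
import numpy as np
from scipy.optimize import curve_fit

def zipf_mandelbrot(rank, c, q, s):
    """Zipf-Mandelbrot frequency model: f(k) = c / (k + q)**s."""
    return c / (rank + q) ** s

# Toy data: packet counts per source, sorted from brightest to dimmest.
counts = np.array([9100.0, 4800.0, 3200.0, 2400.0, 1900.0,
                   1600.0, 1350.0, 1200.0, 1050.0, 950.0])
ranks = np.arange(1, len(counts) + 1, dtype=float)

# Fit in log space so the bright head does not drown out the dim tail.
params, _ = curve_fit(
    lambda k, c, q, s: np.log(zipf_mandelbrot(k, c, q, s)),
    ranks,
    np.log(counts),
    p0=(counts[0], 1.0, 1.0),
    bounds=(1e-9, np.inf),   # keep c, q, s positive so the log stays finite
)
print("fitted c, q, s:", params)
```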
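Item 5's pipeline performs its analysis post facto over pcaps captured by the cloud collectors. The snippet below is a hypothetical minimal pass over one such capture using scapy, counting packets per source address; the file name and the choice of scapy are assumptions and not part of DSCOPE's published implementation.

```python
# Hypothetical post-facto pass over one collector capture; not DSCOPE's code.
from collections import Counter

from scapy.all import IP, rdpcap

packets = rdpcap("collector-sample.pcap")          # assumed capture file name
per_source = Counter(pkt[IP].src for pkt in packets if IP in pkt)

print("distinct sources observed:", len(per_source))
for src, n in per_source.most_common(5):           # brightest sources first
    print(src, n)
```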