Accurate mapping of Autonomous Systems (ASes) to their owner organizations is fundamental for understanding the structure and dynamics of the Internet. However, because AS numbers have traditionally been delegated in an ad hoc manner and organizational ownership has evolved over time, many organizations have registered resources under different names. Researchers have traditionally relied on datasets such as AS2Org, which map ASNs to organizations primarily using WHOIS records, but inconsistencies in WHOIS data often lead to missed and false relationships. We propose a new approach that leverages the Resource Public Key Infrastructure (RPKI) to map ASNs to their managing organizations. Our methodology combines multiple data sources: WHOIS records to extract organization names, RPKI certificates to identify potential sibling ASNs, and Large Language Models (LLMs) to uncover evidence not currently visible in WHOIS records. This integrated approach enables a more robust and accurate mapping of ASNs to organizations, notably improving inferences for 14% of multi-ASN clusters.
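As a concrete illustration of the sibling-grouping step, here is a minimal sketch in which each validated RPKI certificate has already been reduced to the list of ASNs it covers, and ASNs sharing a certificate are merged with union-find. The input format and function name are hypothetical simplifications for illustration, not the actual pipeline's interfaces.

```python
# Hedged sketch of the sibling-clustering step: group ASNs that appear on the
# same RPKI resource certificate, then merge overlapping groups via union-find.
# The input (cert_id -> list of ASNs) is a simplification; real RPKI data must
# first be extracted from validated certificates.
from collections import defaultdict

def cluster_siblings(cert_to_asns: dict[str, list[int]]) -> list[set[int]]:
    parent: dict[int, int] = {}

    def find(x: int) -> int:
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    def union(a: int, b: int) -> None:
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[ra] = rb

    # ASNs listed on the same certificate are managed by the same resource
    # holder, so they become candidate siblings.
    for asns in cert_to_asns.values():
        for asn in asns[1:]:
            union(asns[0], asn)

    clusters: dict[int, set[int]] = defaultdict(set)
    for asn in parent:
        clusters[find(asn)].add(asn)
    return list(clusters.values())

# Toy example: two certificates share AS65001, so all three ASNs merge.
print(cluster_siblings({"cert-a": [65001, 65002], "cert-b": [65001, 65003]}))
# -> [{65001, 65002, 65003}]
```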
Learning to Extract and Use ASNs in Hostnames
We present the design, implementation, evaluation, and validation of a system that learns regular expressions (regexes) to extract Autonomous System Numbers (ASNs) from the hostnames associated with router interfaces. We train our system with ASNs inferred by RouterToAsAssignment and bdrmapIT using topological constraints from traceroute paths, as well as ASNs recorded by operators in PeeringDB, learning regexes for 206 different suffixes. Because these methods for inferring router ownership can infer the wrong ASN, we modified bdrmapIT to integrate this new capability to extract ASNs from hostnames. Evaluated against ground truth, our modification correctly distinguished stale from correct hostnames for 92.5% of hostnames whose extracted ASN differed from bdrmapIT's initial inference. This modification allowed bdrmapIT to increase the agreement between extracted and inferred ASNs for these routers in the January 2020 ITDK from 87.4% to 97.1%, and to reduce the error rate from 1 in 7.9 to 1 in 34.5. This work presents a new avenue for collecting validation data, opening a broader horizon of opportunity for evidence-based router ownership inference.
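Below is a minimal sketch of the core learning idea for a single hostname suffix: candidate regexes are scored by how often the ASN they extract agrees with the training ASN for that interface, and the best candidate is kept only if it clears an accuracy threshold. The candidate patterns and threshold are illustrative assumptions, not the system's actual regex grammar or selection criteria.

```python
# Hedged sketch: score candidate regexes for one DNS suffix by agreement with
# training ASNs, then keep the best regex if it is accurate enough.
import re

CANDIDATES = [
    re.compile(r"\bas(\d+)\b"),      # e.g. "...as65001.example.net"
    re.compile(r"\bas[.-](\d+)\b"),  # e.g. "...as-65001.example.net"
]

def learn_regex(training: list[tuple[str, int]], min_accuracy: float = 0.9):
    """training: (hostname, ASN inferred by bdrmapIT/PeeringDB) pairs."""
    best, best_acc = None, 0.0
    for rx in CANDIDATES:
        hits = matches = 0
        for hostname, asn in training:
            m = rx.search(hostname)
            if m:
                matches += 1
                hits += int(int(m.group(1)) == asn)
        acc = hits / matches if matches else 0.0
        if acc > best_acc:
            best, best_acc = rx, acc
    return best if best_acc >= min_accuracy else None

rx = learn_regex([("ce1.as65001.example.net", 65001),
                  ("ge0.as65002.example.net", 65002)])
if rx:
    print(rx.search("xe1.as65010.example.net").group(1))  # -> 65010
```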
- PAR ID: 10289016
- Date Published:
- Journal Name: IMC '20: Proceedings of the ACM Internet Measurement Conference
- Page Range / eLocation ID: 386 to 392
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
- Background: Microorganisms are found in almost every environment, including soil, water, air, and inside other organisms such as animals and plants. While some microorganisms cause disease, most of them support biological processes such as decomposition, fermentation, and nutrient cycling. Much research has studied microbial communities in various environments and how their interactions and relationships can provide insight into various diseases. Co-occurrence network inference algorithms help us understand the complex associations of microorganisms, especially bacteria. Existing network inference algorithms employ techniques such as correlation, regularized linear regression, and conditional dependence, which have different hyper-parameters that determine the sparsity of the network. These complex microbial communities form intricate ecological networks that are fundamental to ecosystem functioning and host health. Understanding these networks is crucial for developing targeted interventions in both environmental and clinical settings. The emergence of high-throughput sequencing technologies has generated unprecedented amounts of microbiome data, necessitating robust computational methods for network inference and validation. Results: Previous methods for evaluating the quality of an inferred network include using external data and checking network consistency across sub-samples, both of which have several drawbacks that limit their applicability to real microbiome composition datasets. We propose a novel cross-validation method to evaluate co-occurrence network inference algorithms, along with new methods for applying existing algorithms to predict on test data. Our method demonstrates superior performance in handling compositional data and addressing the challenges of high dimensionality and sparsity inherent in real microbiome datasets. The proposed framework also provides robust estimates of network stability. Conclusions: Our empirical study shows that the proposed cross-validation method is useful for hyper-parameter selection (training) and for comparing the quality of inferred networks across algorithms (testing). This advancement represents a significant step forward in microbiome network analysis, providing researchers with a reliable tool for understanding complex microbial interactions. The method's applicability extends beyond microbiome studies to other fields where network inference from high-dimensional compositional data is crucial, such as gene regulatory networks and ecological food webs. Our framework establishes a new standard for validation in network inference, potentially accelerating discoveries in microbial ecology and human health. (See the cross-validation sketch after this list.)
- Regular expressions are used for diverse purposes, including input validation and firewalls. Unfortunately, they can also lead to a security vulnerability called ReDoS (Regular Expression Denial of Service), caused by super-linear worst-case execution time during regex matching. Due to the severity and prevalence of ReDoS, past work proposed automatic tools to detect and fix vulnerable regexes. Although these tools were evaluated in automatic experiments, their usability has not yet been studied, as usability was not a focus of prior work. Our insight is that the usability of existing detection and fixing tools will improve if we complement them with anti-patterns and fix strategies for vulnerable regexes. We developed novel anti-patterns for vulnerable regexes and a collection of fix strategies to repair them. We derived our anti-patterns and fix strategies from a novel theory of regex infinite ambiguity, a necessary condition for regexes vulnerable to ReDoS, and we proved the soundness and completeness of our theory. We evaluated the effectiveness of our anti-patterns, both in an automatic experiment and when applied manually. We then evaluated how much our anti-patterns and fix strategies improve developers' understanding of the output of detection and fixing tools. Our anti-patterns were effective over a large dataset of regexes (N=209,188): 100% precision and 99% recall, improving on the state of the art's 50% precision and 87% recall. Our anti-patterns were also more effective than the state of the art when applied manually (N=20): 100% of developers applied them effectively vs. 50% for the state of the art. Finally, our anti-patterns and fix strategies increased developers' understanding when using automatic tools (N=9): from a median of "Very weakly" to "Strongly" for detecting vulnerabilities, and from "Very weakly" to "Very strongly" for fixing them. (See the anti-pattern checker sketch after this list.)
- A resource leak occurs when a program fails to free some finite resource after it is no longer needed. Such leaks are a significant cause of real-world crashes and performance problems. Recent work proposed an approach to prevent resource leaks based on checking resource management specifications. A resource management specification expresses how the program allocates resources, passes them around, and releases them; it also tracks the ownership relationship between objects and resources, and aliasing relationships between objects. While this specify-and-verify approach has several advantages over prior techniques, the need to manually write annotations presents a significant barrier to its practical adoption. This paper presents a novel technique to automatically infer a resource management specification for a program, broadening the applicability of specify-and-check verification for resource leaks. Inference in this domain is challenging because resource management specifications differ significantly in nature from the types that most inference techniques target. Further, for practical effectiveness, we want a technique that can infer the resource management specification the developer intended, even when the code does not fully adhere to it. We address these challenges through a set of inference rules carefully designed to capture real-world coding patterns, yielding an effective fixed-point-based inference algorithm. We have implemented our inference algorithm in two different systems, targeting programs written in Java and C#. In an experimental evaluation, our technique inferred 85.5% of the annotations that programmers had written manually for the benchmarks, and the verifier issued nearly the same rate of false alarms with manually written and automatically inferred annotations. (See the fixed-point inference sketch after this list.)
- Recent systems for converting natural language descriptions into regular expressions (regexes) have achieved some success, but they typically handle short, formulaic text and can only produce simple regexes. Real-world regexes are complex, hard to describe in brief sentences, and sometimes require examples to fully convey the user's intent. We present a framework for regex synthesis in this setting, where both natural language (NL) and examples are available. First, a semantic parser (either grammar-based or neural) maps the natural language description into an intermediate sketch: an incomplete regex containing holes that denote missing components. Then a program synthesizer searches the regex space defined by the sketch and finds a regex consistent with the given string examples. Our semantic parser can be trained purely from weak supervision based on the correctness of the synthesized regex, or it can leverage heuristically derived sketches. We evaluate on two prior datasets (Kushman and Barzilay 2013; Locascio et al. 2016) and a real-world dataset from Stack Overflow. Our system achieves state-of-the-art performance on the prior datasets and solves 57% of the real-world dataset, on which existing neural systems completely fail. (See the hole-filling synthesis sketch after this list.)
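A minimal sketch of the cross-validation idea from the microbiome network item above, assuming a simple correlation threshold as a stand-in for the real inference algorithm: the network is inferred on training samples, then scored by how well each taxon on held-out samples is predicted from its network neighbors. The thresholding step and all names are illustrative assumptions, not the paper's method.

```python
# Hedged sketch: cross-validate a co-occurrence network by neighbor-based
# prediction on held-out samples.
import numpy as np

def cv_score(X: np.ndarray, threshold: float, test_frac: float = 0.2) -> float:
    """X: samples x taxa abundance matrix. Returns mean squared test error."""
    rng = np.random.default_rng(0)
    idx = rng.permutation(len(X))
    n_test = int(len(X) * test_frac)
    test, train = X[idx[:n_test]], X[idx[n_test:]]

    corr = np.corrcoef(train, rowvar=False)   # taxa x taxa correlations
    np.fill_diagonal(corr, 0.0)
    adj = np.abs(corr) >= threshold           # inferred network (adjacency)

    errors = []
    for j in range(X.shape[1]):
        nbrs = np.where(adj[j])[0]
        if len(nbrs) == 0:
            continue
        # least-squares fit of taxon j from its network neighbors
        coef, *_ = np.linalg.lstsq(train[:, nbrs], train[:, j], rcond=None)
        pred = test[:, nbrs] @ coef
        errors.append(np.mean((pred - test[:, j]) ** 2))
    return float(np.mean(errors)) if errors else float("inf")

# Pick the sparsity hyper-parameter that predicts held-out samples best.
X = np.random.default_rng(1).lognormal(size=(100, 30))
best = min([0.2, 0.4, 0.6], key=lambda t: cv_score(X, t))
print("selected threshold:", best)
```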
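For the ReDoS item above, a deliberately crude anti-pattern checker sketch: it textually flags one classic vulnerable shape, a quantified group whose body is itself quantified (e.g. `(a+)+`), which exhibits the infinite ambiguity that the theory identifies as necessary for ReDoS. The paper's actual anti-patterns are more precise; this scan is an assumption-laden illustration with false positives and negatives.

```python
# Hedged sketch: flag patterns containing a quantified group whose last
# element is itself quantified, a classic nested-quantifier anti-pattern.
import re

NESTED_QUANTIFIER = re.compile(r"\((?:[^()\\]|\\.)*[+*]\)\s*[+*{]")

def looks_vulnerable(pattern: str) -> bool:
    return bool(NESTED_QUANTIFIER.search(pattern))

for pat in [r"(a+)+$", r"(\d+)*;", r"[a-z]+@[a-z]+"]:
    print(pat, "->", looks_vulnerable(pat))
# (a+)+$ -> True, (\d+)*; -> True, [a-z]+@[a-z]+ -> False
```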
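For the resource-leak item above, a toy fixed-point inference sketch in the spirit described: starting from methods known to close a resource directly, it repeatedly marks any method that hands its resource to an already-marked method, until nothing changes. The intermediate representation (a map from each method to the callees that receive its resource) is invented for illustration and is far simpler than real inference over Java or C# code.

```python
# Hedged sketch: fixed-point propagation of "this method releases its
# resource" through delegation edges.
def infer_releasers(passes_to: dict[str, list[str]],
                    closes_directly: set[str]) -> set[str]:
    releasers = set(closes_directly)
    changed = True
    while changed:                      # iterate to a fixed point
        changed = False
        for method, callees in passes_to.items():
            if method not in releasers and any(c in releasers for c in callees):
                releasers.add(method)   # delegating release still releases
                changed = True
    return releasers

# close() releases; dispose() delegates to close(); run() delegates to dispose().
print(infer_releasers(
    {"run": ["dispose"], "dispose": ["close"], "log": ["format"]},
    {"close"},
))  # -> {'close', 'dispose', 'run'}
```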
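For the regex-synthesis item above, a hole-filling synthesis sketch: given a sketch with holes (written `?` here) such as a semantic parser might emit, it enumerates candidate fillings and returns the first completed regex consistent with the positive and negative examples. The hole syntax and candidate set are invented for illustration, not the framework's actual sketch language or search procedure.

```python
# Hedged sketch: enumerate hole fillings of a regex sketch and check each
# completion against the example strings.
import itertools
import re

CANDIDATES = [r"\d", r"[a-z]", r"[A-Z]", r"\w"]

def synthesize(sketch: str, positive: list[str], negative: list[str]):
    holes = sketch.count("?")
    for filling in itertools.product(CANDIDATES, repeat=holes):
        regex = sketch
        for piece in filling:           # fill holes left to right
            regex = regex.replace("?", piece, 1)
        rx = re.compile(f"^{regex}$")
        if all(rx.match(p) for p in positive) and \
                not any(rx.match(n) for n in negative):
            return regex
    return None

# NL: "three digits, a dash, then two lowercase letters" -> parser sketch:
print(synthesize("?{3}-?{2}", ["123-ab", "907-xy"], ["abc-12", "12-ab"]))
# -> \d{3}-[a-z]{2}
```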