The success of ChatGPT is reshaping the landscape of the entire IT industry. The large language models (LLMs) powering ChatGPT are developing rapidly, marked by enhanced features, improved accuracy, and reduced latency. Due to the execution overhead of LLMs, prevailing commercial LLM products typically serve user queries on remote servers. However, the escalating volume of user queries and the growing complexity of LLMs have turned these servers into bottlenecks, compromising the quality of service (QoS). A potential solution to this challenge is to shift LLM inference services to edge devices, a strategy currently being explored by industry leaders such as Apple, Google, Qualcomm, and Samsung. Beyond alleviating the computational strain on servers and enhancing system scalability, deploying LLMs at the edge offers additional advantages, including real-time responses even in the absence of network connectivity and improved privacy protection for customized or personal LLMs. This article delves into the challenges and potential bottlenecks currently hindering the effective deployment of LLMs on edge devices. By deploying the LLaMa-2 7B model with INT4 quantization on diverse edge devices and systematically analyzing the experimental results, we identify insufficient memory and/or computing resources on traditional edge devices as the primary obstacles. Based on our observations and empirical analysis, we further provide insights and design guidance for the next generation of edge devices and systems, from both hardware and software directions.
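The memory obstacle above can be illustrated with back-of-envelope arithmetic. The sketch below estimates the weight footprint of a 7B-parameter model at FP16 versus INT4; it assumes exactly 7.0e9 parameters and deliberately ignores quantization metadata (scales, zero-points) and the KV cache, both of which add further overhead on a real device.

```python
# Back-of-envelope memory estimate for serving a LLaMa-2 7B model on an
# edge device. Assumes 7.0e9 parameters; ignores quantization metadata
# (per-group scales/zero-points) and the KV cache, which add overhead.

def model_weight_gib(n_params: float, bits_per_param: float) -> float:
    """Approximate weight footprint in GiB."""
    return n_params * bits_per_param / 8 / 2**30

fp16 = model_weight_gib(7.0e9, 16)   # ~13.0 GiB
int4 = model_weight_gib(7.0e9, 4)    # ~3.3 GiB

print(f"FP16 weights: {fp16:.1f} GiB")
print(f"INT4 weights: {int4:.1f} GiB")
```

Even at INT4, roughly 3.3 GiB of weights must stay resident, which already strains the RAM budget of many traditional edge devices once the runtime, OS, and KV cache are accounted for.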
Poster: Revealing Hidden Secrets: Decoding DNS PTR records with Large Language Models
Geolocating network devices is essential for various research areas. Yet, despite notable advancements, it remains one of the most challenging problems for experimentalists. One approach that has proven effective is leveraging geolocation hints in the PTR records associated with network devices. We argue that Large Language Models (LLMs), rather than humans, are better equipped to identify patterns in DNS PTR records, and can significantly scale the coverage of tools like Hoiho. We introduce an approach that leverages LLMs to classify PTR records, generate regular expressions for these classes, and produce hint-to-location mappings. We present preliminary results showing the applicability of LLMs as a scalable approach to leveraging PTR records for infrastructure geolocation.
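To make the regular-expression output concrete, here is a minimal sketch of what one Hoiho-style rule looks like in use: many operators embed a three-letter IATA airport code as a hostname label. The regex and the code-to-city table are illustrative assumptions for this sketch, not rules produced by the poster's LLM pipeline.

```python
import re

# Hypothetical Hoiho-style rule: some router PTR names embed a three-letter
# IATA airport code as a hostname label, e.g. "lax" for Los Angeles.
# The regex and the code-to-city table are illustrative assumptions.
IATA_RULE = re.compile(r"\.([a-z]{3})\d*\.[a-z0-9-]+\.net$")

IATA_TO_CITY = {"lax": "Los Angeles", "fra": "Frankfurt", "syd": "Sydney"}

def geolocate_ptr(ptr: str):
    """Return a city guess if the PTR name matches the hinted pattern."""
    m = IATA_RULE.search(ptr)
    return IATA_TO_CITY.get(m.group(1)) if m else None

print(geolocate_ptr("ae-1.r01.lax01.example.net"))  # Los Angeles
```

Hand-writing such rules is exactly the labor the poster proposes to delegate to LLMs, which can classify PTR naming schemes and emit the corresponding regexes at scale.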
- Award ID(s):
- 2246475
- PAR ID:
- 10534921
- Publisher / Repository:
- Proceedings of the ACM SIGCOMM 2024 Conference: Posters and Demos
- Date Published:
- ISBN:
- 9798400707179
- Subject(s) / Keyword(s):
- Internet Measurement, Natural Language Processing, Large Language Models, Internet Geolocation, RIPE Atlas
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
As Blockchain technology has become better understood in recent years and its capability to solve enterprise business use cases has become evident, technologists have been exploring Blockchain technology to solve use cases that have daunted industries for years. Unlike existing technologies, one of the key features of blockchain technology is its unparalleled capability to provide traceability, accountability, and immutable records that can be accessed at any point in time. One application area of interest for blockchain is securing heterogeneous networks. This paper explores the security challenges in a heterogeneous network of IoT devices and whether blockchain can be a viable solution. Using an experimental approach, we explore the possibility of using blockchain technology to secure IoT devices, validate IoT device transactions, and establish a chain of trust to secure an IoT device mesh network, as well as investigate the plausibility of using immutable transactions for forensic analysis.
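The "immutable record" property this abstract relies on can be sketched in a few lines: each ledger entry embeds the hash of its predecessor, so altering any past IoT transaction invalidates every later link. The entry field names below are illustrative, not the paper's schema.

```python
import hashlib
import json

# Minimal hash-chained ledger sketch: each entry stores the SHA-256 hash of
# the previous entry, so tampering with history breaks verification.
# Field names ("prev", "device", "data") are illustrative assumptions.

def entry_hash(entry: dict) -> str:
    return hashlib.sha256(json.dumps(entry, sort_keys=True).encode()).hexdigest()

def append(chain: list, device_id: str, payload: str) -> None:
    prev = entry_hash(chain[-1]) if chain else "0" * 64
    chain.append({"prev": prev, "device": device_id, "data": payload})

def verify(chain: list) -> bool:
    """True iff every entry still matches its successor's stored hash."""
    return all(chain[i + 1]["prev"] == entry_hash(chain[i])
               for i in range(len(chain) - 1))

ledger: list = []
append(ledger, "sensor-7", "temp=21.4")
append(ledger, "sensor-7", "temp=21.6")
assert verify(ledger)
ledger[0]["data"] = "temp=99.9"   # tamper with a past transaction
assert not verify(ledger)
```

A real blockchain adds distributed consensus on top of this chaining, which is what lets mutually untrusting IoT nodes agree on the same tamper-evident history.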
-
Accurate mapping of Autonomous Systems (ASes) to their owner organizations is fundamental for understanding the structure and dynamics of the Internet. However, because AS numbers have traditionally been delegated in an ad-hoc manner and organizational ownership has evolved over time, many organizations have registered resources under different names. Traditionally, researchers have relied on datasets like AS2Org, which map ASNs to organizations primarily using WHOIS records, but WHOIS inconsistencies often lead to missed and false relationships. We propose a new approach that leverages the Resource Public Key Infrastructure (RPKI) to map ASNs to their managing organizations. Our methodology combines multiple data sources: WHOIS records to extract organization names, RPKI certificates to identify potential siblings, and Large Language Models (LLMs) to uncover evidence not currently visible in WHOIS records. This integrated approach enables a more robust and accurate mapping of ASNs to organizations, notably improving inferences for 14% of multi-ASN clusters.
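One way to picture the sibling-identification step is as a union-find pass over certificates: ASNs that appear on the same RPKI certificate become candidate siblings and are merged into one cluster. The certificate data below is made up, and the real pipeline also folds in WHOIS names and LLM-extracted evidence; this is only a sketch of the clustering mechanic.

```python
from collections import defaultdict

# Illustrative sketch: merge ASNs into candidate organization clusters when
# they appear on the same RPKI certificate. Certificate contents are made up.
rpki_certs = [
    {"asns": [64496, 64497]},
    {"asns": [64497, 64498]},
    {"asns": [65001]},
]

parent: dict = {}

def find(a: int) -> int:
    parent.setdefault(a, a)
    while parent[a] != a:
        parent[a] = parent[parent[a]]  # path halving
        a = parent[a]
    return a

def union(a: int, b: int) -> None:
    parent[find(a)] = find(b)

for cert in rpki_certs:
    first = cert["asns"][0]
    find(first)                      # register singleton certificates too
    for asn in cert["asns"][1:]:
        union(first, asn)            # same certificate -> candidate siblings

clusters = defaultdict(set)
for asn in parent:
    clusters[find(asn)].add(asn)
print(sorted(map(sorted, clusters.values())))  # [[64496, 64497, 64498], [65001]]
```

Because 64497 appears on two certificates, the first two certificates transitively collapse into a single three-ASN cluster, which is the kind of multi-ASN grouping the abstract reports improving.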
-
Uncertainty decomposition refers to the task of decomposing the total uncertainty of a model into data (aleatoric) uncertainty, resulting from the inherent complexity or ambiguity of the data, and model (epistemic) uncertainty, resulting from the lack of knowledge in the model. Performing uncertainty decomposition for large language models (LLMs) is an important step toward improving the reliability, trustworthiness, and interpretability of LLMs, but this research task is very challenging and remains unresolved. The existing canonical method, the Bayesian Neural Network (BNN), cannot be applied to LLMs, because BNN requires training and ensembling multiple variants of a model, which is infeasible or prohibitively expensive for LLMs. In this paper, we introduce an uncertainty decomposition framework for LLMs, called input clarifications ensemble, which bypasses the need to train new models. Rather than ensembling models with different parameters, our approach generates a set of clarifications for the input, feeds them into the fixed LLM, and ensembles the corresponding predictions. We show that our framework shares a symmetric decomposition structure with BNN. Empirical evaluations demonstrate that the proposed framework provides accurate and reliable uncertainty quantification on various tasks. Code will be made publicly available at https://github.com/UCSB-NLP-Chang/llm_uncertainty.
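The BNN-style decomposition that such an ensemble mirrors can be written down directly: total uncertainty is the entropy of the mean output distribution across the K clarified inputs, aleatoric uncertainty is the mean of the per-clarification entropies, and epistemic uncertainty is their difference (the mutual information). The probability vectors below are made-up stand-ins for LLM answer distributions, not the paper's data.

```python
import math

# Entropy-based decomposition over an ensemble of output distributions,
# one per clarified input. Input vectors here are illustrative stand-ins.

def entropy(p):
    return -sum(x * math.log(x) for x in p if x > 0)

def decompose(clarification_probs):
    """clarification_probs: K probability vectors, one per clarified input."""
    k = len(clarification_probs)
    mean = [sum(col) / k for col in zip(*clarification_probs)]
    total = entropy(mean)                                        # total uncertainty
    aleatoric = sum(entropy(p) for p in clarification_probs) / k  # mean entropy
    return total, aleatoric, total - aleatoric                   # epistemic = gap

# Clarifications that disagree sharply -> epistemic uncertainty dominates.
total, alea, epis = decompose([[0.9, 0.1], [0.1, 0.9]])
print(f"total={total:.3f} aleatoric={alea:.3f} epistemic={epis:.3f}")
```

Intuitively, when different clarifications of the same ambiguous input push the fixed model toward conflicting answers, the gap between total and aleatoric uncertainty grows, and that gap is attributed to the model rather than the data.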
-
The process of matching patients with suitable clinical trials is essential for advancing medical research and providing optimal care. However, current approaches face challenges such as data standardization, ethical considerations, and a lack of interoperability between Electronic Health Records (EHRs) and clinical trial criteria. In this paper, we explore the potential of large language models (LLMs) to address these challenges by leveraging their advanced natural language generation capabilities to improve compatibility between EHRs and clinical trial descriptions. We propose an innovative privacy-aware data augmentation approach for LLM-based patient-trial matching (LLM-PTM), which preserves the benefits of LLMs while ensuring the security and confidentiality of sensitive patient data. Our experiments demonstrate a 7.32% average improvement in performance using the proposed LLM-PTM method, and generalizability to new data is improved by 12.12%. Additionally, we present case studies to further illustrate the effectiveness of our approach and provide a deeper understanding of its underlying principles.
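One concrete form a privacy-aware preprocessing step can take is scrubbing direct identifiers from EHR text before any of it reaches an LLM. The patterns and placeholder tokens below are assumptions for illustration, not the LLM-PTM method itself, and a production de-identifier would cover far more identifier types.

```python
import re

# Illustrative de-identification pass: replace direct identifiers in EHR
# free text with placeholder tokens before LLM-based augmentation.
# Patterns and tokens are assumptions, not the paper's actual pipeline.
PATTERNS = [
    (re.compile(r"\b\d{3}-\d{2}-\d{4}\b"), "[SSN]"),   # US SSN format
    (re.compile(r"\b\d{2}/\d{2}/\d{4}\b"), "[DATE]"),  # MM/DD/YYYY dates
    (re.compile(r"\bMRN[:\s]*\d+\b"), "[MRN]"),        # medical record numbers
]

def scrub(ehr_text: str) -> str:
    for pattern, token in PATTERNS:
        ehr_text = pattern.sub(token, ehr_text)
    return ehr_text

note = "Pt seen 03/14/2023, MRN: 88421, SSN 123-45-6789, T2DM on metformin."
print(scrub(note))
# Pt seen [DATE], [MRN], SSN [SSN], T2DM on metformin.
```

Only the scrubbed text would then be handed to the LLM for augmentation, keeping the clinically relevant content ("T2DM on metformin") while withholding the identifiers.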