Beyond Text-to-SQL for IoT Defense: A Comprehensive Framework for Querying and Classifying IoT Threats

Pavlich, Ryan; Ebadi, Nima; Tarbell, Richard; Linares, Billy; Tan, Adrian; Humphreys, Rachael; Das, Jayanta; Ghandiparsi, Rambod; Haley, Hannah; George, Jerris; Slavin, Rocky; Choo, Kim-Kwang Raymond; Dietrich, Glenn; Rios, Anthony

This content will become publicly available on May 1, 2026

Recognizing the promise of natural language interfaces to databases, prior studies have emphasized the development of text-to-SQL systems. Existing research has generally focused on generating SQL statements from text queries, but the broader challenge lies in inferring new information about the returned data. Our research makes two major contributions to address this gap. First, we introduce a novel Internet-of-Things (IoT) text-to-SQL dataset comprising 10,985 text-SQL pairs and 239,398 rows of network traffic activity. The dataset includes query types that are underrepresented in prior text-to-SQL datasets, notably temporal queries. It is drawn from a smart building’s IoT ecosystem and covers sensor readings and network traffic data. Second, our dataset allows two-stage processing, in which the data returned by a generated SQL query (network traffic) can be classified as malicious or not. Our results show that jointly training a model to query the data and to infer information about it improves overall text-to-SQL performance, nearly matching that of substantially larger models. We also show that current large language models (e.g., GPT-3.5) struggle to infer new information about returned data (i.e., they perform poorly at tabular data understanding); our dataset therefore provides a novel test bed for integrating complex domain-specific reasoning into LLMs.
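The two-stage processing described in the abstract can be pictured with a minimal sketch. Everything in it is an illustrative assumption rather than the authors' method or data: the `traffic` table and its columns, the sample rows, the hand-written SQL standing in for model output, and the toy rule in `classify_row` standing in for a learned classifier.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a hypothetical traffic table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE traffic ("
        "ts TEXT, src_ip TEXT, dst_port INTEGER, bytes_sent INTEGER)"
    )
    # Made-up rows for illustration only; not taken from the paper's dataset.
    rows = [
        ("2025-01-01 09:00:00", "10.0.0.5", 443, 1200),
        ("2025-01-01 09:05:00", "10.0.0.7", 23, 90000),
        ("2025-01-01 12:00:00", "10.0.0.5", 443, 800),
    ]
    conn.executemany("INSERT INTO traffic VALUES (?, ?, ?, ?)", rows)
    return conn


# Stage 1 stand-in: SQL a text-to-SQL model might produce for a temporal
# question such as "Which traffic was recorded between 9 and 10 am on
# January 1, 2025?". Here it is written by hand.
GENERATED_SQL = (
    "SELECT ts, src_ip, dst_port, bytes_sent FROM traffic "
    "WHERE ts >= '2025-01-01 09:00:00' AND ts < '2025-01-01 10:00:00' "
    "ORDER BY ts"
)


def run_query(conn, sql):
    """Execute the generated SQL and return the rows it selects."""
    return conn.execute(sql).fetchall()


def classify_row(row):
    """Stage 2 stand-in: a toy rule, not a trained model.

    Flags large transfers to the Telnet port (23) as malicious.
    """
    _, _, dst_port, bytes_sent = row
    suspicious = dst_port == 23 and bytes_sent > 50000
    return "malicious" if suspicious else "benign"


def two_stage(conn, sql):
    """Query first, then label each returned row."""
    return [(row, classify_row(row)) for row in run_query(conn, sql)]


if __name__ == "__main__":
    for row, label in two_stage(build_demo_db(), GENERATED_SQL):
        print(row, label)
```

The sketch keeps the two stages as separate functions only to make the hand-off visible; the abstract reports that training the two tasks jointly is what improves text-to-SQL performance.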

Award ID(s): 2145357

PAR ID: 10587598

Author(s) / Creator(s): Pavlich, Ryan; Ebadi, Nima; Tarbell, Richard; Linares, Billy; Tan, Adrian; Humphreys, Rachael; Das, Jayanta; Ghandiparsi, Rambod; Haley, Hannah; George, Jerris; Slavin, Rocky; Choo, Kim-Kwang Raymond; Dietrich, Glenn; Rios, Anthony

Publisher / Repository: Proceedings of the Workshop on Trustworthy NLP (TrustNLP 2025@NAACL)

Date Published: 2025-05-01

Page Range / eLocation ID: 1-12

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Conference Paper: The DOI is not currently available.
