NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Cloud Is Closer Than It Appears: Revisiting the Tradeoffs of Distributed Real-Time Inference

Sharma, Pragya; Qiu, Hang; Srivastava, Mani (August 2025, IEEE)

The increasing deployment of deep neural networks (DNNs) in cyber-physical systems (CPS) enhances perception fidelity, but imposes substantial computational demands on execution platforms, posing challenges to real-time control deadlines. Traditional distributed CPS architectures typically favor on-device inference to avoid network variability and contention-induced delays on remote platforms. However, this design choice places significant energy and computational demands on the local hardware. In this work, we revisit the assumption that cloud-based inference is intrinsically unsuitable for latency-sensitive control tasks. We demonstrate that, when provisioned with high-throughput compute resources, cloud platforms can effectively amortize network and queueing delays, enabling them to match or surpass on-device performance for real-time decision-making. Specifically, we develop a formal analytical model that characterizes distributed inference latency as a function of the sensing frequency, platform throughput, network delay, and task-specific safety constraints. We instantiate this model in the context of emergency braking for autonomous driving and validate it through extensive simulations using real-time vehicular dynamics. Our empirical results identify concrete conditions under which cloud-based inference adheres to safety margins more reliably than its on-device counterpart. These findings challenge prevailing design strategies and suggest that the cloud is not merely a feasible option, but often the preferred inference location for distributed CPS architectures. In this light, the cloud is not as distant as traditionally perceived; in fact, it is closer than it appears.
more » « less
Free, publicly-accessible full text available August 4, 2026
Toward Foundation Models for Online Complex Event Detection in CPS-IoT: A Case Study

https://doi.org/10.1145/3722565.3727198

Han, Liying; Dong, Gaofeng; Ouyang, Xiaomin; Kaplan, Lance; Cerutti, Federico; Srivastava, Mani (May 2025, ACM)

Complex events (CEs) play a crucial role in CPS-IoT applications, enabling high-level decision-making in domains such as smart monitoring and autonomous systems. However, most existing models focus on short-span perception tasks, lacking the long-term reasoning required for CE detection. CEs consist of sequences of short-time atomic events (AEs) governed by spatiotemporal dependencies. Detecting them is difficult due to long, noisy sensor data and the challenge of filtering out irrelevant AEs while capturing meaningful patterns. This work explores CE detection as a case study for CPS-IoT foundation models capable of long-term reasoning. We evaluate three approaches: (1) leveraging large language models (LLMs), (2) employing various neural architectures that learn CE rules from data, and (3) adopting a neurosymbolic approach that integrates neural models with symbolic engines embedding human knowledge. Our results show that the state-space model, Mamba, which belongs to the second category, outperforms all methods in accuracy and generalization to longer, unseen sensor traces. These findings suggest that state-space models could be a strong backbone for CPS-IoT foundation models for long-span reasoning tasks.
more » « less
Free, publicly-accessible full text available May 6, 2026
Leaking Through the Physics: Covert Cyber-Physical Data Exfiltration Through Unobserved Physics

https://doi.org/10.1007/978-3-031-93354-7_1

Chan, Matthew; Garcia, Luis; Snyder, Nathaniel; Lucas, Marcus; Ding, Aolin; Hass, Amin; Sokolsky, Oleg; Weimer, James; Tabuada, Paulo; Zonouz, Saman; et al (June 2025, Springer Nature Switzerland)

Free, publicly-accessible full text available June 27, 2026
SensorBench: Benchmarking LLMs in Coding-Based Sensor Processing

https://doi.org/10.1145/3708468.3711882

Quan, Pengrui; Ouyang, Xiaomin; Jeyakumar, Jeya Vikranth; Wang, Ziqi; Xing, Yang; Srivastava, Mani (February 2025, ACM)

Effective processing, interpretation, and management of sensor data have emerged as a critical component of cyber-physical systems. Traditionally, processing sensor data requires profound theoretical knowledge and proficiency in signal-processing tools. However, recent works show that Large Language Models (LLMs) have promising capabilities in processing sensory data, suggesting their potential as copilots for developing sensing systems. To explore this potential, we construct a comprehensive benchmark, SensorBench, to establish a quantifiable objective. The benchmark incorporates diverse real-world sensor datasets for various tasks. The results show that while LLMs exhibit considerable proficiency in simpler tasks, they face inherent challenges in processing compositional tasks with parameter selections compared to engineering experts. Additionally, we investigate four prompting strategies for sensor processing and show that self-verification can outperform all other baselines in 48% of tasks. Our study provides a comprehensive benchmark and prompting analysis for future developments, paving the way toward an LLM-based sensor processing copilot.
more » « less
Free, publicly-accessible full text available February 26, 2026
Towards a Performance-Driven Device-Edge-Cloud Relationship

https://doi.org/10.1145/3708468.3715681

Sharma, Pragya; Wang, Brian; Ouyang, Xiaomin; Nanayakkara, Rahal; Balaji, Bharathan; Tabuada, Paulo; Srivastava, Mani B (February 2025, ACM)

Real-time cyber-physical systems (CPS) rely on Perception-Cognition-Actuation (PCA) pipelines to enable autonomous observation, decisionmaking, and action execution. Closed-loop PCA systems utilize feedback-driven control to iteratively adapt actions in response to real-time environmental changes whereas open-loop PCA systems execute single actions without iterative feedback. The overall performance of these systems is inherently tied to the models selected for each pipeline component. Recent advancements in neural networks, particularly for perception tasks, have substantially enhanced CPS capabilities but have introduced significant complexity into the PCA pipeline. While traditional research [1] often evaluates perception models in static, controlled settings, it fails to account for the cascading latency and accuracy trade-offs that manifest across interconnected PCA modules in dynamic, real-time applications. Additionally, the proliferation of distributed device-edge-cloud architectures [2] has expanded computational possibilities but introduced new challenges in balancing latency and accuracy with resource constraints. The holistic impact of model selection, deployment platforms, and network conditions on application performance in real-time scenarios remains under-explored.
more » « less
Free, publicly-accessible full text available February 26, 2026
InfoMAE: Pairing-Efficient Cross-Modal Alignment with Informational Masked Autoencoders for IoT Signals

Kimure, Tomoyoshi; Li, Xinlin; Hanna, Osama; Chen, Yatong; Chen, Yizhuo; Kara, Denizhan; Wang, Tianshi; Li, Jinyang; Ouyang, Xiaomin; Liu, Shengzhong; et al (April 2025, ACM The Web Conference)

Free, publicly-accessible full text available April 28, 2026
InfoMAE: Pairing-Efficient Cross-Modal Alignment with Informational Masked Autoencoders for IoT Signals

Kimura, Tomoyoshi; Li, Xinlin; Hanna, Osama; Chen, Yatong; Chen, Yizhuo; Kara, Denizhan; Wang, Tianshi; Li, Jinyang; Ouyang, Xiaomin; Liu, Shengzhong; et al (April 2025, ACN The Web Conference)

Free, publicly-accessible full text available April 28, 2026
Reimagining Time Series Foundation Models: Metadata and State-Space Model Perspectives

Quan, Pengrui; Mulayim, Ozan; Han, Liying; Hong, Dezhi; Berges, Mario; Srivastava, Mani (December 2024, NeurIPS Workshop on Time Series in the Age of Large Models)

Free, publicly-accessible full text available December 14, 2025
Artificial Intelligence of Things: A Survey

https://doi.org/10.1145/3690639

Siam, Shakhrul Iman; Ahn, Hyunho; Liu, Li; Alam, Samiul; Shen, Hui; Cao, Zhichao; Shroff, Ness; Krishnamachari, Bhaskar; Srivastava, Mani; Zhang, Mi (January 2025, ACM Transactions on Sensor Networks)

The integration of the Internet of Things (IoT) and modern Artificial Intelligence (AI) has given rise to a new paradigm known as the Artificial Intelligence of Things (AIoT). In this survey, we provide a systematic and comprehensive review of AIoT research. We examine AIoT literature related to sensing, computing, and networking & communication, which form the three key components of AIoT. In addition to advancements in these areas, we review domain-specific AIoT systems that are designed for various important application domains. We have also created an accompanying GitHub repository, where we compile the papers included in this survey: https://github.com/AIoT-MLSys-Lab/AIoT-Survey. This repository will be actively maintained and updated with new research as it becomes available. As both IoT and AI become increasingly critical to our society, we believe that AIoT is emerging as an essential research field at the intersection of IoT and modern AI. It is our hope that this survey will serve as a valuable resource for those engaged in AIoT research and act as a catalyst for future explorations to bridge gaps and drive advancements in this exciting field.
more » « less
Free, publicly-accessible full text available January 31, 2026
Are Time Series Foundation Models Ready to Revolutionize Predictive Building Analytics?

https://doi.org/10.1145/3671127.3698177

Mulayim, Ozan Baris; Quan, Pengrui; Han, Liying; Ouyang, Xiaomin; Hong, Dezhi; Bergés, Mario; Srivastava, Mani (October 2024, ACM)

Recent advancements in large language models have spurred significant developments in Time Series Foundation Models (TSFMs). These models claim great promise in performing zero-shot forecasting without the need for specific training, leveraging the extensive "corpus" of time-series data they have been trained on. Forecasting is crucial in predictive building analytics, presenting substantial untapped potential for TSFMS in this domain. However, time-series data are often domain-specific and governed by diverse factors such as deployment environments, sensor characteristics, sampling rate, and data resolution, which complicates generalizability of these models across different contexts. Thus, while language models benefit from the relative uniformity of text data, TSFMs face challenges in learning from heterogeneous and contextually varied time-series data to ensure accurate and reliable performance in various applications. This paper seeks to understand how recently developed TSFMs perform in the building domain, particularly concerning their generalizability. We benchmark these models on three large datasets related to indoor air temperature and electricity usage. Our results indicate that TSFMs exhibit marginally better performance compared to statistical models on unseen sensing modality and/or patterns. Based on the benchmark results, we also provide insights for improving future TSFMs on building analytics.
more » « less
Full Text Available

« Prev Next »

Search for: All records