NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

CALM: Multimodal Cognitive Load Assessment Framework via Engineered and Explainable Features

https://doi.org/10.1109/PerComWorkshops65533.2025.00067

Myrick, Cahree; Ghosh, Indrajeet; Jayarajah, Kasthuri; Roy, Nirmalya (March 2025, IEEE)

Free, publicly-accessible full text available March 17, 2026
TAVIC-DAS: Task and Channel-Aware Variable-Rate Image Compression for Distributed Autonomous System

https://doi.org/10.1109/PerComWorkshops65533.2025.00074

Shinde, Gaurav; Ravi, Anuradha; Dey, Emon; Lewis, Jared; Roy, Nirmalya (March 2025, IEEE)

In network-constrained environments, distributed multi-agent systems—such as UGVs and UAVs—must communicate effectively to support computationally demanding scene perception tasks like semantic and instance segmentation. These tasks are challenging because they require high accuracy even when using low-quality images, and the network limitations restrict the amount of data that can be transmitted between agents. To overcome the above challenges, we propose TAVIC-DAS to perform a task and channel-aware variable-rate image compression to enable distributed task execution and minimize communication latency by transmitting compressed images. TAVIC-DAS proposes a novel image compression and decompression framework (distributed across agents) that integrates channel parameters such as RSSI and data rate into a task-specific "semantic segmentation" DNN to generate masks representing the object of interest in the scene (ROI maps) by determining a high pixel density needed to represent objects of interest and low density to represents surrounding pixels within an image. Additionally, to accommodate agents with limited computational resources, TAVIC-DAS incorporates resource-aware model quantization. We evaluated TAVIC-DAS on platforms such as ROSMaster X3 and Jetson Xavier, which communicated using a low-frequency proprietary Doodle radio operating at 915 MHz. The experimental results show that TAVIC-DAS achieves approximately 7.62% higher PSNR and is about 6.39% more resource efficient compared to state-of-the-art techniques.
more » « less
Free, publicly-accessible full text available March 17, 2026
DACC-Comm: DNN-Powered Adaptive Compression and Flow Control for Robust Communication in Network-Constrained Environment

https://doi.org/10.1109/COMSNETS63942.2025.10885774

Dey, Emon; Ravi, Anuradha; Lewis, Jared; Kumar, Vinay Krishna; Freeman, Jade; Gregory, Timothy; Suri, Niranjan; Busart, Carl; Roy, Nirmalya (January 2025, IEEE)

Robust communication is vital for multi-agent robotic systems involving heterogeneous agents like Unmanned Aerial Vehicles (UAVs) and Unmanned Ground Vehicles (UGVs) operating in dynamic and contested environments. These agents often communicate to collaboratively execute critical tasks for perception awareness and are faced with different communication challenges: (a) The disparity in velocity between these agents results in rapidly changing distances, in turn affecting the physical channel parameters such as Received Signal Strength Indicator (RSSI), data rate (applicable for certain networks) and most importantly "reliable data transfer", (b) As these devices work in outdoor and network-deprived environments, they tend to use proprietary network technologies with low frequencies to communicate long range, which tremendously reduces the available bandwidth. This poses a challenge when sending large amounts of data for time-critical applications. To mitigate the above challenges, we propose DACC-Comm, an adaptive flow control and compression sensing framework to dynamically adjust the receiver window size and selectively sample the image pixels based on various network parameters such as latency, data rate, RSSI, and physiological factors such as the variation in movement speed between devices. DACC-Comm employs state-of-the-art DNN (TABNET) to optimize the payload and reduce the retransmissions in the network, in turn maintaining low latency. The multi-head transformer-based prediction model takes the network parameters and physiological factors as input and outputs (a) an optimal receiver window size for TCP, determining how many bytes can be sent without the sender waiting for an acknowledgment (ACK) from the receiver, (b) a compression ratio to sample a subset of pixels from an image. We propose a novel sampling strategy to select the image pixels, which are then encoded using a feature extractor. To optimize the amount of data sent across the network, the extracted feature is further quantized to INT8 with the help of post-training quantization. We evaluate DACC-Comm on an experimental testbed comprising Jackal and ROSMaster2 UGV devices that communicate image features using a proprietary radio (Doodle) in 915-MHz frequency. We demonstrate that DACC-Comm improves the retransmission rate by ≈17% and reduces the overall latency by ≈12%. The novel compression sensing strategy reduces the overall payload by ≈56%.
more » « less
Full Text Available
A Novel ROS2 QoS Policy-Enabled Synchronizing Middleware for Co-Simulation of Heterogeneous Multi-Robot Systems

https://doi.org/10.1109/ICCCN58024.2023.10230109

Dey, Emon; Walczak, Mikolaj; Anwar, Mohammad Saeid; Roy, Nirmalya; Freeman, Jade; Gregory, Timothy; Suri, Niranjan; Busart, Carl (July 2023, IEEE)
SrPPG: Semi-Supervised Adversarial Learning for Remote Photoplethysmography with Noisy Data

https://doi.org/10.1109/SMARTCOMP58114.2023.00021

Hasan, Zahid; Md Faridee, Abu Zaher; Ahmed, Masud; Ayyanar, Shibi; Roy, Nirmalya (June 2023, IEEE)
HeteroSys: Heterogeneous and Collaborative Sensing in the Wild

https://doi.org/10.1109/SMARTCOMP58114.2023.00073

Ghosh, Indrajeet; Goldstein, Adam; Chakma, Avijoy; Freeman, Jade; Gregory, Timothy; Suri, Niranjan; Ramamurthy, Sreenivasan Ramasamy; Roy, Nirmalya (June 2023, IEEE)
A Systematic Study on Object Recognition Using Millimeter-wave Radar

https://doi.org/10.1109/SMARTCOMP58114.2023.00025

Devnath, Maloy Kumar; Chakma, Avijoy; Anwar, Mohammad Saeid; Dey, Emon; Hasan, Zahid; Conn, Marc; Pal, Biplab; Roy, Nirmalya (June 2023, IEEE)
An Online Continuous Semantic Segmentation Framework With Minimal Labeling Efforts

https://doi.org/10.1109/SMARTCOMP58114.2023.00032

Ahmed, Masud; Hasan, Zahid; Yingling, Tim; O’Leary, Eric; Purushotham, Sanjay; You, Suya; Roy, Nirmalya (June 2023, IEEE)
CHARLIE: A Chatbot That Recommends Daily Fitness and Diet Plans

https://doi.org/10.1109/PerComWorkshops56833.2023.10150359

Chowdhury, Deepanjali; Roy, Ahana; Ramamurthy, Sreenivasan Ramasamy; Roy, Nirmalya (March 2023, IEEE)
SpecTextor: End-to-End Attention-based Mechanism for Dense Text Generation in Sports Journalism

https://doi.org/10.1109/SMARTCOMP55677.2022.00081

Ghosh, Indrajeet; Ivler, Matthew; Ramamurthy, Sreenivasan Ramasamy; Roy, Nirmalya (June 2022, 2022 IEEE International Conference on Smart Computing (SMARTCOMP))

Language-guided smart systems can help to design next-generation human-machine interactive applications. The dense text description is one of the research areas where systems learn the semantic knowledge and visual features of each video frame and map them to describe the video's most relevant subjects and events. In this paper, we consider untrimmed sports videos as our case study. Generating dense descriptions in the sports domain to supplement journalistic works without relying on commentators and experts requires more investigation. Motivated by this, we propose an end-to-end automated text-generator, SpecTextor, that learns the semantic features from untrimmed videos of sports games and generates associated descriptive texts. The proposed approach considers the video as a sequence of frames and sequentially generates words. After splitting videos into frames, we use a pre-trained VGG-16 model for feature extraction and encoding the video frames. With these encoded frames, we posit a Long Short-Term Memory (LSTM) based attention-decoder pipeline that leverages soft-attention mechanism to map the semantic features with relevant textual descriptions to generate the explanation of the game. Because developing a comprehensive description of the game warrants training on a set of dense time-stamped captions, we leverage two available public datasets: ActivityNet Captions and Microsoft Video Description. In addition, we utilized two different decoding algorithms: beam search and greedy search and computed two evaluation metrics: BLEU and METEOR scores.
more » « less
Full Text Available

« Prev Next »

Search for: All records