-
We investigate a novel communications system that integrates scalable multi-layer 360-degree video tiling, viewport-adaptive rate-distortion optimal resource allocation, and VR-centric edge computing and caching, to enable future high-quality untethered VR streaming. Our system comprises a collection of 5G small cells that can pool their communication, computing, and storage resources to collectively deliver scalable 360-degree video content to mobile VR clients at much higher quality. Our major contributions are a rigorous design of multi-layer 360-degree tiling and related models of statistical user navigation, and the analysis and optimization of edge-based multi-user VR streaming that integrates viewport adaptation and server cooperation. We also explore the possibility of network-coded data operation and its implications for the analysis, optimization, and system performance we pursue here. We demonstrate considerable gains in delivered immersion fidelity, featuring much higher 360-degree viewport peak signal-to-noise ratio (PSNR) as well as higher VR video frame rates and spatial resolutions.
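To illustrate the viewport-adaptive rate-distortion trade-off described above, the minimal sketch below greedily upgrades scalable tile layers under a bandwidth budget, weighting each upgrade by the probability that the user's viewport covers the tile. The Tile fields, layer sizes, and the greedy marginal-utility rule are illustrative assumptions, not the paper's actual optimization.

```python
# Illustrative sketch (not the paper's exact formulation): greedy viewport-weighted
# rate-distortion layer allocation for 360-degree video tiles under a bandwidth budget.
# Tile layer sizes, PSNR values, and view probabilities are hypothetical inputs.

from dataclasses import dataclass

@dataclass
class Tile:
    view_prob: float      # probability the user's viewport covers this tile
    layer_bits: list      # cumulative bits needed to deliver layers 0..L
    layer_psnr: list      # PSNR delivered at each layer
    level: int = 0        # currently assigned layer (0 = base layer)

def allocate_layers(tiles, budget_bits):
    """Greedily upgrade the tile layer with the best expected PSNR gain per bit."""
    spent = sum(t.layer_bits[0] for t in tiles)   # base layers are always sent
    while True:
        best, best_gain = None, 0.0
        for t in tiles:
            if t.level + 1 >= len(t.layer_bits):
                continue                           # tile already at its top layer
            extra_bits = t.layer_bits[t.level + 1] - t.layer_bits[t.level]
            gain = t.view_prob * (t.layer_psnr[t.level + 1] - t.layer_psnr[t.level])
            if spent + extra_bits <= budget_bits and gain / extra_bits > best_gain:
                best, best_gain = t, gain / extra_bits
        if best is None:
            return tiles                           # budget exhausted or no useful upgrade
        spent += best.layer_bits[best.level + 1] - best.layer_bits[best.level]
        best.level += 1
```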
-
We consider an LTE downlink scheduling system where a base station allocates resource blocks (RBs) to users running delay-sensitive applications. We aim to find a scheduling policy that minimizes the queuing delay experienced by the users. We formulate this problem as a Markov Decision Process (MDP) that integrates the channel quality indicator (CQI) of each user in each RB and the queue status of each user. To solve this complex problem involving high-dimensional state and action spaces, we propose a Deep Reinforcement Learning based scheduling framework that utilizes the Deep Deterministic Policy Gradient (DDPG) algorithm to minimize the queuing delay experienced by the users. Our extensive experiments demonstrate that our approach outperforms state-of-the-art benchmarks in terms of average throughput, queuing delay, and fairness, achieving up to 55% lower queuing delay than the best benchmark.
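To make the MDP formulation concrete, the sketch below builds the scheduler state from the per-user, per-RB CQI matrix and the queue lengths, passes it through a small DDPG-style actor network, and maps the continuous action to an RB-to-user assignment. The network sizes, state encoding, and action-to-allocation mapping are assumed for illustration and are not the paper's exact design.

```python
# Minimal sketch (assumed shapes and network sizes, not the paper's exact design):
# map the scheduler's MDP state -- per-user/per-RB CQI plus per-user queue lengths --
# through a DDPG-style actor, then convert the continuous action into an RB assignment.

import numpy as np
import torch
import torch.nn as nn

NUM_USERS, NUM_RBS = 4, 10

class Actor(nn.Module):
    def __init__(self, state_dim, action_dim):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, 128), nn.ReLU(),
            nn.Linear(128, action_dim), nn.Tanh(),   # bounded continuous action
        )

    def forward(self, state):
        return self.net(state)

def build_state(cqi, queues):
    """Flatten the CQI matrix (users x RBs) and queue lengths into one state vector."""
    return torch.tensor(np.concatenate([cqi.ravel(), queues]), dtype=torch.float32)

def action_to_allocation(action):
    """Interpret the actor output as per-user scores for each RB and assign each RB
    to the user with the highest score (one user per RB)."""
    scores = action.detach().numpy().reshape(NUM_USERS, NUM_RBS)
    return scores.argmax(axis=0)                     # RB index -> user index

actor = Actor(state_dim=NUM_USERS * NUM_RBS + NUM_USERS, action_dim=NUM_USERS * NUM_RBS)
state = build_state(np.random.randint(1, 16, (NUM_USERS, NUM_RBS)), np.random.rand(NUM_USERS))
allocation = action_to_allocation(actor(state))
```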
-
Internet of Things (IoT) sensors often operate in unknown dynamic environments comprising latency-sensitive data sources, dynamic processing loads, and communication channels of unknown statistics. Such settings represent a natural application domain of reinforcement learning (RL), which enables computing and learning decision policies online, with no a priori knowledge. In our previous work, we introduced a post-decision state (PDS) based RL framework, which considerably accelerates the rate of learning an optimal decision policy. The present paper formulates an efficient hardware architecture for the action evaluation step, the most computationally intensive step in the PDS-based learning framework. By leveraging the unique characteristics of PDS learning, we optimize its state value expectation and known cost computational blocks to speed up the overall computation. Our experiments show that the optimized circuit is 49 times faster than its software implementation counterpart, and six times faster than a Q-learning hardware accelerator.
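A software view of the action-evaluation step that the paper accelerates in hardware might look like the sketch below: each action is scored as its known cost plus the discounted value of the post-decision state it deterministically leads to, and only the PDS value table is learned. The cost function, the known transition, and the table sizes are placeholders, not the paper's circuit or problem instance.

```python
# Illustrative sketch (assumed problem structure): the action-evaluation step in
# post-decision state (PDS) based learning. The known-cost function and the
# deterministic known transition to a PDS are placeholders.

import numpy as np

NUM_STATES, NUM_ACTIONS = 50, 4
pds_value = np.zeros(NUM_STATES)          # learned value of each post-decision state

def known_cost(state, action):
    """Placeholder for the known, immediate part of the cost (e.g. transmission energy)."""
    return 0.1 * action

def known_transition(state, action):
    """Placeholder deterministic transition from (state, action) to its PDS."""
    return (state + action) % NUM_STATES

def evaluate_actions(state, discount=0.95):
    """The computationally intensive step targeted for hardware acceleration: score
    every action as its known cost plus the discounted value of the resulting PDS."""
    q = np.array([known_cost(state, a) + discount * pds_value[known_transition(state, a)]
                  for a in range(NUM_ACTIONS)])
    return q.argmin(), q                   # greedy action and the full evaluation
```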
-
Recent advances in multi-rotor vehicle control and the miniaturization of hardware, sensing, and battery technologies have enabled the cheap, practical design of micro air vehicles for civilian and hobby applications. In parallel, several applications are being envisioned that bring together a swarm of multiple networked micro air vehicles to accomplish large tasks in coordination. However, it is still very challenging to deploy multiple micro air vehicles concurrently. To address this challenge, we have developed an open software/hardware platform called the University at Buffalo’s Airborne Networking and Communications Testbed (UB-ANC), and an associated emulation framework called the UB-ANC Emulator. In this paper, we present the UB-ANC Emulator, which combines planning and control for multiple micro air vehicles with high-fidelity network simulation, enables practitioners to design micro air vehicle swarm applications in software, and provides a seamless transition to deployment on actual hardware. We demonstrate the UB-ANC Emulator’s accuracy against experimental data collected in two mission scenarios: a simple mission with three networked micro air vehicles and a sophisticated coverage path planning mission with a single micro air vehicle. To accurately reflect the performance of a micro air vehicle swarm, where communication links are subject to interference and packet losses, and protocols at the data link, network, and transport layers affect network throughput, latency, and reliability, we integrate the open-source discrete-event network simulator ns-3 into the UB-ANC Emulator. We demonstrate through node-to-node and end-to-end measurements how the UB-ANC Emulator can be used to simulate multiple networked micro air vehicles with accurate modeling of mobility, control, wireless channel characteristics, and network protocols defined in ns-3.
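As a small, self-contained example of the single-vehicle coverage path planning mission mentioned above, the sketch below generates lawnmower-style waypoints over a rectangular area. The function name and parameters are hypothetical and are not part of the UB-ANC Emulator's API; it only illustrates the kind of mission such an emulator can replay.

```python
# Illustrative sketch (hypothetical parameters, not the UB-ANC Emulator's API):
# generate a simple lawnmower ("boustrophedon") waypoint list for a single micro air
# vehicle covering a rectangular area.

def coverage_waypoints(width_m, height_m, lane_spacing_m, altitude_m):
    """Return (x, y, z) waypoints sweeping back and forth across the area."""
    waypoints = []
    x, direction = 0.0, 1
    while x <= width_m:
        y_start, y_end = (0.0, height_m) if direction > 0 else (height_m, 0.0)
        waypoints.append((x, y_start, altitude_m))   # enter the lane
        waypoints.append((x, y_end, altitude_m))     # fly to the far end
        x += lane_spacing_m
        direction *= -1                              # reverse sweep direction
    return waypoints

# Example: a 100 m x 60 m field, 10 m lane spacing, 20 m flight altitude.
mission = coverage_waypoints(100.0, 60.0, 10.0, 20.0)
```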