NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Atlas: automate online service configuration in network slicing

https://doi.org/10.1145/3555050.3569115

Liu, Qiang; Choi, Nakjung; Han, Tao (November 2022, Proceedings of the 18th International Conference on Emerging Networking EXperiments and Technologies (CoNEXT)))

Full Text Available
NeuLens: spatial-based dynamic acceleration of convolutional neural networks on edge

https://doi.org/10.1145/3495243.3560528

Hou, Xueyu; Guan, Yongjie; Han, Tao (October 2022, MobiCom '22: Proceedings of the 28th Annual International Conference on Mobile Computing And Networking)

Convolutional neural networks (CNNs) play an important role in today's mobile and edge computing systems for vision-based tasks like object classification and detection. However, state-of-the-art methods on CNN acceleration are trapped in either limited practical latency speed-up on general computing platforms or latency speed-up with severe accuracy loss. In this paper, we propose a spatial-based dynamic CNN acceleration framework, NeuLens, for mobile and edge platforms. Specially, we design a novel dynamic inference mechanism, assemble region-aware convolution (ARAC) supernet, that peels off redundant operations inside CNN models as many as possible based on spatial redundancy and channel slicing. In ARAC supernet, the CNN inference flow is split into multiple independent micro-flows, and the computational cost of each can be autonomously adjusted based on its tiled-input content and application requirements. These micro-flows can be loaded into hardware like GPUs as single models. Consequently, its operation reduction can be well translated into latency speed-up and is compatible with hardware-level accelerations. Moreover, the inference accuracy can be well preserved by identifying critical regions on images and processing them in the original resolution with large micro-flow. Based on our evaluation, NeuLens outperforms baseline methods by up to 58% latency reduction with the same accuracy and by up to 67.9% accuracy improvement under the same latency/memory constraints.
more » « less
Full Text Available
Deep Reinforcement Learning for End-to-End Network Slicing: Challenges and Solutions

https://doi.org/10.1109/MNET.113.2100739

Liu, Qiang; Choi, Nakjung; Han, Tao (August 2022, IEEE Network)

Full Text Available
TrustServing: A Quality Inspection Sampling Approach for Remote DNN Services

https://doi.org/10.1109/SECON48991.2020.9158444

Hou, Xueyu; Han, Tao (June 2020, 2020 17th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON))

Deep neural networks (DNNs) are being applied to various areas such as computer vision, autonomous vehicles, and healthcare, etc. However, DNNs are notorious for their high computational complexity and cannot be executed efficiently on resource constrained Internet of Things (IoT) devices. Various solutions have been proposed to handle the high computational complexity of DNNs. Offloading computing tasks of DNNs from IoT devices to cloud/edge servers is one of the most popular and promising solutions. While such remote DNN services provided by servers largely reduce computing tasks on IoT devices, it is challenging for IoT devices to inspect whether the quality of the service meets their service level objectives (SLO) or not. In this paper, we address this problem and propose a novel approach named QIS (quality inspection sampling) that can efficiently inspect the quality of the remote DNN services for IoT devices. To realize QIS, we design a new ID-generation method to generate data (IDs) that can identify the serving DNN models on edge servers. QIS inserts the IDs into the input data stream and implements sampling inspection on SLO violations. The experiment results show that the QIS approach can reliably inspect, with a nearly 100% success rate, the service qualtiy of remote DNN services when the SLA level is 99.9% or lower at the cost of only up to 0.5% overhead.
more » « less
Full Text Available
Learning-Assisted Secure End-to-End Network Slicing for Cyber-Physical Systems

https://doi.org/10.1109/MNET.011.1900303

Liu, Qiang; Han, Tao; Ansari, Nirwan (May 2020, IEEE Network)

Full Text Available
Density Map Guided Object Detection in Aerial Images

Li, Changlin; Yang, Taojiannan; Zhu, Sijie; Chen, Chen; Guan, Shanyue (January 2020, IEEE Conference on Computer Vision and Pattern Recognition Workshop)

Object detection in high-resolution aerial images is a challenging task because of 1) the large variation in object size, and 2) non-uniform distribution of objects. A common solution is to divide the large aerial image into small (uniform) crops and then apply object detection on each small crop. In this paper, we investigate the image cropping strategy to address these challenges. Specifically, we propose a Density-Map guided object detection Network (DMNet), which is inspired from the observation that the object density map of an image presents how objects distribute in terms of the pixel intensity of the map. As pixel intensity varies, it is able to tell whether a region has objects or not, which in turn provides guidance for cropping images statistically. DMNet has three key components: a density map generation module, an image cropping module and an object detector. DMNet generates a density map and learns scale information based on density intensities to form cropping regions. Extensive experiments show that DMNet achieves state-of-the-art performance on two popular aerial image datasets, i.e. VisionDrone and UAVDT.
more » « less
Full Text Available
MutualNet: Adaptive ConvNet via Mutual Learning from Network Width and Resolution

Yang, Taojiannan; Zhu, Sijie; Chen, Chen; Yan, Shen; Zhang, Mi; Willis, Andrew (January 2020, European Conference on Computer Vision)

We propose the width-resolution mutual learning method (MutualNet) to train a network that is executable at dynamic resource constraints to achieve adaptive accuracy-efficiency trade-offs at runtime. Our method trains a cohort of sub-networks with different widths (i.e., number of channels in a layer) using different input resolutions to mutually learn multi-scale representations for each sub-network. It achieves consistently better ImageNet top-1 accuracy over the state-of-the-art adaptive network US-Net under different computation constraints, and outperforms the best compound scaled MobileNet in EfficientNet by 1.5%. The superiority of our method is also validated on COCO object detection and instance segmentation as well as transfer learning. Surprisingly, the training strategy of MutualNet can also boost the performance of a single network, which substantially outperforms the powerful AutoAugmentation in both efficiency (GPU search hours: 15000 vs. 0) and accuracy (ImageNet: 77.6% vs. 78.6%). Code is available at https://github.com/ aoyang1122/MutualNet
more » « less
Full Text Available
FedVision: Federated Video Analytics With Edge Computing

https://doi.org/10.1109/OJCS.2020.2996184

Deng, Yang; Han, Tao; Ansari, Nirwan (January 2020, IEEE Open Journal of the Computer Society)

Full Text Available
DIRECT: Distributed Cross-Domain Resource Orchestration in Cellular Edge Computing

https://doi.org/10.1145/3323679.3326516

Liu, Qiang; Han, Tao (July 2019, ACM International Symposium on Mobile Ad Hoc Networking and Computing)

Full Text Available

Search for: All records