NSF PAR Search | NSF Public Access Repository

A Single 2D Pose with Context is Worth Hundreds for 3D Human Pose Estimation

Zhao, Qitao; Zhang, Ce; Liu, Mengyuan; Chen, Chen (September 2023, Thirty-Seventh Annual Conference on Neural Information Processing Systems (NeurIPS))
Atlas: automate online service configuration in network slicing

https://doi.org/10.1145/3555050.3569115

Liu, Qiang; Choi, Nakjung; Han, Tao (November 2022, Proceedings of the 18th International Conference on Emerging Networking EXperiments and Technologies (CoNEXT))

Full Text Available
NeuLens: spatial-based dynamic acceleration of convolutional neural networks on edge

https://doi.org/10.1145/3495243.3560528

Hou, Xueyu; Guan, Yongjie; Han, Tao (October 2022, MobiCom '22: Proceedings of the 28th Annual International Conference on Mobile Computing And Networking)

Convolutional neural networks (CNNs) play an important role in today's mobile and edge computing systems for vision-based tasks like object classification and detection. However, state-of-the-art CNN acceleration methods either deliver limited practical latency speed-up on general computing platforms or achieve speed-up at the cost of severe accuracy loss. In this paper, we propose NeuLens, a spatial-based dynamic CNN acceleration framework for mobile and edge platforms. Specifically, we design a novel dynamic inference mechanism, the assemble region-aware convolution (ARAC) supernet, that removes as many redundant operations inside CNN models as possible based on spatial redundancy and channel slicing. In the ARAC supernet, the CNN inference flow is split into multiple independent micro-flows, and the computational cost of each can be autonomously adjusted based on its tiled-input content and application requirements. These micro-flows can be loaded into hardware such as GPUs as single models. Consequently, the reduction in operations translates well into latency speed-up and is compatible with hardware-level accelerations. Moreover, inference accuracy is well preserved by identifying critical regions in images and processing them at the original resolution with a large micro-flow. Based on our evaluation, NeuLens outperforms baseline methods by up to 58% latency reduction at the same accuracy and by up to 67.9% accuracy improvement under the same latency/memory constraints.
Full Text Available
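
The NeuLens abstract above describes splitting an input into tiles and adjusting the cost of each micro-flow based on tile content. As a rough illustration only, the sketch below splits a grayscale image into tiles and routes each tile to a "large" or "small" micro-flow. The variance-based score, the threshold, and all function names are assumptions made for this example; the abstract does not state how NeuLens scores tiles or selects micro-flows.

```python
def split_into_tiles(image, tile):
    """Split a 2D list of pixels into non-overlapping tile x tile blocks.

    Assumes both image dimensions are multiples of `tile`.
    """
    rows, cols = len(image), len(image[0])
    tiles = []
    for r in range(0, rows, tile):
        for c in range(0, cols, tile):
            tiles.append([row[c:c + tile] for row in image[r:r + tile]])
    return tiles


def variance(tile_pixels):
    """Population variance of the pixel values in one tile."""
    flat = [p for row in tile_pixels for p in row]
    mean = sum(flat) / len(flat)
    return sum((p - mean) ** 2 for p in flat) / len(flat)


def assign_micro_flows(image, tile, threshold):
    """Route each tile to a hypothetical 'large' or 'small' micro-flow.

    Variance is only a stand-in for whatever content measure the real
    system uses to identify critical regions.
    """
    return [
        "large" if variance(t) > threshold else "small"
        for t in split_into_tiles(image, tile)
    ]
```

In this toy version, a tile with visible detail (high variance) is treated as a critical region and sent to the larger micro-flow, while flat tiles take the cheaper path.
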
A Lightweight Graph Transformer Network for Human Mesh Reconstruction from 2D Human Pose

https://doi.org/10.1145/3503161.3547844

Zheng, Ce; Mendieta, Matias; Wang, Pu; Lu, Aidong; Chen, Chen (October 2022, ACM)

Full Text Available
DeepMix: mobility-aware, lightweight, and hybrid 3D object detection for headsets

https://doi.org/10.1145/3498361.3538945

Guan, Yongjie; Hou, Xueyu; Wu, Nan; Han, Bo; Han, Tao (June 2022, the 20th Annual International Conference on Mobile Systems, Applications and Services)

Mobile headsets should be capable of understanding 3D physical environments to offer a truly immersive experience for augmented/mixed reality (AR/MR). However, their small form factor and limited computation resources make it extremely challenging to execute 3D vision algorithms in real time, as these algorithms are known to be more compute-intensive than their 2D counterparts. In this paper, we propose DeepMix, a mobility-aware, lightweight, and hybrid 3D object detection framework for improving the user experience of AR/MR on mobile headsets. Motivated by our analysis and evaluation of state-of-the-art 3D object detection models, DeepMix intelligently combines edge-assisted 2D object detection and novel, on-device 3D bounding box estimations that leverage depth data captured by headsets. This leads to low end-to-end latency and significantly boosts detection accuracy in mobile scenarios. A unique feature of DeepMix is that it fully exploits the mobility of headsets to fine-tune detection results and boost detection accuracy. To the best of our knowledge, DeepMix is the first 3D object detection framework that achieves 30 FPS (i.e., an end-to-end latency much lower than the stringent 100 ms requirement of interactive AR/MR). We implement a prototype of DeepMix on Microsoft HoloLens and evaluate its performance via both extensive controlled experiments and a user study with 30+ participants. DeepMix not only improves detection accuracy by 9.1–37.3% but also reduces end-to-end latency by 2.68–9.15×, compared to the baseline that uses existing 3D object detection models.
Full Text Available
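
The DeepMix abstract above mentions combining 2D detections with depth data to estimate 3D bounding boxes. One common way to do this, shown here purely as an illustration, is to back-project the depth pixels inside a 2D box through a pinhole camera model and take the extent of the resulting points. The function name, the intrinsics `fx`, `fy`, `cx`, `cy`, and the axis-aligned box are assumptions for this sketch; the abstract does not describe the estimation procedure DeepMix actually uses.

```python
def box_2d_to_3d(box, depth, fx, fy, cx, cy):
    """Estimate an axis-aligned 3D box from a 2D box and a depth map.

    box:   (u_min, v_min, u_max, v_max) in pixels, max bounds exclusive
    depth: 2D list indexed as depth[v][u], in meters; values <= 0 mean
           "no reading" and are skipped
    fx, fy, cx, cy: pinhole camera intrinsics (assumed known)

    Returns ((x_min, y_min, z_min), (x_max, y_max, z_max)) in camera
    coordinates, or None if the box contains no valid depth.
    """
    u_min, v_min, u_max, v_max = box
    points = []
    for v in range(v_min, v_max):
        for u in range(u_min, u_max):
            z = depth[v][u]
            if z <= 0:
                continue
            # Pinhole back-projection of pixel (u, v) at depth z.
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            points.append((x, y, z))
    if not points:
        return None
    mins = tuple(min(p[i] for p in points) for i in range(3))
    maxs = tuple(max(p[i] for p in points) for i in range(3))
    return mins, maxs
```

A practical system would also need to reject background pixels that fall inside the 2D box, which this sketch does not attempt.
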
DistrEdge: Speeding up Convolutional Neural Network Inference on Distributed Edge Devices

https://doi.org/10.1109/IPDPS53621.2022.00110

Hou, Xueyu; Guan, Yongjie; Han, Tao; Zhang, Ning (May 2022, 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS))

Full Text Available
Constraint-Aware Deep Reinforcement Learning for End-to-End Resource Orchestration in Mobile Networks

https://doi.org/10.1109/ICNP52444.2021.9651934

Liu, Qiang; Choi, Nakjung; Han, Tao (November 2021, IEEE 29th International Conference on Network Protocols (ICNP))

Network slicing is a promising technology that allows mobile network operators to efficiently serve various emerging use cases in 5G. It is challenging to optimize the utilization of network infrastructures while guaranteeing the performance of network slices according to service level agreements (SLAs). To solve this problem, we propose SafeSlicing, which introduces a new constraint-aware deep reinforcement learning (CaDRL) algorithm to learn the optimal resource orchestration policy in two steps, i.e., offline training in a simulated environment and online learning with the real network system. To optimize the resource orchestration, we incorporate the constraints on the statistical performance of slices in the reward function using Lagrangian multipliers and solve the Lagrangian relaxed problem via a policy network. To satisfy the constraints on the system capacity, we design a constraint network to map the latent actions generated from the policy network to the orchestration actions such that the total resources allocated to network slices do not exceed the system capacity. We prototype SafeSlicing on an end-to-end testbed developed using OpenAirInterface LTE, OpenDayLight-based SDN, and a CUDA GPU computing platform. The experimental results show that SafeSlicing reduces resource usage by more than 20% while meeting the SLAs of network slices, as compared with other solutions.
Full Text Available
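
The SafeSlicing abstract above names two mechanisms: a reward that folds performance constraints in through Lagrangian multipliers, and a mapping from latent actions to allocations that respect system capacity. The sketch below illustrates both ideas under loud assumptions. The function names, the dual-ascent update, and the softmax with an extra "unused" slot are choices made for this example; in the paper the mapping is a learned constraint network, which this fixed formula only stands in for.

```python
import math


def lagrangian_reward(utility, costs, limits, multipliers):
    """Utility minus multiplier-weighted constraint violations (cost - limit)."""
    penalty = sum(
        lam * (cost - limit)
        for lam, cost, limit in zip(multipliers, costs, limits)
    )
    return utility - penalty


def update_multipliers(multipliers, costs, limits, step=0.1):
    """Dual ascent: grow a multiplier while its constraint is violated.

    Multipliers are clipped at zero so satisfied constraints are never rewarded
    with a negative weight. The step size is a hypothetical default.
    """
    return [
        max(0.0, lam + step * (cost - limit))
        for lam, cost, limit in zip(multipliers, costs, limits)
    ]


def map_to_capacity(latent_actions, capacity):
    """Map unconstrained latent actions to allocations summing to < capacity.

    Softmax over the latent actions plus one implicit "unused" slot with
    logit 0, so the slices never consume the full capacity between them.
    """
    peak = max(max(latent_actions), 0.0)  # subtract the peak for numerical stability
    exps = [math.exp(a - peak) for a in latent_actions]
    unused = math.exp(0.0 - peak)
    total = sum(exps) + unused
    return [capacity * e / total for e in exps]
```

For example, `map_to_capacity([2.0, 0.5, -1.0], capacity=100.0)` returns three positive allocations whose sum stays below 100, whatever the latent values are.
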
3D Human Pose Estimation with Spatial and Temporal Transformers

https://doi.org/10.1109/ICCV48922.2021.01145

Zheng, Ce; Zhu, Sijie; Mendieta, Matias; Yang, Taojiannan; Chen, Chen; Ding, Zhengming (October 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV))

Full Text Available
Efficient unsupervised monocular depth estimation using attention guided generative adversarial network

https://doi.org/10.1007/s11554-021-01092-0

Bhattacharyya, Sumanta; Shen, Ju; Welch, Stephen; Chen, Chen (August 2021, Journal of Real-Time Image Processing)
Full Text Available
VIGOR: Cross-View Image Geo-localization beyond One-to-one Retrieval

https://doi.org/10.1109/CVPR46437.2021.00364

Zhu, Sijie; Yang, Taojiannan; Chen, Chen (June 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))

Full Text Available
