NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Group Formation and Sampling in Group-Based Hierarchical Federated Learning

https://doi.org/10.1109/TCC.2024.3482865

Liu, Jiyao; Liu, Xuanzhang; Wei, Xinliang; Gao, Hongchang; Wang, Yu (October 2024, IEEE Transactions on Cloud Computing)

Full Text Available
Joint Participant and Learning Topology Selection for Federated Learning in Edge Clouds

https://doi.org/10.1109/TPDS.2024.3413751

Wei, Xinliang; Ye, Kejiang; Shi, Xinghua; Xu, Cheng-Zhong; Wang, Yu (August 2024, IEEE Transactions on Parallel and Distributed Systems)

Full Text Available
BPS: Batching, Pipelining, Surgeon of Continuous Deep Inference on Collaborative Edge Intelligence

https://doi.org/10.1109/TCC.2024.3399616

Hou, Xueyu; Guan, Yongjie; Choi, Nakjung; Han, Tao (July 2024, IEEE Transactions on Cloud Computing)

Users on edge generate deep inference requests continuously over time. Mobile/edge devices located near users can undertake the computation of inference locally for users, e.g., the embedded edge device on an autonomous vehicle. Due to limited computing resources on one mobile/edge device, it may be challenging to process the inference requests from users with high throughput. An attractive solution is to (partially) offload the computation to a remote device in the network. In this paper, we examine the existing inference execution solutions across local and remote devices and propose an adaptive scheduler, a BPS scheduler, for continuous deep inference on collaborative edge intelligence. By leveraging data parallel, neurosurgeon, reinforcement learning techniques, BPS can boost the overall inference performance by up to 8.2× over the baseline schedulers. A lightweight compressor, FF, specialized in compressing intermediate output data for neurosurgeon, is proposed and integrated into the BPS scheduler. FF exploits the operating character of convolutional layers and utilizes efficient approximation algorithms. Compared to existing compression methods, FF achieves up to 86.9% lower accuracy loss and up to 83.6% lower latency overhead.
more » « less
Full Text Available
Dystri: A Dynamic Inference based Distributed DNN Service Framework on Edge

https://doi.org/10.1145/3605573.3605598

Hou, Xueyu; Guan, Yongjie; Han, Tao (September 2023, ICPP '23: Proceedings of the 52nd International Conference on Parallel Processing)

Deep neural network (DNN) inference poses unique challenges in serving computational requests due to high request intensity, concurrent multi-user scenarios, and diverse heterogeneous service types. Simultaneously, mobile and edge devices provide users with enhanced computational capabilities, enabling them to utilize local resources for deep inference processing. Moreover, dynamic inference techniques allow content-based computational cost selection per request. This paper presents Dystri, an innovative framework devised to facilitate dynamic inference on distributed edge infrastructure, thereby accommodating multiple heterogeneous users. Dystri offers a broad applicability in practical environments, encompassing heterogeneous device types, DNN-based applications, and dynamic inference techniques, surpassing the state-of-the-art (SOTA) approaches. With distributed controllers and a global coordinator, Dystri allows per-request, per-user adjustments of quality-of-service, ensuring instantaneous, flexible, and discrete control. The decoupled workflows in Dystri naturally support user heterogeneity and scalability, addressing crucial aspects overlooked by existing SOTA works. Our evaluation involves three multi-user, heterogeneous DNN inference service platforms deployed on distributed edge infrastructure, encompassing seven DNN applications. Results show Dystri achieves near-zero deadline misses and excels in adapting to varying user numbers and request intensities. Dystri outperforms baselines with accuracy improvement up to 95 ×.
more » « less
Group-based Hierarchical Federated Learning: Convergence, Group Formation, and Sampling

https://doi.org/10.1145/3605573.3605584

Liu, Jiyao; Wei, Xinliang; Liu, Xuanzhang; Gao, Hongchang; Wang, Yu (August 2023, Proceedings of 52nd International Conference on Parallel Processing (ICPP 2023))
Joint Participant Selection and Learning Optimization for Federated Learning of Multiple Models in Edge Cloud

https://doi.org/10.1007/s11390-023-3074-4

Wei, Xinliang; Liu, Jiyao; Wang, Yu (July 2023, Journal of Computer Science and Technology)

Full Text Available
Quantum Assisted Scheduling Algorithm for Federated Learning in Distributed Networks

https://doi.org/10.1109/ICCCN58024.2023.10230094

Wei, Xinliang; Fan, Lei; Guo, Yuanxiong; Gong, Yanmin; Han, Zhu; Wang, Yu (July 2023, Proceedings of 32nd IEEE International Conference on Computer Communications and Networks (ICCCN 2023))

Full Text Available
Joint Optimization Across Timescales: Resource Placement and Task Dispatching in Edge Clouds

https://doi.org/10.1109/TCC.2021.3113605

Wei, Xinliang; Rahman, A B; Cheng, Dazhao; Wang, Yu (January 2023, IEEE Transactions on Cloud Computing)

Full Text Available
Popularity-Based Data Placement With Load Balancing in Edge Computing

https://doi.org/10.1109/TCC.2021.3096467

Wei, Xinliang; Wang, Yu (January 2023, IEEE Transactions on Cloud Computing)

Full Text Available
EdgeML: Towards network-accelerated federated learning over wireless edge

https://doi.org/10.1016/j.comnet.2022.109396

Pinyoanuntapong, Pinyarash; Janakaraj, Prabhu; Balakrishnan, Ravikumar; Lee, Minwoo; Chen, Chen; Wang, Pu (December 2022, Computer Networks)

Full Text Available

« Prev Next »

Search for: All records