NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

TailClipper: Reducing Tail Response Time of Distributed Services Through System-Wide Scheduling

https://doi.org/10.1145/3698038.3698554

Ng, Nathan; Souza, Abel; Ali-Eldin, Ahmed; Irwin, David; Towsley, Don; Shenoy, Prashant (November 2024, ACM)

Reducing tail latency has become a crucial issue for optimizing the performance of online cloud services and distributed applications. In distributed applications, there are many causes of high end-to-end tail latency, including operating system delays, request re-ordering due to fan-out/fanin, and network congestion. Although recent research has focused on reducing tail latency for individual application components, such as by replicating requests and scheduling, in this paper, we argue for a holistic approach for reducing the end-to-end tail latency across application components. We propose TailClipper, a distributed scheduler that tags each arriving request with an arrival timestamp, and propagates it across the microservices' call chain. TailClipper then uses arrival timestamps to implement an oldest request first scheduler that combines global first-come first serve with a limited form of processor sharing to reduce end-to-end tail latency. In doing so, TailClipper can counter the performance degradation caused by request reordering in multi-tiered and microservices-based applications. We implement TailClipper as a userspace Linux scheduler and evaluate it using cloud workload traces and a real-world microservices application. Compared to state-of-the-art schedulers, our experiments reveal that TailClipper improves the 99th percentile response time by up to 81%, while also improving the mean response time and the system throughput by up to 54% and 29% respectively under high loads.
more » « less
Free, publicly-accessible full text available November 20, 2025
Collaborative Inference in Resource-Constrained Edge Networks: Challenges and Opportunities

https://doi.org/10.1109/MILCOM61039.2024.10773876

Ng, Nathan; Souza, Abel; Diggavi, Suhas; Suri, Niranjan; Abdelzaher, Tarek; Towsley, Don; Shenoy, Prashant (October 2024, IEEE)

Many IoT applications have increasingly adopted machine learning (ML) techniques, such as classification and detection, to enhance automation and decision-making processes. With advances in hardware accelerators such as Nvidia’s Jetson embedded GPUs, the computational capabilities of end devices, particularly for ML inference workloads, have significantly improved in recent years. These advances have opened opportunities for distributing computation across the edge network, enabling optimal resource utilization and reducing request latency. Previous research has demonstrated promising results in collaborative inference, where processing units in the edge network, such as end devices and edge servers, collaboratively execute an inference request to minimize latency.This paper explores approaches for implementing collaborative inference on a single model in resource-constrained edge networks, including on-device, device-edge, and edge-edge collaboration. We present preliminary results from proof-of-concept experiments to support each case. We discuss dynamic factors that can impact the performance of these inference execution strategies, such as network variability, thermal constraints, and workload fluctuations. Finally, we outline potential directions for future research.
more » « less
Full Text Available
Succinate in the tumor microenvironment affects tumor growth and modulates tumor associated macrophages

https://doi.org/10.1016/j.biomaterials.2023.122292

Inamdar, Sahil; Suresh, Abhirami P.; Mangal, Joslyn L.; Ng, Nathan D.; Sundem, Alison; Behbahani, Hoda Shokrollahzadeh; Rubino, Thomas E.; Yaron, Jordan R.; Khodaei, Taravat; Green, Matthew; et al (October 2023, Biomaterials)

Full Text Available
Succinate based polymers drive immunometabolism in dendritic cells to generate cancer immunotherapy

https://doi.org/10.1016/j.jconrel.2023.05.014

Inamdar, Sahil; Suresh, Abhirami P.; Mangal, Joslyn L.; Ng, Nathan D.; Sundem, Alison; Behbahani, Hoda Shokrollahzadeh; Rubino, Thomas E.; Shi, Xiaojian; Loa, Sharon T.; Yaron, Jordan R.; et al (June 2023, Journal of Controlled Release)

Full Text Available
Localization dynamics in a centrally coupled system

https://doi.org/10.1103/PhysRevB.103.134201

Ng, Nathan; Wenderoth, Sebastian; Seelam, Rajagopala Reddy; Rabani, Eran; Meyer, Hans-Dieter; Thoss, Michael; Kolodrubetz, Michael (April 2021, Physical Review B)
null (Ed.)
Full Text Available

Search for: All records