NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

TaPS: A Performance Evaluation Suite for Task-based Execution Frameworks

https://doi.org/10.1109/e-Science62913.2024.10678702

Pauloski, J Gregory; Hayot-Sasson, Valerie; Gonthier, Maxime; Hudson, Nathaniel; Pan, Haochen; Zhou, Sicheng; Foster, Ian; Chard, Kyle (September 2024, IEEE)

Full Text Available
An Empirical Investigation of Container Building Strategies and Warm Times to Reduce Cold Starts in Scientific Computing Serverless Functions

https://doi.org/10.1109/e-Science62913.2024.10678668

Bauer, André; Gonthier, Maxime; Pan, Haochen; Chard, Ryan; Grzenda, Daniel; Straesser, Martin; Pauloski, J Gregory; Kamatar, Alok; Baughman, Matt; Hudson, Nathaniel; et al (September 2024, IEEE)

Full Text Available
Tournament-Based Pretraining to Accelerate Federated Learning

https://doi.org/10.1145/3624062.3626089

Baughman, Matt; Hudson, Nathaniel; Chard, Ryan; Bauer, Andre; Foster, Ian; Chard, Kyle (November 2023, SC'23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis)
Balancing Federated Learning Trade-Offs for Heterogeneous Environments

https://doi.org/10.1109/PerComWorkshops56833.2023.10150228

Baughman, Matt; Hudson, Nathaniel; Foster, Ian; Chard, Kyle (March 2023, IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events (PerCom Workshops))
FLoX: Federated Learning with FaaS at the Edge

https://doi.org/10.1109/eScience55777.2022.00016

Kotsehub, Nikita; Baughman, Matt; Chard, Ryan; Hudson, Nathaniel; Patros, Panos; Rana, Omer; Foster, Ian; Chard, Kyle (October 2022, 18th International Conference on e-Science (e-Science))

Full Text Available
Communication-Loss Trade-Off in Federated Learning: A Distributed Client Selection Algorithm

https://doi.org/10.1109/CCNC49033.2022.9700601

Hosseinzadeh, Minoo; Hudson, Nathaniel; Heshmati, Sam; Khamfroush, Hana (January 2022, IEEE Communications & Networking (CCNC) 2022 - Workshop on Secure Function Chaining and Federated AI (SONATAI))

Mass data generation occurring in the Internet- of-Things (IoT) requires processing to extract meaningful in- formation. Deep learning is commonly used to perform such processing. However, due to the sensitive nature of these data, it is important to consider data privacy. As such, federated learning (FL) has been proposed to address this issue. FL pushes training to the client devices and tasks a central server with aggregating collected model weights to update a global model. However, the transmission of these model weights can be costly, gradually. The trade-off between communicating model weights for aggregation and the loss provided by the global model remains an open problem. In this work, we cast this trade-off problem of client selection in FL as an optimization problem. We then design a Distributed Client Selection (DCS) algorithm that allows client devices to decide to participate in aggregation in hopes of minimizing overall communication cost — while maintaining low loss. We evaluate the performance of our proposed client selection algorithm against standard FL and a state-of-the-art client selection algorithm, called Power-of-Choice (PoC), using CIFAR-10, FMNIST, and MNIST datasets. Our experimental results confirm that our DCS algorithm is able to closely match the loss provided by the standard FL and PoC, while on average reducing the overall communication cost by nearly 32.67% and 44.71% in comparison to standard FL and PoC, respectively.
more » « less
Full Text Available
Joint Compression and Offloading Decisions for Deep Learning Services in 3-Tier Edge Systems

https://doi.org/10.1109/DySPAN53946.2021.9677398

Hosseinzadeh, Minoo; Hudson, Nathaniel; Zhao, Xiaobo; Khamfroush, Hana; Lucani, Daniel E. (December 2021, 2021 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN))

Task offloading in edge computing infrastructure remains a challenge for dynamic and complex environments, such as Industrial Internet-of-Things. The hardware resource constraints of edge servers must be explicitly considered to ensure that system resources are not overloaded. Many works have studied task offloading while focusing primarily on ensuring system resilience. However, in the face of deep learning-based services, model performance with respect to loss/accuracy must also be considered. Deep learning services with different implementations may provide varying amounts of loss/accuracy while also being more complex to run inference on. That said, communication latency can be reduced to improve overall Quality-of-Service by employing compression techniques. However, such techniques can also have the side-effect of reducing the loss/accuracy provided by deep learning-based service. As such, this work studies a joint optimization problem for task offloading decisions in 3-tier edge computing platforms where decisions regarding task offloading are made in tandem with compression decisions. The objective is to optimally offload requests with compression such that the trade-off between latency-accuracy is not greatly jeopardized. We cast this problem as a mixed integer nonlinear program. Due to its nonlinear nature, we then decompose it into separate subproblems for offloading and compression. An efficient algorithm is proposed to solve the problem. Empirically, we show that our algorithm attains roughly a 0.958-approximation of the optimal solution provided by a block coordinate descent method for solving the two sub-problems back-to-back.
more » « less
Full Text Available
QoS-Aware Placement of Deep Learning Services on the Edge with Multiple Service Implementations

https://doi.org/10.1109/ICCCN52240.2021.9522156

Hudson, Nathaniel; Khamfroush, Hana; Lucani, Daniel E. (July 2021, IEEE ICCCN Big Data and Machine Learning for Networking (BDMLN) Workshop • 2021)

Mobile edge computing pushes computationally-intensive services closer to the user to provide reduced delay due to physical proximity. This has led many to consider deploying deep learning models on the edge – commonly known as edge intelligence (EI). EI services can have many model implementations that provide different QoS. For instance, one model can perform inference faster than another (thus reducing latency) while achieving less accuracy when evaluated. In this paper, we study joint service placement and model scheduling of EI services with the goal to maximize Quality-of-Servcice (QoS) for end users where EI services have multiple implementations to serve user requests, each with varying costs and QoS benefits. We cast the problem as an integer linear program and prove that it is NP-hard. We then prove the objective is equivalent to maximizing a monotone increasing, submodular set function and thus can be solved greedily while maintaining a (1 – 1/e)-approximation guarantee. We then propose two greedy algorithms: one that theoretically guarantees this approximation and another that empirically matches its performance with greater efficiency. Finally, we thoroughly evaluate the proposed algorithm for making placement and scheduling decisions in both synthetic and real-world scenarios against the optimal solution and some baselines. In the real-world case, we consider real machine learning models using the ImageNet 2012 data-set for requests. Our numerical experiments empirically show that our more efficient greedy algorithm is able to approximate the optimal solution with a 0.904 approximation on average, while the next closest baseline achieves a 0.607 approximation on average.
more » « less
Full Text Available

Search for: All records