NSF PAR Search | NSF Public Access Repository

Inference latency prediction for CNNs on heterogeneous mobile devices and ML frameworks

https://doi.org/10.1016/j.peva.2024.102429

Li, Zhuojin; Paolieri, Marco; Golubchik, Leana (August 2024, Performance Evaluation)

Full Text Available
When Lyapunov Drift Based Queue Scheduling Meets Adversarial Bandit Learning

https://doi.org/10.1109/TNET.2024.3374755

Huang, Jiatai; Golubchik, Leana; Huang, Longbo (August 2024, IEEE/ACM Transactions on Networking)

Full Text Available
A Benchmark for ML Inference Latency on Mobile Devices

https://doi.org/10.1145/3642968.3654818

Li, Zhuojin; Paolieri, Marco; Golubchik, Leana (April 2024, ACM)

Full Text Available
Predicting Inference Latency of Neural Architectures on Mobile Devices

https://doi.org/10.1145/3578244.3583735

Li, Zhuojin; Paolieri, Marco; Golubchik, Leana (April 2023, International Conference on Performance Engineering)

Full Text Available
Defending against Poisoning Backdoor Attacks on Federated Meta-learning

https://doi.org/10.1145/3523062

Chen, Chien-Lun; Babakniya, Sara; Paolieri, Marco; Golubchik, Leana (October 2022, ACM Transactions on Intelligent Systems and Technology)

Federated learning allows multiple users to collaboratively train a shared classification model while preserving data privacy. This approach, where model updates are aggregated by a central server, was shown to be vulnerable to poisoning backdoor attacks: a malicious user can alter the shared model to arbitrarily classify specific inputs from a given class. In this article, we analyze the effects of backdoor attacks on federated meta-learning, where users train a model that can be adapted to different sets of output classes using only a few examples. While the ability to adapt could, in principle, make federated learning frameworks more robust to backdoor attacks (when new training examples are benign), we find that even one-shot attacks can be very successful and persist after additional training. To address these vulnerabilities, we propose a defense mechanism inspired by matching networks, where the class of an input is predicted from the similarity of its features with a support set of labeled examples. By removing the decision logic from the model shared with the federation, the success and persistence of backdoor attacks are greatly reduced.
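The similarity-based prediction mentioned in this abstract can be illustrated with a minimal sketch. This is not the authors' defense: the cosine similarity, the softmax weighting, and the names `cosine`, `predict`, and `support` are assumptions chosen for illustration, and the feature vectors are made up.

```python
import math
from collections import defaultdict


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def predict(query, support):
    """Predict a label for `query` from a support set of (features, label) pairs.

    Assumed scheme: softmax over cosine similarities, summed per label.
    """
    sims = [cosine(query, feats) for feats, _ in support]
    top = max(sims)
    weights = [math.exp(s - top) for s in sims]  # shift by max for stability
    total = sum(weights)
    scores = defaultdict(float)
    for w, (_, label) in zip(weights, support):
        scores[label] += w / total
    return max(scores, key=scores.get)


# Hypothetical support set: two labeled examples per class.
support = [
    ([1.0, 0.0], "cat"),
    ([0.9, 0.1], "cat"),
    ([0.0, 1.0], "dog"),
    ([0.1, 0.9], "dog"),
]
label = predict([0.8, 0.2], support)
```

In this sketch the decision depends only on the support set held by the user, so the feature extractor is the only component that would need to be shared.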
Full Text Available
Sojourn Time Minimization of Successful Jobs

https://doi.org/10.1145/3561074.3561083

Yao, Yuan; Paolieri, Marco; Golubchik, Leana (August 2022, ACM SIGMETRICS Performance Evaluation Review)

Due to a growing interest in deep learning applications [5], compute-intensive and long-running (hours to days) training jobs have become a significant component of datacenter workloads. A large fraction of these jobs is often exploratory, with the goal of determining the best model structure (e.g., the number of layers and channels in a convolutional neural network), hyperparameters (e.g., the learning rate), and data augmentation strategies for the target application. Notably, training jobs are often terminated early if their learning metrics (e.g., training and validation accuracy) are not converging, with only a few completing successfully. For this motivating application, we consider the problem of scheduling a set of jobs that can be terminated at predetermined checkpoints with known probabilities estimated from historical data. We prove that, in order to minimize the time to complete the first K successful jobs on a single server, optimal scheduling does not require preemption (even when preemption overhead is negligible) and provide an optimal policy; advantages of this policy are quantified through simulation.
Related Work. While job scheduling has been investigated extensively in many scenarios (see [6] and [2] for a survey of recent results), most policies require that the cost of waiting times of each job be known at scheduling time; in contrast, in our setting the scheduler does not know which job will be the K-th successful job, and sojourn times of subsequent jobs do not contribute to the target metric. For example, [4, 3] minimize makespan (i.e., the time to complete all jobs) for known execution times and waiting time costs; similarly, Gittins index [1] and SR rank [7] minimize expected sojourn time of all jobs, i.e., both successfully completed jobs and jobs terminated early. Unfortunately, scheduling policies not distinguishing between these two types of jobs may favor jobs where the next stage is short and leads to early termination with high probability, which is an undesirable outcome in our applications of interest.
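The setting described in this abstract can be explored with a small Monte Carlo sketch. It does not implement the optimal policy from the paper, which the abstract does not specify. It only estimates the time to the K-th successful job when jobs run without preemption in a given order. The stage model (a duration followed by a checkpoint that the job passes with a known probability) and the names `time_to_k_successes` and `estimate` are assumptions for illustration.

```python
import random


def time_to_k_successes(jobs, k, rng):
    """Run jobs one at a time, in list order, without preemption.

    Each job is a list of (duration, pass_prob) stages. After each stage the
    job continues with probability pass_prob, otherwise it is terminated.
    Returns the elapsed time when the k-th job succeeds, or None if fewer
    than k jobs succeed.
    """
    elapsed, successes = 0.0, 0
    for stages in jobs:
        survived = True
        for duration, pass_prob in stages:
            elapsed += duration
            if rng.random() >= pass_prob:  # checkpoint failed: terminate early
                survived = False
                break
        if survived:
            successes += 1
            if successes == k:
                return elapsed
    return None


def estimate(jobs, k, runs=10000, seed=0):
    """Return (mean time over runs that reached k successes, fraction of such runs)."""
    rng = random.Random(seed)
    samples = [time_to_k_successes(jobs, k, rng) for _ in range(runs)]
    done = [t for t in samples if t is not None]
    mean = sum(done) / len(done) if done else None
    return mean, len(done) / runs


# Hypothetical workload: three two-stage jobs with made-up durations and probabilities.
jobs = [
    [(1.0, 0.5), (4.0, 0.9)],
    [(2.0, 0.8), (3.0, 0.9)],
    [(1.0, 0.3), (5.0, 0.7)],
]
mean_time, completion_rate = estimate(jobs, k=1)
```

Calling `estimate` on different permutations of `jobs` gives a rough way to compare job orders under these assumptions.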
Full Text Available
Achieving Transparency Report Privacy in Linear Time

https://doi.org/10.1145/3460001

Chen, Chien-Lun; Golubchik, Leana; Pal, Ranjan (June 2022, Journal of Data and Information Quality)

An accountable algorithmic transparency report (ATR) should ideally investigate (a) transparency of the underlying algorithm, and (b) fairness of the algorithmic decisions, and at the same time preserve data subjects’ privacy. However, a provably formal study of the impact to data subjects’ privacy caused by the utility of releasing an ATR (that investigates transparency and fairness), has yet to be addressed in the literature. The far-fetched benefit of such a study lies in the methodical characterization of privacy-utility trade-offs for release of ATRs in public, and their consequential application-specific impact on the dimensions of society, politics, and economics. In this paper, we first investigate and demonstrate potential privacy hazards brought on by the deployment of transparency and fairness measures in released ATRs. To preserve data subjects’ privacy, we then propose a linear-time optimal-privacy scheme, built upon standard linear fractional programming (LFP) theory, for announcing ATRs, subject to constraints controlling the tolerance of privacy perturbation on the utility of transparency schemes. Subsequently, we quantify the privacy-utility trade-offs induced by our scheme, and analyze the impact of privacy perturbation on fairness measures in ATRs. To the best of our knowledge, this is the first analytical work that simultaneously addresses trade-offs between the triad of privacy, utility, and fairness, applicable to algorithmic transparency reports.
Full Text Available
Performance and Revenue Analysis of Hybrid Cloud Federations with QoS Requirements

https://doi.org/10.1109/CLOUD55607.2022.00055

Song, B.; Paolieri, M. (January 2022, IEEE Cloud 2022)

Full Text Available
Deep-n-Cheap: An Automated Efficient and Extensible Search Framework for Cost-Effective Deep Learning

https://doi.org/10.1007/s42979-021-00646-0

Dey, Sourya; Babakniya, Sara; Kanala, Saikrishna C.; Paolieri, Marco; Golubchik, Leana; Beerel, Peter A.; Chugg, Keith M. (July 2021, SN Computer Science)
Full Text Available
Are Federated Cloud Sharing Systems Sustainable?: On Dynamic Sharing Markets and Their Stability

https://doi.org/10.1109/TSUSC.2019.2955093

Pal, Ranjan; Lin, Sung-Han; Ahuja, Aditya; Jagadeesan, Nachikethas; Kumar, Abhishek; Golubchik, Leana (October 2020, IEEE Transactions on Sustainable Computing)
Full Text Available
