NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Building Socially-Equitable Public Models

Liu, Yejia; Yang, Jianyi; Li, Pengfei; Li, Tongxin; Ren, Shaolei (July 2024, 2024 International Conference on Machine Learning (ICML))

Public models offer predictions to a variety of downstream tasks and have played a crucial role in various AI applications, showcasing their proficiency in accurate predictions. However, the exclusive emphasis on prediction accuracy may not align with the diverse end objectives of downstream agents. Recognizing the public model's predictions as a service, we advocate for integrating the objectives of downstream agents into the optimization process. Concretely, to address performance disparities and foster fairness among heterogeneous agents in training, we propose a novel Equitable Objective. This objective, coupled with a policy gradient algorithm, is crafted to train the public model to produce a more equitable/uniform performance distribution across downstream agents, each with their unique concerns. Both theoretical analysis and empirical case studies have proven the effectiveness of our method in advancing performance equity across diverse downstream agents utilizing the public model for their decision-making.
more » « less
Full Text Available
Towards Environmentally Equitable AI via Geographical Load Balancing

https://doi.org/10.1145/3632775.3661938

Li, Pengfei; Yang, Jianyi; Wierman, Adam; Ren, Shaolei (May 2024, ACM International Conference on Future and Sustainable Energy Systems (e-Energy) 2024)

Fueled by the soaring popularity of foundation models, the accelerated growth of artificial intelligence (AI) models’ enormous environmental footprint has come under increased scrutiny. While many approaches have been proposed to make AI more energy-efficient and environmentally friendly, environmental inequity — the fact that AI’s environmental footprint can be disproportionately higher in certain regions than in others — has emerged, raising social-ecological justice concerns. This paper takes a first step toward addressing AI’s environmental inequity by fairly balancing its regional environmental impact. Concretely, we focus on the carbon and water footprints of AI model inference and propose equity-aware geographical load balancing (eGLB) to explicitly minimize AI’s highest environmental cost across all the regions. The consideration of environmental equity creates substantial algorithmic challenges as the optimal GLB decisions require complete offline information that is lacking practice. To address the challenges, we introduce auxiliary variables and optimize GLB decisions online based on dual mirror descent. In addition to analyzing the performance of eGLB theoretically, we run trace-based empirical simulations by considering a set of geographically distributed data centers that serve inference requests for a large language AI model. The results demonstrate that existing GLB approaches may amplify environmental inequity while eGLB can significantly reduce the regional disparity in terms of carbon and water footprints.
more » « less
Full Text Available
Online Allocation with Replenishable Budgets: Worst Case and Beyond

https://doi.org/10.1145/3639030

Yang, Jianyi; Li, Pengfei; Islam, Mohammad Jaminur; Ren, Shaolei (February 2024, Proceedings of the ACM on Measurement and Analysis of Computing Systems)

This paper studies online resource allocation with replenishable budgets, where budgets can be replenished on top of the initial budget and an agent sequentially chooses online allocation decisions without violating the available budget constraint at each round. We propose a novel online algorithm, called OACP (Opportunistic Allocation with Conservative Pricing), that conservatively adjusts dual variables while opportunistically utilizing available resources. OACP achieves a bounded asymptotic competitive ratio in adversarial settings as the number of decision rounds T gets large. Importantly, the asymptotic competitive ratio of OACP is optimal in the absence of additional assumptions on budget replenishment. To further improve the competitive ratio, we make a mild assumption that there is budget replenishment every T* ≥ 1 decision rounds and propose OACP+ to dynamically adjust the total budget assignment for online allocation. Next, we move beyond the worst-case and propose LA-OACP (Learning-Augmented OACP/OACP+), a novel learning-augmented algorithm for online allocation with replenishable budgets. We prove that LA-OACP can improve the average utility compared to OACP/OACP+ when the ML predictor is properly trained, while still offering worst-case utility guarantees when the ML predictions are arbitrarily wrong. Finally, we run simulation studies of sustainable AI inference powered by renewables, validating our analysis and demonstrating the empirical benefits of LA-OACP.
more » « less
Full Text Available
Anytime-Competitive Reinforcement Learning with Policy Prior

Yang, Jianyi; Li, Pengfei; Li, Tongxin; Wierman, Adam; Ren, Shaolei (December 2023, International Conference on Neural Information Processing Systems (NeurIPS))

Full Text Available
Learning-Assisted Algorithm Unrolling for Online Optimization with Budget Constraints

https://doi.org/10.1609/aaai.v37i9.26278

Yang, Jianyi; Ren, Shaolei (June 2023, Proceedings of the AAAI Conference on Artificial Intelligence)

Online optimization with multiple budget constraints is challenging since the online decisions over a short time horizon are coupled together by strict inventory constraints. The existing manually-designed algorithms cannot achieve satisfactory average performance for this setting because they often need a large number of time steps for convergence and/or may violate the inventory constraints. In this paper, we propose a new machine learning (ML) assisted unrolling approach, called LAAU (Learning-Assisted Algorithm Unrolling), which unrolls the agent’s online decision pipeline and leverages an ML model for updating the Lagrangian multiplier online. For efficient training via backpropagation, we derive gradients of the decision pipeline over time. We also provide the average cost bounds for two cases when training data is available offline and collected online, respectively. Finally, we present numerical results to highlight that LAAU can outperform the existing baselines.
more » « less
Full Text Available
Achieving Certified Robustness for Brain-Inspired Low-Dimensional Computing Classifiers

https://doi.org/10.1109/INFOCOMWKSHPS57453.2023.10225774

Yang, Fangfang; Duan, Shijin; Xu, Xiaolin; Ren, Shaolei (May 2023, IEEE INFOCOM 2023 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS))

Full Text Available
Robustified Learning for Online Optimization with Memory Costs

https://doi.org/10.1109/INFOCOM53939.2023.10228998

Li, Pengfei; Yang, Jianyi; Ren, Shaolei (May 2023, IEEE Conference on Computer Communications)

Full Text Available
Learning for Edge-Weighted Online Bipartite Matching with Robustness Guarantees

Pengfei Li, Jianyi Yang (January 2023, International Conference on Machine Learning)

Many problems, such as online ad display, can be formulated as online bipartite matching. The crucial challenge lies in the nature of sequentially-revealed online item information, based on which we make irreversible matching decisions at each step. While numerous expert online algorithms have been proposed with bounded worst-case competitive ratios, they may not offer satisfactory performance in average cases. On the other hand, reinforcement learning (RL) has been applied to improve the average performance, but it lacks robustness and can perform arbitrarily poorly. In this paper, we propose a novel RL-based approach to edge-weighted online bipartite matching with robustness guarantees (LOMAR), achieving both good average-case and worst-case performance. The key novelty of LOMAR is a new online switching operation which, based on a judicious condition to hedge against future uncertainties, decides whether to follow the expert’s decision or the RL decision for each online item. We prove that for any rho in [0,1], LOMAR is rho-competitive against any given expert online algorithm. To improve the average performance, we train the RL policy by explicitly considering the online switching operation. Finally, we run empirical experiments to demonstrate the advantages of LOMAR compared to existing baselines.
more » « less
Full Text Available
Informed Learning by Wide Neural Networks: Convergence, Generalization and Sampling Complexity

Yang, Jianyi; Ren, Shaolei (July 2022, Proceedings of the 39th International Conference on Machine Learning)

By integrating domain knowledge with labeled samples, informed machine learning has been emerging to improve the learning performance for a wide range of applications. Nonetheless, rigorous understanding of the role of injected domain knowledge has been under-explored. In this paper, we consider an informed deep neural network (DNN) with over-parameterization and domain knowledge integrated into its training objective function, and study how and why domain knowledge benefits the performance. Concretely, we quantitatively demonstrate the two benefits of domain knowledge in informed learning --- regularizing the label-based supervision and supplementing the labeled samples --- and reveal the trade-off between label and knowledge imperfectness in the bound of the population risk. Based on the theoretical analysis, we propose a generalized informed training objective to better exploit the benefits of knowledge and balance the label and knowledge imperfectness, which is validated by the population risk bound. Our analysis on sampling complexity sheds lights on how to choose the hyper-parameters for informed learning, and further justifies the advantages of knowledge informed learning.
more » « less
Full Text Available
Improving QoE of Deep Neural Network Inference on Edge Devices: A Bandit Approach

https://doi.org/10.1109/JIOT.2022.3182728

Lu, Bingqian; Yang, Jianyi; Xu, Jie; Ren, Shaolei (June 2022, IEEE Internet of Things Journal)

Full Text Available

« Prev Next »

Search for: All records