The enormous growth of AI computing has led to a surging demand for electricity. To stem the resulting energy cost and environmental impact, this paper explores opportunities enabled by the increasing hardware heterogeneity and introduces the concept of Geographical Server Relocation (GSR). Specifically, GSR physically balances the available AI servers across geographically distributed data centers subject to AI computing demand and power capacity constraints in each location. The key idea of GSR is to relocate older and less energy-efficient servers to regions with more renewables, better water efficiencies and/or lower electricity prices. Our case study demonstrates that, even with modest flexibility of relocation, GSR can substantially reduce the total operational environmental footprints and operation costs of AI computing. We conclude this paper by discussing major challenges of GSR, including service migration, software management, and algorithms.
more »
« less
Geographical Server Relocation: Opportunities and Challenges
The enormous growth of AI computing has led to a surging demand for electricity. To stem the resulting energy cost and environmental impact, this paper explores opportunities enabled by the increasing hardware heterogeneity and introduces the concept of Geographical Server Relocation (GSR). Specifically, GSRphysicallybalances the available AI servers across geographically distributed data centers subject to AI computing demand and power capacity constraints in each location. The key idea of GSR is to relocate older and less energy-efficient servers to regions with more renewables, better water efficiencies and/or lower electricity prices. Our case study demonstrates that, even with modest flexibility of relocation, GSR can substantially reduce the total operational environmental footprints and operation costs of AI computing. We conclude this paper by discussing major challenges of GSR, including service migration, software management, and algorithms.
more »
« less
- PAR ID:
- 10661264
- Publisher / Repository:
- ACM SIGEnergy Energy Informatics Review
- Date Published:
- Journal Name:
- ACM SIGEnergy Energy Informatics Review
- Volume:
- 4
- Issue:
- 5
- ISSN:
- 27705331
- Page Range / eLocation ID:
- 34-43
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Since the advent of widely accessible AI tools, AI technology has been in high demand by businesses, academic researchers and individuals. Technology companies are building AI infrastructure at a rapid pace, and these facilities consume vast and growing resources, particularly electricity and water, with significant real and projected climate impacts. There is a need for new research initiatives to support long time horizon efforts to develop energy efficient computing capabilities to support the continued growth of AI infrastructure in a sustainable fashion. Such efficiency is required at both the hardware and software levels. Where can industry turn for examples of ultra-low power, energy efficient computing? We argue here that neurobiological principles offer rich and under-exploited sources of inspiration for energy efficient NeuroAI, and that new partnerships between industry and academia should be developed in this directionmore » « less
-
Since the advent of widely accessible AI tools, AI technology has been in high demand by businesses, academic researchers and individuals. Technology companies are building AI infrastructure at a rapid pace, and these facilities consume vast and growing resources, particularly electricity and water, with significant real and projected climate impacts. There is a need for new research initiatives to support long time horizon efforts to develop energy efficient computing capabilities to support the continued growth of AI infrastructure in a sustainable fashion. Such efficiency is required at both the hardware and software levels. Where can industry turn for examples of ultra-low power, energy efficient computing? We argue here that neurobiological principles offer rich and under-exploited sources of inspiration for energy efficient NeuroAI, and that new partnerships between industry and academia should be developed in this directionmore » « less
-
Smart Servers, Smarter Speed Scaling: A Decentralized Algorithm for Data Center Efficiency A team of researchers from Georgia Tech and the University of Minnesota has introduced a cutting-edge algorithm designed to optimize energy use in large-scale data centers. As detailed in their paper “Distributed Rate Scaling in Large-Scale Service Systems,” the team developed a decentralized method allowing each server to adjust its processing speed autonomously without the need for communication or knowledge of system-wide traffic. The algorithm uses idle time as a local signal to guide processing speed, ensuring that all servers converge toward a globally optimal performance rate. This innovation addresses a critical issue in modern computing infrastructure: balancing energy efficiency with performance under uncertainty and scale. The authors demonstrate that their approach not only stabilizes the system but achieves asymptotic optimality as the number of servers increases. The work is poised to significantly reduce energy consumption in data centers, which are projected to account for up to 8% of U.S. electricity use by 2030.more » « less
-
Data centers' energy consumption is expected to rise significantly over the next five years due to the accelerated growth of AI and its computational demands. Simultaneously, initiatives are underway to mitigate the environmental impact of the energy usage by replacing brown energy sources (e.g., coal) with green energy sources (e.g., solar), resulting in a projected decrease in grid carbon intensity. Yet, the interplay between these two contrasting forces on data centers' carbon emissions — a function of both energy consumption and carbon intensity — has not received much attention. In this paper, we analyze the operational carbon emissions of data centers at this crossroads, by considering the increasing data center energy consumption and the decarbonization of the electricity grid. In particular, we integrate publicly available current and future energy projections, consider multiple future scenarios, and provide a 5-year projection of datacenter carbon emissions at the global, US, and state levels. Our analysis shows that over the next five years, the rate of data center demand growth will overshadow the rate of grid decarbonization, with global (resp. US) emissions projected to rise by 4.2× (resp. 4.1×) in the worst case by 2030. Moreover, we observe considerable regional differences within the US, where emissions in some states could increase by up to 3.4× by 2030.more » « less
An official website of the United States government

