There are various designs for segregating hot and cold air in data centers such as cold aisle containment (CAC), hot aisle containment (HAC), and chimney exhaust rack. These containment systems have different characteristics and impose various conditions on the information technology equipment (ITE). One common issue in HAC systems is the pressure buildup inside the HAC (known as backpressure). Backpressure also can be present in CAC systems in case of airflow imbalances. Hot air recirculation, limited cooling airflow rate in servers, and reversed flow through ITE with weaker fan systems (e.g. network switches) are some known consequences of backpressure. Currently there is a lack of experimental data on the interdependency between overall performance of ITE and its internal design when a backpressure is imposed on ITE. In this paper, three commercial 2-rack unit (RU) servers with different internal designs from various generations and performance levels are tested and analyzed under various environmental conditions. Smoke tests and thermal imaging are implemented to study the airflow patterns inside the tested equipment. In addition, the impact leak of hot air into ITE on the fan speed and the power consumption of ITE is studied. Furthermore, the cause of the discrepancy between measured inlet temperatures by internal intelligent platform management interface (IPMI) and external sensors is investigated. It is found that arrangement of fans, segregation of space upstream and downstream of fans, leakage paths, location of sensors of baseboard management controller (BMC) and presence of backpressure can have a significant impact on ITE power and cooling efficiency.
more »
« less
Impact of Fans Location on the Cooling Efficiency of IT Servers
The role of data centers in modern life has expanded rapidly over the past decades. In addition, this expansion has resulted in a significant increase in the share of data centers in total energy consumption of the world. Thus, reliability and energy efficiency have become a common concern in data centers. Information technology equipment (ITE) and cooling infrastructure are the largest power consumer in data centers. The cooling power in a data center depends on the amount of heat dissipated by ITE. Therefore, the thermal design of the ITE impacts not only the ITE power but also affects the infrastructure power and has a significant role in the overall efficiency of data centers. This paper studies the impact of fans location and airflow balancing on the thermal performance and power of a server. A detailed computational fluid dynamic (CFD) model of the server is built, calibrated and validated using experimental test results. Next, impacts of moving fans to the rear side of the chassis on the flow rate and temperature of components are investigated. Special attention is given to controlling airflow through power supplies.
more »
« less
- Award ID(s):
- 1738793
- PAR ID:
- 10095742
- Date Published:
- Journal Name:
- ITHERM
- ISSN:
- 1936-3958
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract The practice of commissioning data centers (DCs) is necessary to confirm the compliance of the cooling system to the information technology equipment (ITE) load (design capacity). In a typical DC, there are different types of ITE, each having its physical characteristics. Considering these geometrical and internal differences among ITE, it is infeasible to use the actual ITE as a self-simulator. Hence, a separate device called load bank is employed for that purpose. Load banks create a dummy thermal load to analyze, test, and stress the cooling infrastructure. Available commercial load banks do not accurately replicate a server's airflow patterns and transient heat signatures which are governed by thermal inertia, energy dissipation, flow resistance, and fan system behavior. In this study, a novel prototype of the server called server simulator was designed and built with different components to be used as a server mockup. The server simulator accurately captured air resistance, heat dissipation, and the functionality of actual server behavior. Experimental data showed up to 93% improvement in ITE passive and active flow curves using the designed server simulator compared to the commercial load bank. Furthermore, the experimental results demonstrated a below 5% discrepancy on the critical back pressure and free delivery point between the actual ITE and the designed server simulator. In addition, experimental data indicated that the developed server simulator improved the actual ITE thermal mass by 27% compared to the commercial load bank.more » « less
-
Immersion cooling has emerged as a promising solution for the escalating thermal management challenges in the contemporary and modern (next generation) data centers, where traditional air-cooling systems are increasingly inadequate due to rising power densities in microprocessors. The evolution of immersion cooling technologies, highlighting their benefits and the challenges associated with their implementation, is explored in this review. Two-phase microchannel cooling has often been cited as a highly efficient alternative, which can help achieve significant energy savings over air cooling. Subsequent studies have expanded on methods like refrigerant touch cooling, thermosiphon systems, and geothermal immersion cooling. All these strategies can help achieve enhanced energy efficiency, reduced operational costs, and improved Power Usage Effectiveness (PUE). Immersion cooling has been demonstrated to support denser CPU packaging and meet the demands of high-performance computing environments. Despite these advantages, challenges persist, including the need for specialized infrastructure, potential risks related to liquid handling, and integration with existing systems. Advances in environmentally friendly cooling fluids and optimized airflow management have begun to mitigate some of these issues. To fully realize the potential of immersion cooling, future research endeavors should focus on developing standardized protocols and best practices to facilitate its widespread adoption. Enhancing the compatibility of cooling fluids with a broader range of hardware components will be crucial, as will designing systems that are easier to integrate with existing data center infrastructure. Exploring hybrid cooling solutions that combine immersion cooling with other efficient methods could offer additional benefits. Further investigation into the long-term reliability, maintenance requirements, and environmental impacts of immersion-cooled systems is essential. Integrating immersion cooling with renewable energy sources and waste heat recovery systems could also enhance sustainability and operational efficiency (e.g., for thermal desalination and wood drying applications). The literature reports can be utilized to identify a clear trend toward adopting immersion cooling as a key strategy for improving energy efficiency and thermal management in data centers. Hence, ongoing research and development efforts need to be redirected to overcoming these remaining obstacles, thus paving the way for more energy efficient data center operations with reduced footprint for water and power consumption in the future; while also improving the sustainability, reliability, robustness, and resilience of these platforms.more » « less
-
Characterization of an Isolated Hybrid Cooled Server With Failure Scenarios Using Warm Water CoolingModern day data centers are operated at high power for increased power density, maintenance, and cooling which covers almost 2 percent (70 billion kilowatt-hours) of the total energy consumption in the US. IT components and cooling system occupy the major portion of this energy consumption. Although data centers are designed to perform efficiently, cooling the high-density components is still a challenge. So, alternative methods to improve the cooling efficiency has become the drive to reduce the cooling cost. As liquid cooling is more efficient for high specific heat capacity, density, and thermal conductivity, hybrid cooling can offer the advantage of liquid cooling of high heat generating components in the traditional air-cooled servers. In this experiment, a 1U server is equipped with cold plate to cool the CPUs while the rest of the components are cooled by fans. In this study, predictive fan and pump failure analysis are performed which also helps to explore the options for redundancy and to reduce the cooling cost by improving cooling efficiency. Redundancy requires the knowledge of planned and unplanned system failures. As the main heat generating components are cooled by liquid, warm water cooling can be employed to observe the effects of raised inlet conditions in a hybrid cooled server with failure scenarios. The ASHRAE guidance class W4 for liquid cooling is chosen for our experiment to operate in a range from 25°C – 45°C. The experiments are conducted separately for the pump and fan failure scenarios. Computational load of idle, 10%, 30%, 50%, 70% and 98% are applied while powering only one pump and the miniature dry cooler fans are controlled externally to maintain constant inlet temperature of the coolant. As the rest of components such as DIMMs & PCH are cooled by air, maximum utilization for memory is applied while reducing the number fans in each case for fan failure scenario. The components temperatures and power consumption are recorded in each case for performance analysismore » « less
-
Over the past few years, there has been an ever increasing rise in energy consumption by IT equipment in Data Centers. Thus, the need to minimize the environmental impact of Data Centers by optimizing energy consumption and material use is increasing. In 2011, the Open Compute Project was started which was aimed at sharing specifications and best practices with the community for highly energy efficient and economical data centers. The first Open Compute Server was the ‘ Freedom’ Server. It was a vanity free design and was completely custom designed using minimum number of components and was deployed in a data center in Prineville, Oregon. Within the first few months of operation, considerable amount of energy and cost savings were observed. Since then, progressive generations of Open Compute servers have been introduced. Initially, the servers used for compute purposes mainly had a 2 socket architecture. In 2015, the Yosemite Open Compute Server was introduced which was suited for higher compute capacity. Yosemite has a system on a chip architecture having four CPUs per sled providing a significant improvement in performance per watt over the previous generations. This study mainly focuses on air flow optimization in Yosemite platform to improve its overall cooling performance. Commercially available CFD tools have made it possible to do the thermal modeling of these servers and predict their efficiency. A detailed server model is generated using a CFD tool and its optimization has been done to improve the air flow characteristics in the server. Thermal model of the improved design is compared to the existing design to show the impact of air flow optimization on flow rates and flow speeds which in turn affects CPU die temperatures and cooling power consumption and thus, impacting the overall cooling performance of the Yosemite platform. Emphasis is given on effective utilization of fans in the server as compared to the original design and improving air flow characteristics inside the server via improved ducting.more » « less
An official website of the United States government

