

This content will become publicly available on December 1, 2025

Title: Methodology to Characterize Row Manifolds for High Power Direct to Chip Liquid Cooling Data Centers
Abstract Demand is growing for dense, high-performing IT computing capacity to support artificial intelligence, deep learning, machine learning, autonomous cars, the Internet of Things, and similar workloads. This demand has led to unprecedented growth in transistor density for high-end CPUs and GPUs, pushing thermal design power (TDP) above 700 watts for some existing NVIDIA GPUs. Cooling these high-TDP chips with air comes at the cost of a larger server form factor and server fan noise close to the permissible limit. Direct-to-chip cold plate-based liquid cooling is highly efficient and is becoming more reliable as the technology advances. Deploying it in liquid-cooled data centers requires several components, including cooling loops, rack manifolds, CDUs, row manifolds, quick disconnects, and flow control valves. Row manifolds distribute secondary coolant to the rack manifolds. Characterizing them to understand pressure drops and flow distribution is important for better data center design and energy efficiency. This paper develops a methodology to characterize row manifolds. A water-based coolant, 25% propylene glycol, was used, and the experiments were conducted at a coolant supply temperature of 21 °C. P-Q curves were generated for two six-port row manifolds, and the supply pressure and flow rate were measured at each port. The experimental results were validated using flow network modeling (FNM), a technique that uses overall flow and thermal characteristics to represent the behavior of individual components.
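As a rough illustration of the two ideas named in the abstract (a P-Q curve and a flow network), the sketch below fits a quadratic pressure-drop law to measured points and then splits a total flow among parallel ports. It is a minimal sketch under stated assumptions, not the paper's FNM model: the quadratic form dP = k * Q^2, the neglect of header losses between ports, the function names, and the units are all assumptions introduced here, and any numbers passed in are placeholders rather than data from the study.

```python
import math


def fit_quadratic_pq(flows, drops):
    """Least-squares fit of dP = k * Q**2 (no intercept); returns k.

    Assumption: pressure drop scales with the square of flow rate,
    a common simplification for turbulent flow through fittings.
    """
    num = sum(q ** 2 * dp for q, dp in zip(flows, drops))
    den = sum(q ** 4 for q in flows)
    return num / den


def split_parallel_flow(total_flow, branch_k):
    """Split a total flow among parallel branches that share one pressure drop.

    Each branch i obeys dP = k_i * Q_i**2, so Q_i = sqrt(dP / k_i).
    Assumption: losses along the manifold header between ports are ignored.
    Returns (shared pressure drop, list of branch flows).
    """
    conductance = [1.0 / math.sqrt(k) for k in branch_k]
    dp = (total_flow / sum(conductance)) ** 2
    flows = [c * math.sqrt(dp) for c in conductance]
    return dp, flows


if __name__ == "__main__":
    # Hypothetical points, not measurements from the paper.
    k = fit_quadratic_pq([1.0, 2.0, 3.0], [2.0, 8.0, 18.0])
    dp, port_flows = split_parallel_flow(60.0, [k] * 6)
    print(k, dp, port_flows)
```

With identical branch coefficients the flow divides evenly across the six ports; unequal coefficients shift flow toward the less restrictive ports, which is the kind of maldistribution a row manifold characterization is meant to expose.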
Award ID(s):
2209751
PAR ID:
10537457
Publisher / Repository:
American Society of Mechanical Engineers
Date Published:
Journal Name:
Journal of Electronic Packaging
Volume:
146
Issue:
4
ISSN:
1043-7398
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Due to the increasing computational demand driven by artificial intelligence, machine learning, and the Internet of Things (IoT), there has been an unprecedented growth in transistor density for high-end CPUs and GPUs. This growth has resulted in high thermal dissipation power (TDP) and high heat flux, necessitating the adoption of advanced cooling technologies to minimize thermal resistance and optimize cooling efficiency. Among these technologies, direct-to-chip cold plate-based liquid cooling has emerged as a preferred choice in electronics cooling due to its efficiency and cost-effectiveness. In this context, different types of single-phase liquid coolants, such as propylene glycol (PG), ethylene glycol (EG), DI water, treated water, and nanofluids, have been utilized in the market. These coolants, manufactured by different companies, incorporate various inhibitors and chemicals to enhance long-term performance, prevent biogrowth, and provide corrosion resistance. However, the additives used in these coolants can impact their thermal performance, even when the base coolant is the same. This paper aims to compare these coolant types and evaluate the performance of the same coolant from different vendors. The selection of coolants in this study is based on their performance, compatibility with wetted materials, reliability during extended operation, and environmental impact, following the guidelines set by ASHRAE. To conduct the experiments, a single cold plate-based benchtop setup was constructed, utilizing a thermal test vehicle (TTV), pump, reservoir, flow sensor, pressure sensors, thermocouple, data acquisition units, and heat exchanger. Each coolant was tested using a dedicated cold plate, and thorough cleaning procedures were carried out before each experiment. 
The experiments were conducted under consistent boundary conditions, with a TTV power of 1000 watts, coolant flow rates ranging from 0.5 lpm to 2 lpm, and supply coolant temperatures of 17°C, 25°C, 35°C, and 45°C, simulating warm water cooling. The thermal resistance (Rth) versus flow rate and pressure drop (ΔP) versus flow rate graphs were obtained for each coolant, and the impact of different supply coolant temperatures on pressure drop was characterized. The data collected from this study will be utilized to calculate the Total Cost of Ownership (TCO) in future research, providing insights into the impact of coolant selection at the data center level. There is limited research available on the reliability of coolants used in direct-to-chip liquid cooling, and there is currently no standardized methodology for testing their reliability. This study aims to fill this gap by focusing on the reliability of coolants, specifically propylene glycol at a concentration of 25%. To analyze the effectiveness of corrosion inhibitors in these coolants, the ASTM D1384 apparatus, typically used for testing engine coolant corrosion inhibitors on metal samples in controlled laboratory settings, was employed. The setup involved immersing samples of wetted materials (copper, solder-coated brass, brass, steel, cast iron, and cast aluminum) in separate jars containing inhibited propylene glycol solutions from different vendors. This test will determine the reliability difference between the same inhibited solutions from different vendors. 
  2. Abstract The increasing demand for high-performance computing in applications such as the Internet of Things, deep learning, big data for crypto-mining, virtual reality, and healthcare research on genomic sequencing and cancer treatment has led to the growth of hyperscale data centers. To meet the cooling energy demands of HPC data centers, efficient cooling technologies must be adopted. Traditional air cooling, direct-to-chip liquid cooling, and immersion are some of those methods. Among these, liquid cooling is superior to the various air-cooling methods in terms of energy consumption. Direct on-chip cooling using cold plate technology is one such method used to remove heat from high-power electronic components such as CPUs and GPUs. Thermal Design Power (TDP) has been increasing rapidly over the years and will continue to increase, not only for CPUs and GPUs but also for associated electronic components such as DRAM, the Platform Controller Hub (PCH), and other I/O chipsets on a typical server board. Therefore, unlike air hybrid cooling, which uses liquid for cold plates and air as the secondary medium for cooling the associated electronics, we foresee using immersion-based fluids to cool the rest of the electronics in the server. The broader focus of this research is to study the effects of adopting immersion cooling with integrated cold plates for high-performance systems. Although several other factors are involved in the study, this paper focuses on the optimization of cold plate microchannels for immersion-based fluids in an immersion-cooled environment. Since immersion fluids are dielectric and the fluids used in cold plates are conductive, a leak into the tank poses a major risk of short-circuiting the electronics. Therefore, we propose pumping the immersion fluid through the cold plate. 
However, this raises concerns about poor thermal performance and the associated pumping power due to differences in viscosity and other fluid properties. To address these concerns, the objective is to optimize the cold plate microchannel fin parameters by evaluating thermal resistance and pressure drop across the cold plate. The detailed CFD modeling and optimization of the cold plate were done using Ansys Icepak and Ansys optiSLang, respectively. 
  3. In response to the exponential growth of online platforms and the rise of web-based Artificial Intelligence (AI), the demand for computational power and the expansion of data centers have surged significantly. This trend necessitates advanced cooling strategies and heightened energy efficiency to address the increasing power densities of Information Technology (IT) equipment and the consequent rise in energy consumption. Consequently, there is a significant pivot towards efficient cooling mechanisms that emphasize thermal management and energy efficiency. Against this backdrop, our study thoroughly evaluates a two-phase direct-to-chip liquid cooling system's ability to effectively manage and dissipate heat in high-density rack environments. Central to our research is the deployment of a highly efficient Refrigerant-to-Liquid (R2L) Coolant Distribution Unit (CDU) across multi-racks, which face high thermal demands. This innovative system, featuring an in-row pumped two-phase CDU with a cooling capacity of 160 kW, is intricately integrated with row and rack manifolds and server cooling loops to ensure optimal cooling performance. To accurately simulate the thermal loads encountered in real-world data center operations, the study employs Thermal Testing Vehicles (TTVs). These 3U TTVs are equipped with 2.5 kW heaters, covering an extensive area of 2500 mm², thereby effectively replicating server thermal loads up to 10 kW. The investigation starts with a detailed description of the system's design and continues with the commissioning process. This process includes extensive hydraulic and thermal testing, along with a comprehensive assessment of the impact of pressure drops across the system, focusing on supply manifolds, cooling loops, dry breaks, and return manifolds, utilizing Cooling Loops (CLs) each containing four Cold Plates (CPs). 
The study culminates in the analysis of experimental data from heating the TTVs, focusing on the efficiency of two-phase cooling in transferring heat from the TTVs to chilled water using R134a refrigerant as the performance benchmark. Future directions include exploring eco-friendly cooling practices by investigating alternative green refrigerants with low Global Warming Potential (GWP) to replace R134a, aligning with global sustainability goals and the imperative to reduce greenhouse gas emissions. The observed maximum values were calculated at a specific volumetric flow rate of 0.48 LPM/kW and a Tcase as low as 56.4 °C was achieved. These results demonstrate the system's capability to significantly enhance thermal management in data centers, tackle the challenges presented by high-power density chips, and encourage broader adoption of two-phase cooling technologies as a sustainable strategy for thermal regulation in the face of increasing computational demands. 
  4. Owing to the dramatic increase in IT power density and energy consumption, the data center (DC) sector has started adopting thermally- and energy-efficient liquid cooling methods. This study examines a single-phase direct-to-chip liquid cooling approach for three high-heat-density racks, utilizing two liquid-to-air (L2A) cooled coolant distribution units (CDUs) and a combined total heat load of 128 kW. An experimental setup was developed to test different types of CDUs, cooling loops, and thermal testing vehicles (TTVs) for different operating conditions. IR images and the collected data were used to investigate the effect of air recirculation between cold and hot aisle containments on the CDU’s performance and stability of supply air temperature (SAT). Three different types of cooling loops (X, Y, and Z) were characterized thermally and hydraulically. Results show that Type Y has the lowest cold plate thermal resistance and pressure drop of the three. In a later test that included a single rack at a heat load of 53 kW and a single CDU, the heat capture ratio for fluid was found to be 94%. Experiments show that using blanking panels on the back of the racks limits hot air recirculation and maintains a steady SAT in the cold aisle. Finally, the CDU performance was evaluated at a high heat load of 128 kW for the three racks; the average cooling capacity of the units was 58.6 kW, and the effectiveness values for CDU 1 and CDU 2 were 0.83 and 0.82, respectively. 
  5. Abstract Direct Liquid Cooling (DLC) has emerged as a promising technology for thermal management of high-performance computing servers, enabling efficient heat dissipation and reliable operation. Thermal performance is governed by several factors, including the coolant physical properties and flow parameters such as coolant inlet temperature and flow rate. The design and development of the coolant distribution manifold to the Information Technology Equipment (ITE) can significantly impact the overall performance of the computing system. This paper aims to investigate the hydraulic characterization and design validation of a rack-level coolant distribution manifold or rack manifold. To achieve this goal, a custom-built high power-density liquid-cooled ITE rack was assembled, and various cooling loops were plugged into the rack manifold to validate its thermal performance. The rack manifold is responsible for distributing the coolant to each of these cooling loops, which is pumped by a CDU (Coolant Distribution Unit). In this study, pressure drop characteristics of the rack manifold were obtained for flow rates that effectively dissipate the heat loads from the ITE. The pressure drop is a critical parameter in the design of the coolant distribution manifold since it influences the flow rate and ultimately the thermal performance of the system. By measuring the pressure drop at various flow rates, the researchers can accurately determine the optimum flow rate for efficient heat dissipation. Furthermore, 1D flow network and CFD models of the rack-level coolant loop, including the rack manifold, were developed, and validated against experimental test data. The validated models provide a useful tool for the design of facility-level modeling of a liquid-cooled data center. The CFD models enable the researchers to simulate the fluid flow and heat transfer within the cooling system accurately. 
These models can help to design the coolant distribution manifold at facility level. The results of this study demonstrate the importance of the design and development of the coolant distribution manifold in the thermal performance of a liquid-cooled data center. The study also highlights the usefulness of 1D flow network and CFD models for designing and validating liquid-cooled data center cooling systems. In conclusion, the hydraulic characterization and design validation of a rack-level coolant distribution manifold is critical in achieving efficient thermal management of high-performance computing servers. This study presents a comprehensive approach for hydraulic characterization of the coolant distribution manifold, which can significantly impact the overall thermal performance and reliability of the system. The validated models also provide a useful tool for the design of facility-level modeling of a liquid-cooled data center. 
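Several of the abstracts above report cold plate thermal resistance and a heat capture ratio without stating how they are computed. The sketch below uses commonly seen definitions as an assumption: thermal resistance as case-to-inlet temperature rise per watt, and heat capture ratio as the heat carried away by the coolant divided by the applied heat load. The function names, units, and fluid properties are hypothetical inputs chosen for illustration; none of them are taken from the listed papers.

```python
def thermal_resistance(t_case_c, t_inlet_c, power_w):
    """Cold plate thermal resistance in K/W.

    Assumed definition: (case temperature - coolant inlet temperature) / power.
    """
    return (t_case_c - t_inlet_c) / power_w


def heat_captured_w(flow_lpm, density_kg_m3, cp_j_per_kg_k, t_out_c, t_in_c):
    """Heat carried away by the coolant in watts: m_dot * cp * (T_out - T_in)."""
    # Convert L/min to m^3/s (divide by 60000), then to kg/s via density.
    mass_flow_kg_s = flow_lpm / 60000.0 * density_kg_m3
    return mass_flow_kg_s * cp_j_per_kg_k * (t_out_c - t_in_c)


def heat_capture_ratio(captured_w, heat_load_w):
    """Fraction of the applied heat load that ends up in the liquid."""
    return captured_w / heat_load_w


if __name__ == "__main__":
    # Placeholder inputs; density and cp must come from the coolant datasheet.
    q = heat_captured_w(1.0, 1000.0, 4000.0, 30.0, 20.0)
    print(thermal_resistance(60.0, 20.0, 1000.0), q, heat_capture_ratio(q, 1000.0))
```

Keeping density and specific heat as explicit arguments avoids hard-coding properties for a particular glycol mixture, which vary with concentration and temperature.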