25:00
Focus
Sign in to save your learning paths. Guest paths may be lost if you clear your browser data.Sign in
Lesson 7

Cooling Technologies and Chiller Plants

~12 min100 XP

Introduction

Modern data centers are essentially massive heat engines, consuming significant electricity to power high-density computing hardware. You will discover how architects manage the thermal output of thousands of servers through sophisticated cooling architectures and the thermodynamic principles that govern chiller plant operations.

Rack-Level Thermal Management: CRAC and CRAH

To prevent thermal throttling or hardware failure, data centers utilize Computer Room Air Conditioning (CRAC) or Computer Room Air Handling (CRAH) units. While often used interchangeably, their underlying mechanisms differ: a CRAC unit contains a self-contained refrigeration cycle (compressor and refrigerant), whereas a CRAH unit utilizes chilled water supplied by a central plant to cool room air.

The architecture of these systems relies on the hot aisle/cold aisle arrangement. By aligning server racks so that intakes face one another (forming a cold aisle) and exhausts face one another (forming a hot aisle), engineers create a repeatable pattern that allows for efficient airflow management.

A common pitfall in these designs is "air bypass," where cold air flows directly from the supply floor to the return intake without passing through the servers. This is corrected by installing blanking panels in unused rack spaces to force air through the equipment.

Exercise 1Multiple Choice
What is the primary objective of the hot aisle/cold aisle arrangement?

Chiller Plants and the Refrigeration Cycle

At the heart of the facility’s thermal infrastructure is the chiller plant. Large data centers often use centrifugal or screw chillers to produce chilled water. These chillers operate based on the vapor-compression cycle, which uses the phase change of a refrigerant to move heat.

The efficiency of a chiller is measured by its COP (Coefficient of Performance) or kW/Ton. The relationship is defined as: COP=Cooling OutputElectrical InputCOP = \frac{\text{Cooling Output}}{\text{Electrical Input}} As the temperature difference between the chilled water supply and the outside ambient air decreases, efficiency increases. This allows for economization (or "free cooling"), where the chiller is bypassed entirely, and outside air or water is used directly to reject heat into the atmosphere.

Note: Always monitor the approach temperature—the difference between the leaving chilled water temperature and the refrigerant temperature—as a degraded approach temperature indicates fouled heat exchanger tubes, leading to significant efficiency loss.

Liquid Cooling Architectures

As CPU and GPU TDP (Thermal Design Power) ratings increase, traditional air cooling reaches an efficiency wall. To mitigate this, engineers are transitioning to Direct-to-Chip (D2C) cooling or Immersion Cooling.

  1. D2C: Specialized cold plates are mounted directly onto high-heat processors, circulating a dielectric fluid or water-glycol mixture to extract heat at the source.
  2. Immersion Cooling: Servers are submerged in a thermally conductive, electrically insulating fluid (dielectric fluid). This provides superior heat transfer coefficients compared to air, allowing for massive increases in server rack density.
Exercise 2Fill in the Blank
___ cooling is a method where servers are submerged in a thermally conductive but electrically non-conductive fluid.

Environmental Controls and Humidity Standards

Temperature is only one part of the equation; humidity management is critical. If air is too dry, static discharge becomes a significant risk to motherboards and sensitive components. If air is too humid, condensation can cause corrosion or short circuits.

ASHRAE (American Society of Heating, Refrigerating and Air-Conditioning Engineers) provides the Psychrometric Chart standards for data centers. The recommended operational conditions typically hover between 18C18^{\circ}C and 27C27^{\circ}C with strict dew point limits.

Exercise 3True or False
High humidity is beneficial in data centers because it improves the cooling capacity of the air.

Key Takeaways

  • Airflow Management: Utilize physical containment and blanking panels to prevent air bypass, ensuring cooling capacity is directed at the hardware.
  • Thermodynamic Efficiency: Maximize the cooling cycle's efficiency by leveraging external ambient conditions for economization, reducing the reliance on compressor-based cooling.
  • Advanced Cooling: Move toward Direct-to-Chip or Immersion cooling to support hardware that exceeds the thermal limits of air-cooled environments.
  • Environmental Balance: Maintain humidity levels within ASHRAE guidelines to prevent both electrostatic discharge and moisture-related hardware corrosion.
Finding tutorial videos...
Go deeper
  • What is the main difference between CRAC and CRAH units?🔒
  • Why is a larger temperature delta more efficient for CRAH?🔒
  • How do blanking panels prevent air bypass?🔒
  • What are the benefits of hot aisle containment?🔒
  • Does the return plenum use negative pressure to pull air?🔒

Cooling Technologies and Chiller Plants — Data Centers | crescu