Technical white paper

HPE DDR4 server memory performance in HPE ProLiant and HPE Synergy Gen10 servers with Intel Xeon Scalable processors

Introduction

The HPE ProLiant and Synergy Gen10 servers introduced significant memory performance advantages over their Gen9 counterparts. The Intel Skylake CPUs used in HPE Gen10 servers increased the number of memory channels from 8 to 12 in the two-socket server and from 16 to 24 channels in the four-socket servers. In both cases, the maximum DIMM data rate increased from 2400 MT/s to 2666 MT/s. The increase in memory channels and data rate contributed to a 66% growth in DDR4 memory bandwidth from Gen9 to Gen10 servers. With the addition of 2933 MT/s SmartMemory, recently introduced with Intel Cascade Lake-based processors, this grows to an 81% improvement over Gen9 memory. At the same time, many Gen10 servers continue to support two DIMMs per channel (2DPC) at the maximum allowable data rate of 2933 MT/s.
The Gen10 ROM Based Setup Utility (RBSU) made it easier to deploy the servers by introducing Workload Profiles for users to optimize the CPU, Memory, and I/O configurations with a single menu selection based on specific workloads such as high-performance compute (HPC), database, and virtualization. While RBSU Workload Profiles significantly simplifies configuring HPE ProLiant servers, RBSU continues to enable advanced users to individually configure specific features in the CPU, memory, and I/O subsections.
Hewlett Packard Enterprise continues to increase the resiliency of the HPE ProLiant and Synergy Gen10 servers that use the Intel Xeon Scalable family of processors by introducing HPE Fast Fault Tolerance. This memory reliability, accessibility, and serviceability (RAS) feature, the result of a joint collaboration between Intel and HPE, allows the server to continue running with a very small impact to memory throughput when up to two DRAMs fail on a DIMM. In previous generations, this feature required the system to boot up with a 50% reduction in memory throughput in anticipation of multiple DRAM failures. In the Gen10 servers, there is no performance loss until one DRAM fails, and even then, the reduction in maximum throughput is small.