VERSION: AMD ZEN HPCG AVX2 2023-01
DEPENDENCIES:
- This binary executable was built with AVX2 support, and will only run properly on systems that support AVX2 instructions:
- Specifically: AMD “Zen3” - based processors such as the AMD 3rd Generation EPYC™ CPUs.
- The binary is NOT intended to produce optimal results on other platforms.
- The binary was built on Red Hat® Enterprise Linux® 8.6 and should only be expected to run on such systems. However, it has also been tested on Red Hat® Enterprise Linux® 9.
- OpenMPI 4: This binary was built against OpenMPI 4.1.4 and should run without issue as long as OpenMPI 4 is in the PATH.
RECOMMENDED SETTINGS:
- Boost: ON
- Transparent Hugepages: always
- SMT: OFF
- NPS: 4
- Determinism: Power
HOW TO RUN:
- Ensure OpenMPI 4.x is installed and loaded in your environment.
- Place the supplied hpcg.dat file in the same directory as the AMD Zen HPCG binary. Modify hpcg.dat as required.
- By default, hpcg.dat will define a very small problem, where the 2nd line represents values of nx, ny, and nz, respectively and the 3rd line represents the runtime. For valid benchmark runs, the problem size should use at least 1/4th of the total available main memory and the runtime should be at least 1800 seconds.
- Alternatively, you may pass these arguments in the command line:
- Example: –nx=<value> –ny=<value> –nz=<value> –rt=<value>
Note: These parameters will override the values set in hpcg.dat.
- Example: –nx=<value> –ny=<value> –nz=<value> –rt=<value>
- Example Run Command for Single Node:
- For a short run on Dual Socket System with 64 Cores/socket and 512GB RAM:
mpirun -np 32 --bind-to core --map-by ppr:2:l3cache:pe=2 -x OMP_NUM_THREADS=2 -x OMP_PROC_BIND=true -x OMP_PLACES=cores ./amd_zen_hpcg_avx2_2023_01 --nx=192 --ny=192 --nz=192 --rt=60
- For a short run on Dual Socket System with 64 Cores/socket and 512GB RAM: