Optimizing the Performance of SPEC MPI with Skylake and Omni-Path
Optimizing the Performance of SPEC MPI with Skylake and Omn-Path
Tuning the Eager threshold has a significant impact on application performance by avoiding the synchronization of rendezvous protocol and thus yielding better communication computation overlap.
Library Version: MVAPICH2 2.3rc1
Runtime Flags:
milc | MV2_SMP_EAGERSIZE=256000 |
leslie3d | None |
pop2 | None |
lammps | MV2_SMP_EAGERSIZE=1024000 |
wrf2 | MV2_SMP_EAGERSIZE=128000 |
GAPgeofem | None |
tera_tf | MV2_SMP_EAGERSIZE=128000 |
lu | MV2_SMP_EAGERSIZE=1024000 |
System Details:
CPU Model | Intel Xeon Platinum 8160 |
CPU Core Info | 2 x 24 @ 2.1 GHz |
IB Card | 100Gb/sec Intel Omni-Path (OPA) |
IB Switch | Full Fat Tree 100Gb/sec Intel Omni-Path with 6 core switches |
Tags: MILC SpecMPI Parallel Ocean Program 2 (pop2) LAMMPS Molecular Dynamics Simulator WRF2 GAPgeofem TERA_TF LU
Submitted by Amit Ruhela @ OSU
Last Modified March 6, 2018, 1:53 p.m.