Impact of collective tuning for applications using MPI+OpenMP programming model

Application execution time with 512 cores on Stampede

Application execution time with 512 cores on Stampede

Library Version: MVAPICH2 2.2b

Runtime Flags: The appropriate tuning parameters for hybrid MPI+OpenMP programming models is enabled by default starting from MVAPICH2-2.2b onward

System Details: Stampede@ TACC: Sandybridge architecture with dual 8-cores nodes and ConnectX-3 FDR InfiniBand interconnect

Tags: Lulesh


Submitted by Jerome Vienne and Carlos Rosales-Fernandez @ TACC

Last Modified April 19, 2016, 10:19 a.m.