The number of downloads from the MVAPICH site has crossed a quarter million (250,000). The MVAPICH team would like to thank all its users and organizations!!

MVAPICH2 drives 8th ranked 5+ Petaflop TACC Stampede system with 519,640 cores, InfiniBand FDR and Intel MIC [more]

Welcome to this web page related to "MPI over InfiniBand, 10GigE/iWARP and RDMA over Converged Ethernet (RoCE)" project, lead by Network-Based Computing Laboratory (NBCL) of The Ohio State University. The MVAPICH2 software, supporting MPI 3.0 standard, delivers best performance, scalability and fault tolerance for high-end computing systems and servers using InfiniBand, 10GigE/iWARP and RoCE networking technologies. This software is being used by more than organizations world-wide in 76 countries (Current Users) to extract the potential of these emerging networking technologies for modern systems. As of , more than downloads have taken place from this project's site. This software is also being distributed by many InfiniBand, 10GigE/iWARP and RoCE vendors in their software distributions. The MVAPICH2-X software package provides support for hybrid MPI+PGAS (UPC and OpenSHMEM) programming models with unified communication runtime for emerging exascale systems. The MVAPICH2-GDR package provides support for clusters with NVIDIA GPUs supporting the GPUDirect RDMA feature. The MVAPICH2-Virt package provides support for high performance and scalable MPI in a cloud computing environment with InfiniBand and SR-IOV. The MVAPICH2-MIC package provides support for clusters with Intel MIC coprocessors. MVAPICH2 software is powering several supercomputers in the TOP 500 list. Examples (from the July '15 ranking) include:

  • 8th, 519,640-core (Stampede) at TACC
  • 11th, 185,344-core (Pleiades) at NASA
  • 22nd, 76,032-core (Tsubame 2.5) at Tokyo Institute of Technology

This project is supported by funding from U.S. National Science Foundation, U.S. DOE Office of Science, Ohio Board of Regents, ODOD, Cisco Systems, Cray, Intel, Linux Networx, Mellanox, NVIDIA, QLogic, and Sun Microsystems; and equipment donations from Advanced Clustering, AMD, Appro, Chelsio, Dell, Fulcrum, Fujitsu, Intel, Mellanox, Microway, NetEffect, QLogic and Sun. Other technology partner includes: TotalView Technologies.


(NEW) Upcoming Tutorial: IB and HSE at at SC '15. Sneak preview here.

MVAPICH2-Virt 2.1 GA (based on MVAPICH2 2.1 GA) with support for efficient MPI communication over SR-IOV enabled InfiniBand network, integration with OpenStack, high-performance and locality-aware MPI communication with IVSHMEM, automatic communication channel selection among SR-IOV, IVSHMEM and CMA/LiMIC2 is available. [more]

OSU InfiniBand Network Analysis and Monitoring (INAM) Tool 0.8 with support for analyzing and profiling network-level activities with many parameters (data and errors), node-level, job-level and process-level activities for MPI communication (Point-to-Point, Collectives and RMA) with MVAPICH2-X, remote monitoring of CPU utilization is available. [more]

MVAPICH2-EA (Energy-Aware) 2.1 with energy-efficient support for IB, RoCE and iWARP, user defined energy-performance trade-off levels, and compatibility with OEMT is is available. [more]

OSU Energy Management Tool (OEMT) 0.8 to measure the energy consumption of MPI applications is available. [more]

MVAPICH2 2.2a (based on MPICH 3.1.4) with minimized memory footprint, HCA-aware process mapping, intra-node in RoCE mode without shared memory, and hwloc 1.11.0 is available. [more]

MVAPICH2-X 2.2a providing support for advanced MPI features (Dynamic Connected (DC) transport and non-blocking collectives with Core-Direct), RoCE and DC for OpenSHMEM, UPC and CAF, hybrid MPI+PGAS (UPC, OpenSHMEM and CAF) programming models, and support for INAM is available. [more]

OMB 5.0 with support for non-blocking collectives (iallgather, ialltoall, ibarrier, ibcast, igather and iscatter), startup benchmarks and several enhancements is available. [more]

MVAPICH2-GDR 2.1rc2 (based on MVAPICH2 2.1rc2) with CUDA 7.0 compatibility, CUDA-Aware support for MPI_Rsend and MPI_Irsend primitives, added Parallel intranode communication channels, Optimized H-H, H-D, D-H, and intranode D-D communication along with tuning for point-point and collective operations, Update to sm_20 kernel optimization for Datatype processing [more]

MVAPICH2-MIC 2.0 (based on MVAPICH2 2.0.1) with optimized pt-to-pt and collective support for native, symmetric and offload modes on clusters with Intel MICs (Xeon Phis) is available. [more]