Publications
This page lists publications from the group related to designing High Performance MPI on InfiniBand. In addition, the group is also actively engaged in other research directions (PVFS and MPI-IO, Micro-Benchmark suite, Distributed Shared Memory, ARMCI, and Datacenter) related to modern interconnects. Publications related to these research directions are also included in the corresponding links.
- High Performance MPI on IBA
- Clustered Storage Systems
- Solaris NFS over RDMA Project
- Efficient Shared Memory on High-Speed Interconnects
- Design of Scalable Data-Centers with InfiniBand
- High Performance Networking for TCP-based Applications
- High Performance Computing with Virtual Machines
- NIC-level Support for Collective Communication and Synchronization with Myrinet and Quadrics
- NIC-level Support for Quality of Service (QoS)
- Micro-benchmarks and Performance Comparision of High-Speed Interconnects
High Performance MPI on IBA
Journals
- J. Liu, A. Mamidala, A. Vishnu and D. K. Panda. Performance Evaluation of InfiniBand with PCI Express . IEEE Micro, 2005.
-
J. Liu, J. Wu, and D. K. Panda.
High Performance RDMA-Based MPI Implementation over InfiniBand.
In Int'l Journal of Parallel Programming, 2004.
-
J. Liu, B. Chandrasekaran, W. Yu, J. Wu, D. Buntinas, S. Kini, P. Wyckoff, and D. K. Panda.
Micro-Benchmark Performance Comparison of High-Speed Cluster Interconnects.
IEEE Micro, January/February, 2004.
Conferences/Workshops
-
R. Kumar, A. Mamidala,
M. Koop, G. Santhanaraman and D.K. Panda,
Lock-free Asynchronous
Rendezvous Design for MPI Point-to-Point Communication,
EuroPVM/MPI 2008, Sept. 2008.
- M. Koop, R. Kumar, and D. K. Panda, Can Software Reliability Outperform Hardware Reliability on High Performance Interconnects? A Case Study with MPI over InfiniBand, 22nd ACM International Conference on Supercomputing (ICS08), Island of Kos, Greece, June 2008. Conference Slides
- A. Mamidala, R. Kumar, D. De and D. K. Panda, MPI Collectives on Modern Multicore Clusters: Performance Optimizations and Communication Characteristics, Int'l Symposium on Cluster Computing and the Grid (CCGrid), Lyon, France, May 2008.
- M. Koop, T. Jones, and D. K. Panda, MVAPICH-Aptus: Scalable High-Performance Multi-Transport MPI over InfiniBand, IEEE International Parallel and Distributed Processing Symposium (IPDPS '08), Miami, Florida, April 2008. Conference Slides
- G. Santhanaraman, S. Narravula and D. K. Panda, Designing Passive Synchronization for MPI-2 One-Sided Communication to Maximize Overlap , IEEE International Parallel and Distributed Processing Symposium (IPDPS '08), Miami, Florida, April, 2008.
- R. Kumar, A. Mamidala and D. K. Panda, Scaling Alltoall Collective on Multi-core Systems, Workshop on Communication Architecture for Clusters, to be held in conjunction with IPDPS '08, Miami, Florida, April 2008. Conference Slides
- M. Koop, S. Sur and D. K. Panda, Zero-Copy Protocol for MPI using InfiniBand Unreliable Datagram, IEEE International Conference on Cluster Computing (Cluster'07), Austin, TX, September 2007. Conference Slides
- S. Narravula, A. R. Mamidala, A. Vishnu, G. Santhanaraman, and D. K. Panda, High Performance MPI over iWARP: Early Experiences, Int'l Conference on Parallel Processing, XiAn, China, September 2007. Conference Slides
- S. Sur, M. Koop, L. Chai and D. K. Panda, Performance Analysis and Evaluation of Mellanox ConnectX InfiniBand Architecture with Multi-Core Platforms, 15th Symposium on Hot Interconnects, August 2007. Conference Slides
- M. Koop, S. Sur, Q. Gao and D. K. Panda, High Performance MPI Design using Unreliable Datagram for Ultra-Scale InfiniBand Clusters, 21st Int'l ACM Conference on Supercomputing, June 2007. Conference Slides
-
W. Huang, J. Liu, M. Koop, B. Abali and D. K. Panda,
Nomad: Migrating OS-bypass Networks in Virtual Machines,
Third International ACM SIGPLAN/SIGOPS Conference on Virtual Execution
Environments (VEE),
San Diego, California,
June 13-15, 2007
-
A. Vishnu, M. Koop, A. Moody, A. Mamidala, S. Narravula and
D. K. Panda, Hot-Spot Avoidance With Multi-Pathing Over InfiniBand: An
MPI Perspective,
Int'l Symposium on Cluster Computing and the Grid (CCGrid),
Rio de Janeiro - Brazil,
May 2007.
Conference Slides
-
M. Koop, T. Jones and D. K. Panda,
Reducing Connection Memory
Requirements of MPI for InfiniBand Clusters: A Message Coalescing
Approach,
Int'l Symposium on Cluster Computing and the Grid (CCGrid),
Rio de Janeiro - Brazil,
May 2007.
Conference Slides
-
L. Chai, Q. Gao and D. K. Panda,
Understanding the Impact of
Multi-Core Architecture in Cluster Computing: A Case Study with Intel
Dual-Core System,
Int'l Sympsoium on Cluster Computing and the Grid (CCGrid),
Rio de Janeiro - Brazil,
May 2007
- A. Vishnu, A. Mamidala, S. Narravula and D. K. Panda,
Automatic Path Migration over InfiniBand: Early Experiences,
Third International Workshop on
System Management Techniques, Processes, and Services,
to be held in conjunction with IPDPS '07,
March 2007. Conference Slides
- A. Vishnu, B. Benton and D. K. Panda,
High Performance MPI on IBM 12x InfiniBand Architecture,
International Workshop on
High-Level Parallel Programming Models and
Supportive Environments,
to be held in conjunction with IPDPS '07,
March 2007. Conference Slides
- K. Vaidyanathan, W. Huang, L. Chai and D. K. Panda,
Designing Efficient
Asynchronous Memory Operations Using Hardware Copy Engine: A Case Study
with I/OAT , International Workshop on
Communication Architecture for Clusters,
to be held in conjunction with IPDPS '07,
March 2007.
-
A. R Mamidala, S. Narravula, A. Vishnu, G. Santhanaraman, and D. K. Panda,
Using Connection-Oriented and Connection-Less
Transport on Performance and Scalability of Collective
and One-sided operations: Trade-offs and Impact,
International Symposium on Principles and Practice of Parallel
Programming (PPoPP 2007), March 2007, San Jose, California.
Conference Slides
- S. Sur, M. Koop and D. K. Panda, High-Performance and Scalable MPI
over InfiniBand with Reduced Memory Usage: An In-Depth Performance
Analysis, SC '06, November 2006.
- A. Vishnu, P. Gupta, A. Mamidala and D. K. Panda, A Software Based
Approach for Providing Network Fault Tolerance in Clusters Using the
uDAPL Interface: MPI Level Design and Performance Evaluation,
SC '06, November 2006.
- L. Chai, A. Hartono and D. K. Panda,
Designing Efficient MPI Intra-node Communication Support for Modern
Computer Architectures,
Int'l IEEE Conference on Cluster Computing, September 2006.
-
A. R. Mamidala, A. Vishnu, D. K. Panda,
Efficient Shared Memory and RDMA based design for MPI_Allgather over InfiniBand,
EuroPVM/MPI, September 2006.
- M. Koop, W. Huang, A. Vishnu and D. K. Panda,
Memory Scalability Evaluation of the Next-Generation
Intel Bensley Platform with InfiniBand,
Int'l Symposium on Hot Interconnect (HotI), August 2006.
- Q. Gao, W. Yu, W. Huang and D. K. Panda,
Application-Transparent Checkpoint/Restart for MPI Programs over InfiniBand,
Int'l Conference on Parallel Processing (ICPP), August 2006.
- W. Huang, J. Liu, B. Abali and D. K. Panda,
A
Case for High Performance Computing with Virtual Machines,
Int'l Conference on Supercomputing (ICS), June 2006.
- Jiuxing Liu, Wei Huang, Bulent Abali and Dhabaleswar K. Panda, High Performance VMM-Bypass I/O in Virtual Machines , USENIX Annual Technical Conference, June 2006.
- W. Huang, G. Santhanaraman, H.-W. Jin, Q. Gao and D. K. Panda,
Design and Implementation of High
Performance MVAPICH2: MPI2 over InfiniBand,
Int'l Sympsoium on Cluster Computing and the Grid (CCGrid), Singapore,
May 2006
- L. Chai, R. Noronha and D. K. Panda,
MPI over uDAPL: Can High Performance and
Portability Exist Across Architectures?
Int'l Sympsoium on Cluster Computing and the Grid (CCGrid), Singapore,
May 2006
-
A. Mamidala, L. Chai, H.-W. Jin and D. K. Panda,
Efficient SMP-Aware MPI-Level Broadcast over
InfiniBand's Hardware Multicast,
Communication Architecture for Clusters (CAC) Workshop,
to be held in conjunction with
Int'l Parallel and Distributed Processing Symposium (IPDPS '06),
April 2006, Rhode Island, Greece.
Conference Slides
-
P. Balaji, S. Bhagvat, H.-W. Jin and D. K. Panda,
Asynchronous Zero-Copy Communication for
Synchronous Sockets Direct Protocol (SDP) over InfiniBand,
Communication Architecture for Clusters (CAC) Workshop,
to be held in conjunction with
Int'l Parallel and Distributed Processing Symposium (IPDPS '06),
April 2006, Rhode Island, Greece.
-
W. Yu, R. Noronha, S. Liang and D. K. Panda,
Benefits of High Speed Interconnects to
Cluster File Systems: A Case Study with Lustre,
Communication Architecture for Clusters (CAC) Workshop,
to be held in conjunction with
Int'l Parallel and Distributed Processing Symposium (IPDPS '06),
April 2006, Rhode Island, Greece.
-
P. Balaji, K. Vaidyanathan, S. Narravula, H.-W. Jin, and D. K. Panda,
Designing Next-Generation Data-Centers with Advanced Communication Proto
cols and Systems Services, Workshop on NSF Next Generation Software(NGS) Pro
gram; held in conjuction with IPDPS, Greece, 2006.
-
S. Sur, L. Chai, H.-W. Jin and D. K. Panda,
Shared Receive Queue based
Scalable MPI Design for InfiniBand Clusters,
Int'l Parallel and Distributed Processing Symposium (IPDPS '06),
April 2006, Rhode Island, Greece.
Conference Slides
- Weikuan Yu, Qi Gao and Dhabaleswar K. Panda, Adaptive Connection Management for Scalable MPI over InfiniBand , International Parallel and Distributed Processing Symposium (IPDPS 2006). Conference Slides
- S. Sur, H.-W. Jin, L. Chai and D. K. Panda RDMA Read Based Rendezvous Protocol for MPI over InfiniBand: Design Alternatives and Benefits , Symposium on Principles and Practice of Parallel Programming (PPOPP'06), March 29-31, 2006, Manhattan, New York City.
-
S. Sur, U. Bondhugula, A. Mamidala, H.-W. Jin, and D. K. Panda,
High
Performance RDMA Based All-to-all Broadcast for InfiniBand Clusters,
International Conference on High Performance Computing (HiPC 2005), December 18-21, 2005, Goa, India.
Conference Slides
- A. Vishnu, G. Santhanaraman, W. Huang, H. -W. Jin and D. K. Panda Supporting MPI-2 One Sided Communication on Multi-Rail InfiniBand Clusters: Design Challenges and Performance Benefits , International Conference on High Performance Computing (HiPC 2005), December 18-21, 2005, Goa, India. Accepted for publication. Conference Slides
- L. Chai, R. Noronha, P. Gupta, G. Brown, and D. K. Panda, Designing a Portable MPI-2 over Modern Interconnects Using uDAPL Interface, EuroPVM/MPI 2005, Sept. 2005. Conference Slides
- A. Mamidala, H. -W. Jin, and D. K. Panda, Efficient Hardware Multicast Group Management for Multiple MPI Communicators over InfiniBand, EuroPVM/MPI 2005, Sept. 2005. Conference Slides
- W. Huang, G. Santharaman, H. -W. Jin, and D. K. Panda, Design Alternatives and Performance Trade-offs for Implementing MPI-2 over InfiniBand, EuroPVM/MPI 2005, Sept. 2005. Conference Slides
- S. Sur, A. Vishnu, H. -W. Jin, W. Huang, and D. K. Panda, Can Memory-Less Network Adapters Benefit Next-Generation InfiniBand Systems?, Hot Interconnect (HOTI 05), August, 2005. Conference Slides
- R. Noronha, and D. K. Panda, Performance Evaluation of MM5 on Clusters With Modern Interconnects: Scalability and Impact, Euro-Par, August 2005. Accepted for publication.
- H. -W. Jin, S. Sur, L. Chai, and D. K. Panda, LiMIC: Support for High-Performance MPI Intra-Node Communication on Linux Cluster, International Conference on Parallel Processing (ICPP-05), June 2005. Conference Slides
- L. Chai, S. Sur, H. -W. Jin, D. K. Panda, Analysis of Design Considerations for Optimizing Multi-Channel MPI over InfiniBand , Workshop on Communication Architecture on Clusters (CAC 05) in conjunction with International Parallel and Distributed Processing Symposium (IPDPS 2005). Technical Report. Conference Slides
- W. Huang, G. Santhanaraman, H. -W. Jin, D. K. Panda, Scheduling of MPI-2 One Sided Operations over InfiniBand, Workshop on Communication Architecture on Clusters (CAC 05) in conjunction with International Parallel and Distributed Processing Symposium (IPDPS 2005). Conference Slides
- A. Vishnu, A. R. Mamidala, H. -W. Jin and D. K. Panda, Performance Modeling of Subnet Management on Fat Tree InfiniBand Networks using OpenSM , Workshop on System Management Tools on Large Scale Parallel Systems, Held in Conjunction with IPDPS 2005. Conference Slides
- W. Yu, J. Wu and D. K. Panda, Scalable Startup of Parallel Programs over InfiniBand , International Conference on High Performance Computing, HiPC 2004, Bangalore, India. Conference Slides .
- J. Liu, A. Vishnu, D. K. Panda, Building Multirail InfiniBand Clusters: MPI-Level Design and Performance Evaluation, SuperComputing 2004 Conference (SC 04), November, 2004. Conference Slides
- A. Mamidala, J. Liu, and D. K. Panda, Efficient Barrier and Allreduce on IBA clusters using hardware multicast and adaptive algorithms, IEEE Cluster Computing 2004, Sept. 20-23 2004, San Diego, California. Conference Slides .
- G. Santhanaraman, J. Wu and D. K. Panda, Zero-Copy MPI Derived Datatype Communication over InfiniBand, EuroPVM/MPI 2004, Sept. 19-22 2004, Budapest, Hungary. Conference Slides .
- W. Jiang, J. Liu, H. -W. Jin, D. K. Panda, D. Buntinas, R. Thakur, and W. Gropp, Efficient Implementation of MPI-2 Passive One-Sided Communication on InfiniBand Clusters , EuroPVM/MPI 2004, Sept. 19-22 2004, Budapest, Hungary. Conference Slides .
- J. Liu, A. Mamidala, A. Vishnu, and D. K. Panda, Performance Evaluation of InfiniBand with PCI Express , Hot Interconnect 12 (HOTI 04), August, 2004. Conference Slides
- S. Sur, H. -W. Jin, and D. K. Panda, Efficient and Scalable All-to-All Exchange for InfiniBand-based Clusters. International Conference on Parallel Processing (ICPP-04), Aug. 15-18, 2004, Montreal, Quebec, Canada. Conference Slides
-
J. Liu, W. Jiang, P. Wyckoff, D. K. Panda, D. Ashton, D. Buntinas,
W. Gropp, and B. Toonen.
Design and Implementation of MPICH2 over InfiniBand with RDMA Support.
In Int'l Parallel and Distributed Processing Symposium (IPDPS 04), April, 2004.
Conference Slides
-
J. Liu, A. Mamidala and D. K. Panda.
Fast and Scalable MPI-Level Broadcast using InfiniBand's Hardware Multicast Support.
In Int'l Parallel and Distributed Processing Symposium (IPDPS 04), April, 2004.
Conference Slides
-
Jiesheng Wu, Pete Wyckoff, and Dhabaleswar K. Panda.
High Performance Implementation of MPI Datatype Communication over InfiniBand.
In Int'l Parallel and Distributed Processing Symposium (IPDPS 04), April, 2004.
-
Jiuxing Liu and Dhabaleswar K. Panda.
Implementing Efficient and Scalable Flow Control Schemes in MPI over InfiniBand.
In Workshop on Communication Architecture for Clusters (CAC 04), Held in Conjunction with Int'l Parallel and Distributed Processing Symposium (IPDPS 04), April, 2004.
Conference Slides
-
W. Jiang, J. Liu, H. -W. Jin, D. K. Panda, W. Gropp, and R. Thakur.
High Performance MPI-2 One-Sided Communication over InfiniBand.
In 4th IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid 04), April, 2004.
Conference Slides .
-
Jiuxing Liu, Balasubramanian Chandrasekaran, Jiesheng Wu, Weihang Jiang,
Sushmitha Kini, Weikuan Yu, Darius Buntinas, Pete Wyckoff, and Dhabaleswar K. Panda.
Performance Comparison of MPI Implementations over InfiniBand, Myrinet and Quadrics.
In SuperComputing 2003 Conference, Pheonix, AZ, November, 2003.
- Sushmitha P. Kini, Jiuxing Liu, Jiesheng Wu, Pete Wyckoff, and Dhabaleswar K. Panda. Fast and Scalable Barrier using RDMA and Multicast Mechanisms for InfiniBand-Based Clusters. In Euro PVM/MPI Conference, September 29-Oct 2, 2003, Venice, Italy.
-
Jiuxing Liu, Jiesheng Wu, Sushmitha P. Kini, Peter Wyckoff, and Dhabaleswar K. Panda.
High Performance RDMA-Based MPI Implementation over InfiniBand.
In the Proceedings of 17th Annual ACM International Conference on Supercomputing. San Francisco Bay Area. June, 2003.
Technical Reports
-
H. -W. Jin, S. Sur, L. Chai and D. K. Panda,
Design and Performance Evaluation
of LiMIC (Linux Kernel Module for MPI Intra-node Communication) on InfiniBand
Cluster, OSU-CISRC-10/04-TR58.
PhD Dissertations
- Abhinav Vishnu, High Performance and Network Fault Tolerant MPI with Multi-Pathing Over InfiniBand, Dec 2007.
- Sayantan Sur, Scalable and High Performance MPI Design for Very Large InfiniBand Clusters, August 2007.
- Jiuxing Liu, Designing High Performance and Scalable MPI over InfiniBand, Sept 2003.
M.S. Thesis
- W. Jiang, High Performance MPICH2 One-Sided Communication Implementation over InfiniBand, , June 2004.
- B. Chandrasekharan, Micro-benchmark Level Performance Evaluation and Comparison of High Speed Cluster Interconnects September 2003.

