Publications
This page lists publications from the group related to designing High Performance MPI on InfiniBand. In addition, the group is also actively engaged in other research directions (PVFS and MPI-IO, Micro-Benchmark suite, Distributed Shared Memory, ARMCI, and Datacenter) related to modern interconnects. Publications related to these research directions are also included in the corresponding links.
- High Performance MPI on IBA
- Clustered Storage Systems
- Solaris NFS over RDMA Project
- Efficient Shared Memory on High-Speed Interconnects
- Design of Scalable Data-Centers with InfiniBand
- High Performance Networking for TCP-based Applications
- High Performance Computing with Virtual Machines
- NIC-level Support for Collective Communication and Synchronization with Myrinet and Quadrics
- NIC-level Support for Quality of Service (QoS)
- Micro-benchmarks and Performance Comparision of High-Speed Interconnects
High Performance MPI on IBA
Journals
- J. Liu, A. Mamidala, A. Vishnu and D. K. Panda. Performance Evaluation of InfiniBand with PCI Express . IEEE Micro, 2005.
-
J. Liu, J. Wu, and D. K. Panda.
High Performance RDMA-Based MPI Implementation over InfiniBand.
In Int'l Journal of Parallel Programming, 2004.
-
J. Liu, B. Chandrasekaran, W. Yu, J. Wu, D. Buntinas, S. Kini, P. Wyckoff, and D. K. Panda.
Micro-Benchmark Performance Comparison of High-Speed Cluster Interconnects.
IEEE Micro, January/February, 2004.
Conferences/Workshops
- X. Ouyang, K. Gopalakrishnan, T. Gangadharappa D. K. Panda, Fast Checkpointing by Write Aggregation with Dynamic Buffer and Interleaving on Multicore Architecture, Int'l Conference on High Performance Computing (HiPC' 09), Dec. 2009.
- X. Ouyang, K. Gopalakrishnan and D. K. Panda, Accelerating Checkpoint Operation by Node-Level Write Aggregation on Multicore S ystems, Int'l Conference on Parallel Processing (ICPP '09), Sept. 2009. Conference Slides.
- T. Gangadharappa, M. Koop and D. K. Panda, Designing and Evaluating MPI-2 Dynamic Process Management Support for InfiniBand , Int'l Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2 '09), in Conjunction with ICPP '09, Sept. 2009. Conference Slides.
- J. Sridhar and D. K. Panda, Impact of Node Level Caching in MPI Job Launch Mechanisms, The 16th Euro PVM/MPI conference, September 2009. Conference Slides.
- G. Santhanaraman, T. Gangadharappa, S. Narravula, A. Mamidala, D. K. Panda, Design Alternatives for Implementing Fence Synchronization in MPI-2 One-sided Communication on InfiniBand Clusters, IEEE Cluster 2009, September 2009. Conference Slides.
- M. Koop, M. Luo, D. K. Panda, Reducing Network Contention with Mixed Workloads on Modern Multicore Clusters, IEEE Cluster 2009, September 2009. Conference Slides.
- H. Subramoni, P. Lai, M. Luo, D. K. Panda, RDMA over Ethernet - A Preliminary Study, Workshop on High Performance Interconnects for Distributed Computing (HPIDC'09), September 2009. Conference Slides.
- H. Subramoni, M. Koop, and D. K. Panda, Designing Next Generation Clusters: Evaluation of InfiniBand DDR/QDR on Intel Computing Platforms, 17th Annual Symposium on High-Performance Interconnects (HotI'09), August 2009.
- G. Santhanaraman, P. Balaji, K. Gopalakrishnan, R. Thakur, W. Gropp and D. K. Panda, Natively Supporting True One-sided Communication in MPI on Multi-core Systems with InfiniBand, Int'l Symposium on Cluster Computing and the Grid (CCGrid), May 2009.
- M. Koop, J. Sridhar and D. K. Panda, TupleQ: Fully-Asynchronous and Zero-Copy MPI over InfiniBand, 23rd IEEE International Parallel and Distributed Processing Symposium (IPDPS 09), May 2009. Conference Slides.
- K. Kandalla, H. Subramoni, G. Santhanaraman, M. Koop and D. K. Panda, Designing Multi-Leader-Based Allgather Algorithms for Multi-Core Clusters, The 9th Workshop on Communication Architecture for Clusters (CAC 09), May 2009. Conference Slides.
- J. Sridhar, M. Koop, J. Perkins and D. K. Panda, ScELA: Scalable and Extensible Launching Architecture for Clusters, International Conference on High Performance Computing (HiPC 08), December 2008.
- M. Koop, J. Sridhar and D.K. Panda, Scalable MPI Design over InfiniBand using eXtended Reliable Connection, IEEE Cluster 2008, September 2008. Conference Slides.
- W. Huang, M. Koop and D.K. Panda, Efficient One-Copy MPI Shared Memory Communication in Virtual Machines, IEEE Cluster 2008, September 2008. Conference Slides.
- L. Chai, P. Lai, H.-W. Jin and D. K. Panda, Designing An Efficient Kernel-level and User-level Hybrid Approach for MPI Intra-node Communication on Multi-core Systems, Int'l Conference on Parallel Processing (ICPP '08), September, 2008. Conference Slides.
-
R. Kumar, A. Mamidala, M. Koop, G. Santhanaraman and D.K. Panda,
Lock-free Asynchronous Rendezvous Design for MPI Point-to-Point Communication,
EuroPVM/MPI 2008, Sept. 2008.
Conference Slides
-
M. Koop, W. Huang, K. Gopalakrishnan and D.K. Panda,
Performance Analysis and Evaluation of PCIe 2.0 and Quad-Data Rate InfiniBand
16th IEEE Symposium on Hot Interconnects, August 2008.
Conference Slides
- M. Koop, R. Kumar, and D. K. Panda, Can Software Reliability Outperform Hardware Reliability on High Performance Interconnects? A Case Study with MPI over InfiniBand, 22nd ACM International Conference on Supercomputing (ICS08), Island of Kos, Greece, June 2008. Conference Slides
- A. Mamidala, R. Kumar, D. De and D. K. Panda, MPI Collectives on Modern Multicore Clusters: Performance Optimizations and Communication Characteristics, Int'l Symposium on Cluster Computing and the Grid (CCGrid), Lyon, France, May 2008.Conference Slides
- M. Koop, T. Jones, and D. K. Panda, MVAPICH-Aptus: Scalable High-Performance Multi-Transport MPI over InfiniBand, IEEE International Parallel and Distributed Processing Symposium (IPDPS '08), Miami, Florida, April 2008. Conference Slides
- G. Santhanaraman, S. Narravula and D. K. Panda, Designing Passive Synchronization for MPI-2 One-Sided Communication to Maximize Overlap , IEEE International Parallel and Distributed Processing Symposium (IPDPS '08), Miami, Florida, April, 2008. Conference Slides
- R. Kumar, A. Mamidala and D. K. Panda, Scaling Alltoall Collective on Multi-core Systems, Workshop on Communication Architecture for Clusters, to be held in conjunction with IPDPS '08, Miami, Florida, April 2008. Conference Slides
- M. Koop, S. Sur and D. K. Panda, Zero-Copy Protocol for MPI using InfiniBand Unreliable Datagram, IEEE International Conference on Cluster Computing (Cluster'07), Austin, TX, September 2007. Conference Slides
- S. Narravula, A. R. Mamidala, A. Vishnu, G. Santhanaraman, and D. K. Panda, High Performance MPI over iWARP: Early Experiences, Int'l Conference on Parallel Processing, XiAn, China, September 2007. Conference Slides
- S. Sur, M. Koop, L. Chai and D. K. Panda, Performance Analysis and Evaluation of Mellanox ConnectX InfiniBand Architecture with Multi-Core Platforms, 15th Symposium on Hot Interconnects, August 2007. Conference Slides
- M. Koop, S. Sur, Q. Gao and D. K. Panda, High Performance MPI Design using Unreliable Datagram for Ultra-Scale InfiniBand Clusters, 21st Int'l ACM Conference on Supercomputing, June 2007. Conference Slides
-
W. Huang, J. Liu, M. Koop, B. Abali and D. K. Panda,
Nomad: Migrating OS-bypass Networks in Virtual Machines,
Third International ACM SIGPLAN/SIGOPS Conference on Virtual Execution
Environments (VEE),
San Diego, California,
June 13-15, 2007
-
A. Vishnu, M. Koop, A. Moody, A. Mamidala, S. Narravula and
D. K. Panda, Hot-Spot Avoidance With Multi-Pathing Over InfiniBand: An
MPI Perspective,
Int'l Symposium on Cluster Computing and the Grid (CCGrid),
Rio de Janeiro - Brazil,
May 2007.
Conference Slides
-
M. Koop, T. Jones and D. K. Panda,
Reducing Connection Memory
Requirements of MPI for InfiniBand Clusters: A Message Coalescing
Approach,
Int'l Symposium on Cluster Computing and the Grid (CCGrid),
Rio de Janeiro - Brazil,
May 2007.
Conference Slides
-
L. Chai, Q. Gao and D. K. Panda,
Understanding the Impact of
Multi-Core Architecture in Cluster Computing: A Case Study with Intel
Dual-Core System,
Int'l Sympsoium on Cluster Computing and the Grid (CCGrid),
Rio de Janeiro - Brazil,
May 2007
- A. Vishnu, A. Mamidala, S. Narravula and D. K. Panda,
Automatic Path Migration over InfiniBand: Early Experiences,
Third International Workshop on
System Management Techniques, Processes, and Services,
to be held in conjunction with IPDPS '07,
March 2007. Conference Slides
- A. Vishnu, B. Benton and D. K. Panda,
High Performance MPI on IBM 12x InfiniBand Architecture,
International Workshop on
High-Level Parallel Programming Models and
Supportive Environments,
to be held in conjunction with IPDPS '07,
March 2007. Conference Slides
- K. Vaidyanathan, W. Huang, L. Chai and D. K. Panda,
Designing Efficient
Asynchronous Memory Operations Using Hardware Copy Engine: A Case Study
with I/OAT , International Workshop on
Communication Architecture for Clusters,
to be held in conjunction with IPDPS '07,
March 2007.
-
A. R Mamidala, S. Narravula, A. Vishnu, G. Santhanaraman, and D. K. Panda,
Using Connection-Oriented and Connection-Less
Transport on Performance and Scalability of Collective
and One-sided operations: Trade-offs and Impact,
International Symposium on Principles and Practice of Parallel
Programming (PPoPP 2007), March 2007, San Jose, California.
Conference Slides
- S. Sur, M. Koop and D. K. Panda, High-Performance and Scalable MPI
over InfiniBand with Reduced Memory Usage: An In-Depth Performance
Analysis, SC '06, November 2006.
- A. Vishnu, P. Gupta, A. Mamidala and D. K. Panda, A Software Based
Approach for Providing Network Fault Tolerance in Clusters Using the
uDAPL Interface: MPI Level Design and Performance Evaluation,
SC '06, November 2006.
- L. Chai, A. Hartono and D. K. Panda,
Designing Efficient MPI Intra-node Communication Support for Modern
Computer Architectures,
Int'l IEEE Conference on Cluster Computing, September 2006.
-
A. R. Mamidala, A. Vishnu, D. K. Panda,
Efficient Shared Memory and RDMA based design for MPI_Allgather over InfiniBand,
EuroPVM/MPI, September 2006.
- M. Koop, W. Huang, A. Vishnu and D. K. Panda,
Memory Scalability Evaluation of the Next-Generation
Intel Bensley Platform with InfiniBand,
Int'l Symposium on Hot Interconnect (HotI), August 2006.
- Q. Gao, W. Yu, W. Huang and D. K. Panda,
Application-Transparent Checkpoint/Restart for MPI Programs over InfiniBand,
Int'l Conference on Parallel Processing (ICPP), August 2006.
- W. Huang, J. Liu, B. Abali and D. K. Panda,
A
Case for High Performance Computing with Virtual Machines,
Int'l Conference on Supercomputing (ICS), June 2006.
- Jiuxing Liu, Wei Huang, Bulent Abali and Dhabaleswar K. Panda, High Performance VMM-Bypass I/O in Virtual Machines , USENIX Annual Technical Conference, June 2006.
- W. Huang, G. Santhanaraman, H.-W. Jin, Q. Gao and D. K. Panda,
Design and Implementation of High
Performance MVAPICH2: MPI2 over InfiniBand,
Int'l Sympsoium on Cluster Computing and the Grid (CCGrid), Singapore,
May 2006
- L. Chai, R. Noronha and D. K. Panda,
MPI over uDAPL: Can High Performance and
Portability Exist Across Architectures?
Int'l Sympsoium on Cluster Computing and the Grid (CCGrid), Singapore,
May 2006
-
A. Mamidala, L. Chai, H.-W. Jin and D. K. Panda,
Efficient SMP-Aware MPI-Level Broadcast over
InfiniBand's Hardware Multicast,
Communication Architecture for Clusters (CAC) Workshop,
to be held in conjunction with
Int'l Parallel and Distributed Processing Symposium (IPDPS '06),
April 2006, Rhode Island, Greece.
Conference Slides
-
P. Balaji, S. Bhagvat, H.-W. Jin and D. K. Panda,
Asynchronous Zero-Copy Communication for
Synchronous Sockets Direct Protocol (SDP) over InfiniBand,
Communication Architecture for Clusters (CAC) Workshop,
to be held in conjunction with
Int'l Parallel and Distributed Processing Symposium (IPDPS '06),
April 2006, Rhode Island, Greece.
-
W. Yu, R. Noronha, S. Liang and D. K. Panda,
Benefits of High Speed Interconnects to
Cluster File Systems: A Case Study with Lustre,
Communication Architecture for Clusters (CAC) Workshop,
to be held in conjunction with
Int'l Parallel and Distributed Processing Symposium (IPDPS '06),
April 2006, Rhode Island, Greece.
-
P. Balaji, K. Vaidyanathan, S. Narravula, H.-W. Jin, and D. K. Panda,
Designing Next-Generation Data-Centers with Advanced Communication Proto
cols and Systems Services, Workshop on NSF Next Generation Software(NGS) Pro
gram; held in conjuction with IPDPS, Greece, 2006.
-
S. Sur, L. Chai, H.-W. Jin and D. K. Panda,
Shared Receive Queue based
Scalable MPI Design for InfiniBand Clusters,
Int'l Parallel and Distributed Processing Symposium (IPDPS '06),
April 2006, Rhode Island, Greece.
Conference Slides
- Weikuan Yu, Qi Gao and Dhabaleswar K. Panda, Adaptive Connection Management for Scalable MPI over InfiniBand , International Parallel and Distributed Processing Symposium (IPDPS 2006). Conference Slides
- S. Sur, H.-W. Jin, L. Chai and D. K. Panda RDMA Read Based Rendezvous Protocol for MPI over InfiniBand: Design Alternatives and Benefits , Symposium on Principles and Practice of Parallel Programming (PPOPP'06), March 29-31, 2006, Manhattan, New York City.
-
S. Sur, U. Bondhugula, A. Mamidala, H.-W. Jin, and D. K. Panda,
High
Performance RDMA Based All-to-all Broadcast for InfiniBand Clusters,
International Conference on High Performance Computing (HiPC 2005), December 18-21, 2005, Goa, India.
Conference Slides
- A. Vishnu, G. Santhanaraman, W. Huang, H. -W. Jin and D. K. Panda Supporting MPI-2 One Sided Communication on Multi-Rail InfiniBand Clusters: Design Challenges and Performance Benefits , International Conference on High Performance Computing (HiPC 2005), December 18-21, 2005, Goa, India. Accepted for publication. Conference Slides
- L. Chai, R. Noronha, P. Gupta, G. Brown, and D. K. Panda, Designing a Portable MPI-2 over Modern Interconnects Using uDAPL Interface, EuroPVM/MPI 2005, Sept. 2005. Conference Slides
- A. Mamidala, H. -W. Jin, and D. K. Panda, Efficient Hardware Multicast Group Management for Multiple MPI Communicators over InfiniBand, EuroPVM/MPI 2005, Sept. 2005. Conference Slides
- W. Huang, G. Santharaman, H. -W. Jin, and D. K. Panda, Design Alternatives and Performance Trade-offs for Implementing MPI-2 over InfiniBand, EuroPVM/MPI 2005, Sept. 2005. Conference Slides
- S. Sur, A. Vishnu, H. -W. Jin, W. Huang, and D. K. Panda, Can Memory-Less Network Adapters Benefit Next-Generation InfiniBand Systems?, Hot Interconnect (HOTI 05), August, 2005. Conference Slides
- R. Noronha, and D. K. Panda, Performance Evaluation of MM5 on Clusters With Modern Interconnects: Scalability and Impact, Euro-Par, August 2005. Accepted for publication.
- H. -W. Jin, S. Sur, L. Chai, and D. K. Panda, LiMIC: Support for High-Performance MPI Intra-Node Communication on Linux Cluster, International Conference on Parallel Processing (ICPP-05), June 2005. Conference Slides
- L. Chai, S. Sur, H. -W. Jin, D. K. Panda, Analysis of Design Considerations for Optimizing Multi-Channel MPI over InfiniBand , Workshop on Communication Architecture on Clusters (CAC 05) in conjunction with International Parallel and Distributed Processing Symposium (IPDPS 2005). Technical Report. Conference Slides
- W. Huang, G. Santhanaraman, H. -W. Jin, D. K. Panda, Scheduling of MPI-2 One Sided Operations over InfiniBand, Workshop on Communication Architecture on Clusters (CAC 05) in conjunction with International Parallel and Distributed Processing Symposium (IPDPS 2005). Conference Slides
- A. Vishnu, A. R. Mamidala, H. -W. Jin and D. K. Panda, Performance Modeling of Subnet Management on Fat Tree InfiniBand Networks using OpenSM , Workshop on System Management Tools on Large Scale Parallel Systems, Held in Conjunction with IPDPS 2005. Conference Slides
- W. Yu, J. Wu and D. K. Panda, Scalable Startup of Parallel Programs over InfiniBand , International Conference on High Performance Computing, HiPC 2004, Bangalore, India. Conference Slides .
- J. Liu, A. Vishnu, D. K. Panda, Building Multirail InfiniBand Clusters: MPI-Level Design and Performance Evaluation, SuperComputing 2004 Conference (SC 04), November, 2004. Conference Slides
- A. Mamidala, J. Liu, and D. K. Panda, Efficient Barrier and Allreduce on IBA clusters using hardware multicast and adaptive algorithms, IEEE Cluster Computing 2004, Sept. 20-23 2004, San Diego, California. Conference Slides .
- G. Santhanaraman, J. Wu and D. K. Panda, Zero-Copy MPI Derived Datatype Communication over InfiniBand, EuroPVM/MPI 2004, Sept. 19-22 2004, Budapest, Hungary. Conference Slides .
- W. Jiang, J. Liu, H. -W. Jin, D. K. Panda, D. Buntinas, R. Thakur, and W. Gropp, Efficient Implementation of MPI-2 Passive One-Sided Communication on InfiniBand Clusters , EuroPVM/MPI 2004, Sept. 19-22 2004, Budapest, Hungary. Conference Slides .
- J. Liu, A. Mamidala, A. Vishnu, and D. K. Panda, Performance Evaluation of InfiniBand with PCI Express , Hot Interconnect 12 (HOTI 04), August, 2004. Conference Slides
- S. Sur, H. -W. Jin, and D. K. Panda, Efficient and Scalable All-to-All Exchange for InfiniBand-based Clusters. International Conference on Parallel Processing (ICPP-04), Aug. 15-18, 2004, Montreal, Quebec, Canada. Conference Slides
-
J. Liu, W. Jiang, P. Wyckoff, D. K. Panda, D. Ashton, D. Buntinas,
W. Gropp, and B. Toonen.
Design and Implementation of MPICH2 over InfiniBand with RDMA Support.
In Int'l Parallel and Distributed Processing Symposium (IPDPS 04), April, 2004.
Conference Slides
-
J. Liu, A. Mamidala and D. K. Panda.
Fast and Scalable MPI-Level Broadcast using InfiniBand's Hardware Multicast Support.
In Int'l Parallel and Distributed Processing Symposium (IPDPS 04), April, 2004.
Conference Slides
-
Jiesheng Wu, Pete Wyckoff, and Dhabaleswar K. Panda.
High Performance Implementation of MPI Datatype Communication over InfiniBand.
In Int'l Parallel and Distributed Processing Symposium (IPDPS 04), April, 2004.
-
Jiuxing Liu and Dhabaleswar K. Panda.
Implementing Efficient and Scalable Flow Control Schemes in MPI over InfiniBand.
In Workshop on Communication Architecture for Clusters (CAC 04), Held in Conjunction with Int'l Parallel and Distributed Processing Symposium (IPDPS 04), April, 2004.
Conference Slides
-
W. Jiang, J. Liu, H. -W. Jin, D. K. Panda, W. Gropp, and R. Thakur.
High Performance MPI-2 One-Sided Communication over InfiniBand.
In 4th IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid 04), April, 2004.
Conference Slides .
-
Jiuxing Liu, Balasubramanian Chandrasekaran, Jiesheng Wu, Weihang Jiang,
Sushmitha Kini, Weikuan Yu, Darius Buntinas, Pete Wyckoff, and Dhabaleswar K. Panda.
Performance Comparison of MPI Implementations over InfiniBand, Myrinet and Quadrics.
In SuperComputing 2003 Conference, Pheonix, AZ, November, 2003.
- Sushmitha P. Kini, Jiuxing Liu, Jiesheng Wu, Pete Wyckoff, and Dhabaleswar K. Panda. Fast and Scalable Barrier using RDMA and Multicast Mechanisms for InfiniBand-Based Clusters. In Euro PVM/MPI Conference, September 29-Oct 2, 2003, Venice, Italy.
-
Jiuxing Liu, Jiesheng Wu, Sushmitha P. Kini, Peter Wyckoff, and Dhabaleswar K. Panda.
High Performance RDMA-Based MPI Implementation over InfiniBand.
In the Proceedings of 17th Annual ACM International Conference on Supercomputing. San Francisco Bay Area. June, 2003.
PhD Dissertations
- Matthew Koop, High-Performance Multi-Transport MPI Design for Ultra-Scale InfiniBand Clusters, June 2009.
- Gopalakrishnan Santhanaraman, Designing Scalable and High Performance One Sided Communication Middleware for Modern Interconnects, June 2009.
- Lei Chai, High Performance and Scalable MPI Intra-Node Communication Middleware for Multi-core Clusters, February 2009.
- Wei Huang, High Performance Network I/O in Virtual Machines Over Modern Interconnects, August 2008.
- Amith Mamidala, Scalable and High Performance Collective Communication For Next Generation Multicore InfiniBand Clusters, May 2008.
- Abhinav Vishnu, High Performance and Network Fault Tolerant MPI with Multi-Pathing Over InfiniBand, Dec 2007.
- Sayantan Sur, Scalable and High Performance MPI Design for Very Large InfiniBand Clusters, August 2007.
- Jiuxing Liu, Designing High Performance and Scalable MPI over InfiniBand, Sept 2003.
M.S. Thesis
- T. Gangadharappa, Designing Support for MPI-2 Programming Interfaces on Modern Interconnects, June 2009.
- R. Kumar, Enhancing MPI Point-to-Point and Collectives for Clusters with Onloaded/Offloaded InfiniBand Adapters, August 2008.
- S. Bhagvat, Designing and Enhancing the Sockets Direct Protocol (SDP) over iWARP and InfiniBand, August 2006.
- W. Jiang, High Performance MPICH2 One-Sided Communication Implementation over InfiniBand, , June 2004.
- B. Chandrasekharan, Micro-benchmark Level Performance Evaluation and Comparison of High Speed Cluster Interconnects September 2003.

