Publications
This page lists publications from the group related to designing High Performance MPI on InfiniBand. In addition, the group is also actively engaged in other research directions (PVFS and MPI-IO, Micro-Benchmark suite, Distributed Shared Memory, ARMCI, and Datacenter) related to modern interconnects. Publications related to these research directions are also included in the corresponding links.
- High Performance MPI on IBA
- High Performance MPI for Clusters with GPUs
- Networking, Storage and Middleware Support for Cloud Computing and DataCenters
- High Performance Runtime for UPC/PGAS
- Fault Tolerance Backplane - InfiniBand
- Clustered Storage and File Systems
- Open Solaris NFS over RDMA Project
- Efficient Shared Memory on High-Speed Interconnects
- High Performance Networking for TCP-based Applications
- High Performance Computing with Virtual Machines
- Design and Evaluation of Communications Protocols over InfiniBand WAN
High Performance MPI on IBA
Journals
-
S. Sur, S. Potluri, K. Kandalla, H. Subramoni, K. Tomko and D. K. Panda,
Co-Designing MPI Library and Applications for InfiniBand Clusters,
IEEE Computer, November 2011.
- P. Lai, P. Balaji, R. Thakur, and D. K. Panda, ProOnE: A General-Purpose Protocol Onload Engine for Multi- and Many-Core Architectures, Computer Science: Research and Development, Special Issue of Scientific Papers from ISC '09, 2009.
- A. Vishnu, M. Koop, A. Moody, A. Mamidala, S. Narravula, and D. K. Panda, Topology Agnostic Hot-Spot Avoidance with InfiniBand, Concurrency and Computation: Practice and Experience, Special Issue of Best Papers from CCGrid '07, 2008.
-
J. Liu, A. Mamidala, A. Vishnu and D. K. Panda.
Performance Evaluation of InfiniBand with PCI Express.
IEEE Micro, 2005.
-
J. Liu, J. Wu, and D. K. Panda.
High Performance RDMA-Based MPI Implementation over InfiniBand.
In Int'l Journal of Parallel Programming, 2004.
-
J. Liu, B. Chandrasekaran, W. Yu, J. Wu, D. Buntinas, S. Kini, P. Wyckoff, and D. K. Panda.
Micro-Benchmark Performance Comparison of High-Speed Cluster Interconnects.
IEEE Micro, January/February, 2004.
Conferences/Workshops
-
H. Subramoni, S. Potluri, K. Kandalla, B. Barth, J. Vienne, J. Keasler,
K. Tomko, K. Schulz, A. Moody and D. K. Panda,
Design of a Scalable InfiniBand Topology Service to Enable
Network-Topology-Aware Placement of Processes,
Int'l Conference on Supercomputing (SC '12),
November 2012.
Best Paper and Best Student Paper Finalist.
-
M. Luo, H. Wang and D. K. Panda,
Multi-Threaded UPC Runtime for GPU to GPU communication over InfiniBand,
Int'l Conference on Partitioned Global Address Space Programming
Models (PGAS '12),
October 2012.
-
S. Potluri, K. Kandalla, D. Bureddy, M.M. Luo, H. Wang and
D. K. Panda,
Efficient Intranode Designs for OpenSHMEM on Multicore Clusters,
Int'l Conference on Partitioned Global Address Space Programming
Models (PGAS '12),
October 2012.
-
D. Bureddy, H. Wang, A. Venkatesh, S. Potluri and D. K. Panda,
OMB-GPU: A Micro-benchmark suite for Evaluating MPI Libraries on GPU Clusters,
EuroMPI 2012,
September 2012.
-
R. Rajachandrasekar, J. Jaswani, H. Subramoni and D. K. Panda,
Minimizing Network Contention in InfiniBand Clusters with a
QoS-Aware Data-Staging Framework,
IEEE Cluster (Cluster '12),
September 2012.
-
K. Kandalla, A. Buluç, H. Subramoni, K. Tomko, J. Vienne, L. Oliker
and D. K. Panda,
Can Network-Offload based Non-Blocking Neighborhood MPI Collectives
Improve Communication Overheads of Irregular Graph Algorithms?
Int'l Workshop on Parallel Algorithm and Parallel Software (IWPAPS12),
held in conjunction with IEEE Cluster (Cluster '12),
September 2012.
-
J. Jose, K. Kandalla, M. Luo and D. K. Panda,
Supporting Hybrid MPI and OpenSHMEM over InfiniBand: Design
and Performance Evaluation,
Int'l Conference on Parallel Processing (ICPP '12),
September 2012.
-
H. Subramoni, J. Vienne and D. K. Panda,
A Scalable InfiniBand Network-Topology-Aware
Performance Analysis Tool for MPI,
Int'l Workshop on Productivity and Performance (Proper '12),
held in conjunction with EuroPar,
August 2012.
-
J. Vienne, J. Chen, M.-W. Rahman, N. Islam, H. Subramoni
and D. K. Panda,
Performance Analysis and Evaluation of InfiniBand FDR and 40GigE RoCE
on HPC and Cloud Computing System,
Int'l Symposium on High-Performance Interconnects (HotI 2012),
August 2012.
-
M. Luo, D. K. Panda,
C. Iancu and K. Z. Ibrahim,
Congestion Avoidance on Manycore High Performance
Computing Systems,
Int'l Conference on Supercomputing (ICS '12),
June 2012.
-
M. Luo, H. Wang, J. Vienne and
D. K. Panda,
Redesigning MPI Shared Memory Communication for Large
Multi-Core Architecture,
Int'l Supercomputing Conference (ISC '12),
June 2012.
-
K. Kandalla, U. Yang, J. Keasler, T. Kolev, A. Moody,
H. Subramoni, K. Tomko, J. Vienne and D. K. Panda,
Designing Non-blocking Allreduce with Collective Offload
on InfiniBand Clusters: A Case Study with
Conjugate Gradient Solvers,
Int'l Parallel and Distributed Processing Symposium (IPDPS '12),
May 2012.
-
S. P. Raikar, H. Subramoni, K. Kandalla, J. Vienne and
D. K. Panda,
Designing Network Failover and Recovery in MPI for Multi-Rail
InfiniBand Clusters,
Int'l Workshop on System Management Techniques, Processes,
and Services (SMTPS), in conjunction with
Int'l Parallel and Distributed Processing Symposium (IPDPS '12),
May 2012.
-
R. Rajachandrasekar, X. Besseron and
D. K. Panda,
Monitoring and Predicting Hardware Failures in
HPC Clusters with FTB-IPMI,
Int'l Workshop on System Management Techniques, Processes,
and Services (SMTPS), in conjunction with
Int'l Parallel and Distributed Processing Symposium (IPDPS '12),
May 2012.
-
S. Potluri, H. Wang, D. Bureddy, A. K. Singh, C. Rosales and
D. K. Panda,
Optimizing MPI Communication on Multi-GPU Systems
using CUDA Inter-Process Communication,
Int'l Workshop on Accelerators and Hybrid Exascale Systems (AsHES),
in conjunction with
Int'l Parallel and Distributed Processing Symposium (IPDPS '12),
May 2012.
Slides
-
S. Potluri, K. Tomko, D. Bureddy and
D. K. Panda,
Intra-MIC MPI Communication using MVAPICH2: Early Experience,
TACC-Intel Highly-Parallel Computing Symposium,
April 2012,
Best Student Paper Award.
Conference Slides
-
M. Luo, J. Jose, S. Sur and D. K. Panda,
Multi-threaded UPC Runtime with
Network Endpoints: Design Alternatives and Evaluation on Multi-core
Architectures,
Int'l Conference on High Performance Computing (HiPC '11),
Dec. 2011.
- J. Jose, S. Potluri, M. Luo, S. Sur and D. K. Panda,
UPC Queues for Scalable
Graph Traversals: Design and Evaluation on InfiniBand Clusters,
Fifth Conference on Partitioned Global Address Space
Programming Model (PGAS '11),
Oct. 2011.
-
A. Singh, S. Potluri, H. Wang, K. Kandalla, S. Sur and D. K. Panda,
MPI Alltoall Personalized Exchange on GPGPU Clusters: Design Alternatives
and Benefits,
Workshop on Parallel Programming on Accelerator Clusters (PPAC '11),
held in conjunction with Cluster '11, Sept. 2011.
-
H. Subramoni, K. Kandalla, J. Vienne, S. Sur, B. Barth, K. Tomko,
R. McLay, K. Schulz and D. K. Panda,
Design and Evaluation of Network Topology-/Speed-Aware Broadcast Algorithms
for InfiniBand Clusters,
IEEE Cluster '11, Sept. 2011.
-
H. Wang, S. Potluri, M. Luo, A. Singh, X. Ouyang, S. Sur and
D. K. Panda,
Optimized Non-contiguous MPI Datatype Communication for GPU Clusters:
Design, Implementation and Evaluation with MVAPICH2,
IEEE Cluster '11, Sept. 2011.
-
S. Potluri, H. Wang, V. Dhanraj, S. Sur and D. K. Panda,
Optimizing MPI One Sided Communication on Multi-core InfiniBand Clusters
using Shared Memory Backed Windows,
EuroMPI '11, Sept. 2011.
-
S. Potluri, S. Sur, D. Bureddy and D. K. Panda,
Design and Implementation of Key Proposed MPI-3 One-Sided Communication
Semantics on InfiniBand, Poster/Short Paper,
EuroMPI '11, Sept. 2011.
-
X. Ouyang, R. Rajachandrasekar, X. Besseron, H. Wang, J. Huang
and D. K. Panda,
CRFS: A Lightweight User-Level Filesystem for Generic Checkpoint/Restart,
Int'l Conference on Parallel Processing (ICPP '11), Sept. 2011.
- R. Rajachandrasekar, X. Ouyang, X. Besseron, V. Meshram and
D. K. Panda,
Can Checkpoint/Restart Mechanisms Benefit from Hierarchical Data Staging?
Workshop on Resiliency in High Performance Computing
in Clusters, Clouds, and Grids (Resilience '11), held
in conjunction with EuroPar, Aug. 2011.
-
N. Dandapanthula, H. Subramoni, J. Vienne, K. Kandalla, S. Sur,
D. K. Panda, and R. Brightwell,
INAM - A Scalable InfiniBand Network Analysis and Monitoring Tool,
4th Int'l Workshop on Productivity and Performance (PROPER 2011),
in conjunction with EuroPar, Aug. 2011.
-
K. Kandalla, H. Subramoni, J. Vienne, K. Tomko, S. Sur
and D. K. Panda,
Designing Non-blocking Broadcast
with Collective Offload on InfiniBand Clusters: A Case Study with HPL,
Hot Interconnect '11, Aug. 2011.
Conference Slides
-
K. Kandalla, H. Subramoni, K. Tomko, D. Pekurovsky,
S. Sur and D. K. Panda,
High-Performance and Scalable Non-Blocking All-to-All with Collective
Offload on InfiniBand Clusters: A Study with Parallel 3D FFT,
Int'l Supercomputing Conference (ISC), June 2011.
Conference Slides
-
H. Wang, S. Potluri, M. Luo, A. Singh,
S. Sur and D. K. Panda,
MVAPICH2-GPU: Optimized GPU to GPU Communication for
InfiniBand Clusters,
Int'l Supercomputing Conference (ISC), June 2011.
Conference Slides
-
X. Ouyang, R. Rajachandrasekar, X. Besseron, D. K. Panda,
High Performance Pipelined Process Migration with RDMA,
The 11th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing
(CCGrid 2011), Newport Beach, CA, May 2011.
Conference Slides
-
Y. Cui, K.B. Olsen, T. H. Jordan, K. Lee, J. Zhou, P. Small, D. Roten,
G. Ely, D. K. Panda, A. Chourasia, J. Levesque, S. M. Day,
P. Maechling,
Scalable Earthquake Simulation on Petascale Supercomputers,
SuperComputing (SC), November 2010. Gordon Bell
Prize Finalist
-
J. Jose, M. Luo, S. Sur and D. K. Panda, Unifying UPC and MPI Runtimes: Experience with MVAPICH,
Fourth Conference on Partitioned Global Address Space Programming
Model (PGAS10), Oct. 2010
Conference Slides.
-
X. Ouyang, S. Marcarelli, R. Rajachandrasekar and D. K. Panda, RDMA-Based Job Migration Framework for MPI over InfiniBand,
IEEE International Conference on Cluster Computing 2010 (Cluster '10), Sept. 2010,
Conference Slides.
-
H. Subramoni, P. Lai, S. Sur and D. K. Panda,
Improving Application Performance and Predictability using Multiple
Virtual Lanes in Modern Multi-Core InfiniBand Clusters,
International Conference on Parallel Processing (ICPP '10),
Sept. 2010. Conference Slides.
-
K. Kandalla, E. P. Mancini, S. Sur and D. K. Panda, Designing Power-Aware Collective Communication Algorithms
for InfiniBand Clusters, International Conference on Parallel Processing (ICPP '10), Sept. 2010. Conference Slides.
-
M. Luo, S. Potluri, P. Lai, E. P. Mancini, H. Subramoni, K. Kandalla,
S. Sur and D. K. Panda,
High Performance Design and Implementation of Nemesis Communication
Layer for Two-sided and One-Sided MPI Semantics in MVAPICH2,
Int'l Workshop on Parallel Programming Models and Systems Software for
High-End Computing (P2S2 '10), in Conjunction with ICPP '10,
Sept. 2010, Conference Slides.
-
H. Subramoni, K. Kandalla, S. Sur and D. K. Panda,
Design and Evaluation of Generalized Collective Communication
Primitives with Overlap using ConnectX-2 Offload Engine,
Int'l Symposium on Hot Interconnects (HotI), Aug. 2010.
Conference Slides.
-
S. Potluri, P. Lai, K. Tomko, S. Sur, Y. Cui, M. Tatineni, K. W. Schulz, W. L. Barth, A. Majumdar and D. K. Panda,
Quantifying Performance Benefits of Overlap using MPI-2 in a Seismic Modeling Application,
International Conference on Supercomputing (ICS), June 2010.
Conference Slides.
-
P. Lai, S. Sur and D. K. Panda,
Designing Truly One-Sided MPI-2 RMA Intra-node Communication on Multi-core Systems,
International Supercomputing Conference (ISC '10), June 2010.
Conference Slides.
Best Paper Award at ISC '10.
-
E. P. Mancini, G. Marsh and D. K. Panda,
An MPI-Stream Hybrid Programming Model for Computational Clusters,
Int'l Symposium on Cluster Computing and the Grid (CCGrid), May 2010.
-
X. Ouyang, S. Marcarelli and D. K. Panda,
Enhancing Checkpoint Performance with Staging IO and SSD,
IEEE International Workshop on Storage Network Architecture and Parallel I/Os (SNAPI), May 2010.
-
K. Kandalla, H. Subramoni, A. Vishnu and D. K. Panda,
Designing Topology-Aware Collective Communication Algorithms for Large Scale InfiniBand Clusters:
Case Studies with Scatter and Gather,
The 10th Workshop on Communication Architecture for Clusters (CAC 10), Apr. 2010.
-
M. Koop, P. Shamis, I. Rabinovitz and D. K. Panda,
Designing High-Performance and Resilient Message Passing on InfiniBand,
The 10th Workshop on Communication Architecture for Clusters (CAC 10), Apr. 2010.
-
X. Ouyang, K. Gopalakrishnan, T. Gangadharappa and D. K. Panda,
Fast Checkpointing by Write Aggregation with Dynamic Buffer and Interleaving on
Multicore Architecture,
Int'l Conference on High Performance Computing (HiPC'09), Dec. 2009.
Conference Slides.
-
X. Ouyang, K. Gopalakrishnan and D. K. Panda,
Accelerating Checkpoint Operation by Node-Level Write Aggregation on Multicore Systems,
Int'l Conference on Parallel Processing (ICPP '09), Sept. 2009.
Conference Slides.
- T. Gangadharappa, M. Koop and D. K. Panda, Designing and Evaluating MPI-2 Dynamic Process Management Support for InfiniBand, Int'l Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2 '09), in Conjunction with ICPP '09, Sept. 2009. Conference Slides.
- J. Sridhar and D. K. Panda, Impact of Node Level Caching in MPI Job Launch Mechanisms, The 16th Euro PVM/MPI conference, September 2009. Conference Slides.
- G. Santhanaraman, T. Gangadharappa, S. Narravula, A. Mamidala, D. K. Panda, Design Alternatives for Implementing Fence Synchronization in MPI-2 One-sided Communication on InfiniBand Clusters, IEEE Cluster 2009, September 2009. Conference Slides.
- M. Koop, M. Luo, D. K. Panda, Reducing Network Contention with Mixed Workloads on Modern Multicore Clusters, IEEE Cluster 2009, September 2009. Conference Slides.
- H. Subramoni, P. Lai, M. Luo, D. K. Panda, RDMA over Ethernet - A Preliminary Study, Workshop on High Performance Interconnects for Distributed Computing (HPIDC'09), September 2009. Conference Slides.
- H. Subramoni, M. Koop, and D. K. Panda, Designing Next Generation Clusters: Evaluation of InfiniBand DDR/QDR on Intel Computing Platforms, 17th Annual Symposium on High-Performance Interconnects (HotI'09), August 2009.
- G. Santhanaraman, P. Balaji, K. Gopalakrishnan, R. Thakur, W. Gropp and D. K. Panda, Natively Supporting True One-sided Communication in MPI on Multi-core Systems with InfiniBand, Int'l Symposium on Cluster Computing and the Grid (CCGrid), May 2009.
- M. Koop, J. Sridhar and D. K. Panda, TupleQ: Fully-Asynchronous and Zero-Copy MPI over InfiniBand, 23rd IEEE International Parallel and Distributed Processing Symposium (IPDPS 09), May 2009. Conference Slides.
- K. Kandalla, H. Subramoni, G. Santhanaraman, M. Koop and D. K. Panda, Designing Multi-Leader-Based Allgather Algorithms for Multi-Core Clusters, The 9th Workshop on Communication Architecture for Clusters (CAC 09), May 2009. Conference Slides.
- J. Sridhar, M. Koop, J. Perkins and D. K. Panda, ScELA: Scalable and Extensible Launching Architecture for Clusters, International Conference on High Performance Computing (HiPC 08), December 2008.
- M. Koop, J. Sridhar and D.K. Panda, Scalable MPI Design over InfiniBand using eXtended Reliable Connection, IEEE Cluster 2008, September 2008. Conference Slides.
- W. Huang, M. Koop and D.K. Panda, Efficient One-Copy MPI Shared Memory Communication in Virtual Machines, IEEE Cluster 2008, September 2008. Conference Slides.
- L. Chai, P. Lai, H.-W. Jin and D. K. Panda, Designing An Efficient Kernel-level and User-level Hybrid Approach for MPI Intra-node Communication on Multi-core Systems, Int'l Conference on Parallel Processing (ICPP '08), September, 2008. Conference Slides.
-
R. Kumar, A. Mamidala, M. Koop, G. Santhanaraman and D.K. Panda,
Lock-free Asynchronous Rendezvous Design for MPI Point-to-Point Communication,
EuroPVM/MPI 2008, Sept. 2008.
Conference Slides
-
M. Koop, W. Huang, K. Gopalakrishnan and D.K. Panda,
Performance Analysis and Evaluation of PCIe 2.0 and Quad-Data Rate InfiniBand,
16th IEEE Symposium on Hot Interconnects, August 2008.
Conference Slides
- M. Koop, R. Kumar, and D. K. Panda, Can Software Reliability Outperform Hardware Reliability on High Performance Interconnects? A Case Study with MPI over InfiniBand, 22nd ACM International Conference on Supercomputing (ICS08), Island of Kos, Greece, June 2008. Conference Slides
- A. Mamidala, R. Kumar, D. De and D. K. Panda, MPI Collectives on Modern Multicore Clusters: Performance Optimizations and Communication Characteristics, Int'l Symposium on Cluster Computing and the Grid (CCGrid), Lyon, France, May 2008. Conference Slides
- M. Koop, T. Jones, and D. K. Panda, MVAPICH-Aptus: Scalable High-Performance Multi-Transport MPI over InfiniBand, IEEE International Parallel and Distributed Processing Symposium (IPDPS '08), Miami, Florida, April 2008. Conference Slides
- G. Santhanaraman, S. Narravula and D. K. Panda, Designing Passive Synchronization for MPI-2 One-Sided Communication to Maximize Overlap, IEEE International Parallel and Distributed Processing Symposium (IPDPS '08), Miami, Florida, April, 2008. Conference Slides
- R. Kumar, A. Mamidala and D. K. Panda, Scaling Alltoall Collective on Multi-core Systems, Workshop on Communication Architecture for Clusters, to be held in conjunction with IPDPS '08, Miami, Florida, April 2008. Conference Slides
- M. Koop, S. Sur and D. K. Panda, Zero-Copy Protocol for MPI using InfiniBand Unreliable Datagram, IEEE International Conference on Cluster Computing (Cluster'07), Austin, TX, September 2007. Conference Slides
- S. Narravula, A. R. Mamidala, A. Vishnu, G. Santhanaraman, and D. K. Panda, High Performance MPI over iWARP: Early Experiences, Int'l Conference on Parallel Processing, XiAn, China, September 2007. Conference Slides
- S. Sur, M. Koop, L. Chai and D. K. Panda, Performance Analysis and Evaluation of Mellanox ConnectX InfiniBand Architecture with Multi-Core Platforms, 15th Symposium on Hot Interconnects, August 2007. Conference Slides
- M. Koop, S. Sur, Q. Gao and D. K. Panda, High Performance MPI Design using Unreliable Datagram for Ultra-Scale InfiniBand Clusters, 21st Int'l ACM Conference on Supercomputing, June 2007. Conference Slides
-
W. Huang, J. Liu, M. Koop, B. Abali and D. K. Panda,
Nomad: Migrating OS-bypass Networks in Virtual Machines,
Third International ACM SIGPLAN/SIGOPS Conference on Virtual Execution
Environments (VEE),
San Diego, California,
June 13-15, 2007
-
A. Vishnu, M. Koop, A. Moody, A. Mamidala, S. Narravula and
D. K. Panda, Hot-Spot Avoidance With Multi-Pathing Over InfiniBand: An
MPI Perspective,
Int'l Symposium on Cluster Computing and the Grid (CCGrid),
Rio de Janeiro - Brazil,
May 2007.
Conference Slides
-
M. Koop, T. Jones and D. K. Panda,
Reducing Connection Memory
Requirements of MPI for InfiniBand Clusters: A Message Coalescing
Approach,
Int'l Symposium on Cluster Computing and the Grid (CCGrid),
Rio de Janeiro - Brazil,
May 2007.
Conference Slides
-
L. Chai, Q. Gao and D. K. Panda,
Understanding the Impact of
Multi-Core Architecture in Cluster Computing: A Case Study with Intel
Dual-Core System,
Int'l Symposium on Cluster Computing and the Grid (CCGrid),
Rio de Janeiro - Brazil,
May 2007
- A. Vishnu, A. Mamidala, S. Narravula and D. K. Panda,
Automatic Path Migration over InfiniBand: Early Experiences,
Third International Workshop on
System Management Techniques, Processes, and Services,
to be held in conjunction with IPDPS '07,
March 2007. Conference Slides
- A. Vishnu, B. Benton and D. K. Panda,
High Performance MPI on IBM 12x InfiniBand Architecture,
International Workshop on
High-Level Parallel Programming Models and
Supportive Environments,
to be held in conjunction with IPDPS '07,
March 2007. Conference Slides
- K. Vaidyanathan, W. Huang, L. Chai and D. K. Panda,
Designing Efficient
Asynchronous Memory Operations Using Hardware Copy Engine: A Case Study
with I/OAT, International Workshop on
Communication Architecture for Clusters,
to be held in conjunction with IPDPS '07,
March 2007.
-
A. R. Mamidala, S. Narravula, A. Vishnu, G. Santhanaraman, and D. K. Panda,
Using Connection-Oriented and Connection-Less
Transport on Performance and Scalability of Collective
and One-sided operations: Trade-offs and Impact,
International Symposium on Principles and Practice of Parallel
Programming (PPoPP 2007), March 2007, San Jose, California.
Conference Slides
- S. Sur, M. Koop and D. K. Panda, High-Performance and Scalable MPI
over InfiniBand with Reduced Memory Usage: An In-Depth Performance
Analysis, SC '06, November 2006.
- A. Vishnu, P. Gupta, A. Mamidala and D. K. Panda, A Software Based
Approach for Providing Network Fault Tolerance in Clusters Using the
uDAPL Interface: MPI Level Design and Performance Evaluation,
SC '06, November 2006.
- L. Chai, A. Hartono and D. K. Panda,
Designing Efficient MPI Intra-node Communication Support for Modern
Computer Architectures,
Int'l IEEE Conference on Cluster Computing, September 2006.
-
A. R. Mamidala, A. Vishnu, D. K. Panda,
Efficient Shared Memory and RDMA based design for MPI_Allgather over InfiniBand,
EuroPVM/MPI, September 2006.
- M. Koop, W. Huang, A. Vishnu and D. K. Panda,
Memory Scalability Evaluation of the Next-Generation
Intel Bensley Platform with InfiniBand,
Int'l Symposium on Hot Interconnect (HotI), August 2006.
- Q. Gao, W. Yu, W. Huang and D. K. Panda,
Application-Transparent Checkpoint/Restart for MPI Programs over InfiniBand,
Int'l Conference on Parallel Processing (ICPP), August 2006.
- W. Huang, J. Liu, B. Abali and D. K. Panda,
A
Case for High Performance Computing with Virtual Machines,
Int'l Conference on Supercomputing (ICS), June 2006.
- Jiuxing Liu, Wei Huang, Bulent Abali and Dhabaleswar K. Panda, High Performance VMM-Bypass I/O in Virtual Machines, USENIX Annual Technical Conference, June 2006.
- W. Huang, G. Santhanaraman, H.-W. Jin, Q. Gao and D. K. Panda,
Design and Implementation of High
Performance MVAPICH2: MPI2 over InfiniBand,
Int'l Symposium on Cluster Computing and the Grid (CCGrid), Singapore,
May 2006
- L. Chai, R. Noronha and D. K. Panda,
MPI over uDAPL: Can High Performance and
Portability Exist Across Architectures?
Int'l Symposium on Cluster Computing and the Grid (CCGrid), Singapore,
May 2006
-
A. Mamidala, L. Chai, H.-W. Jin and D. K. Panda,
Efficient SMP-Aware MPI-Level Broadcast over
InfiniBand's Hardware Multicast,
Communication Architecture for Clusters (CAC) Workshop,
to be held in conjunction with
Int'l Parallel and Distributed Processing Symposium (IPDPS '06),
April 2006, Rhodes Island, Greece.
Conference Slides
-
P. Balaji, S. Bhagvat, H.-W. Jin and D. K. Panda,
Asynchronous Zero-Copy Communication for
Synchronous Sockets Direct Protocol (SDP) over InfiniBand,
Communication Architecture for Clusters (CAC) Workshop,
to be held in conjunction with
Int'l Parallel and Distributed Processing Symposium (IPDPS '06),
April 2006, Rhodes Island, Greece.
-
W. Yu, R. Noronha, S. Liang and D. K. Panda,
Benefits of High Speed Interconnects to
Cluster File Systems: A Case Study with Lustre,
Communication Architecture for Clusters (CAC) Workshop,
to be held in conjunction with
Int'l Parallel and Distributed Processing Symposium (IPDPS '06),
April 2006, Rhodes Island, Greece.
-
P. Balaji, K. Vaidyanathan, S. Narravula, H.-W. Jin, and D. K. Panda,
Designing Next-Generation Data-Centers with Advanced Communication Protocols and Systems Services,
Workshop on NSF Next Generation Software (NGS) Program, held in conjunction with IPDPS, Greece, 2006.
-
S. Sur, L. Chai, H.-W. Jin and D. K. Panda,
Shared Receive Queue based
Scalable MPI Design for InfiniBand Clusters,
Int'l Parallel and Distributed Processing Symposium (IPDPS '06),
April 2006, Rhodes Island, Greece.
Conference Slides
- Weikuan Yu, Qi Gao and Dhabaleswar K. Panda, Adaptive Connection Management for Scalable MPI over InfiniBand, International Parallel and Distributed Processing Symposium (IPDPS 2006). Conference Slides
- S. Sur, H.-W. Jin, L. Chai and D. K. Panda, RDMA Read Based Rendezvous Protocol for MPI over InfiniBand: Design Alternatives and Benefits, Symposium on Principles and Practice of Parallel Programming (PPOPP'06), March 29-31, 2006, Manhattan, New York City.
-
S. Sur, U. Bondhugula, A. Mamidala, H.-W. Jin, and D. K. Panda,
High
Performance RDMA Based All-to-all Broadcast for InfiniBand Clusters,
International Conference on High Performance Computing (HiPC 2005), December 18-21, 2005, Goa, India.
Conference Slides
- A. Vishnu, G. Santhanaraman, W. Huang, H.-W. Jin and D. K. Panda, Supporting MPI-2 One Sided Communication on Multi-Rail InfiniBand Clusters: Design Challenges and Performance Benefits, International Conference on High Performance Computing (HiPC 2005), December 18-21, 2005, Goa, India. Conference Slides
- L. Chai, R. Noronha, P. Gupta, G. Brown, and D. K. Panda, Designing a Portable MPI-2 over Modern Interconnects Using uDAPL Interface, EuroPVM/MPI 2005, Sept. 2005. Conference Slides
- A. Mamidala, H.-W. Jin, and D. K. Panda, Efficient Hardware Multicast Group Management for Multiple MPI Communicators over InfiniBand, EuroPVM/MPI 2005, Sept. 2005. Conference Slides
- W. Huang, G. Santhanaraman, H.-W. Jin, and D. K. Panda, Design Alternatives and Performance Trade-offs for Implementing MPI-2 over InfiniBand, EuroPVM/MPI 2005, Sept. 2005. Conference Slides
- S. Sur, A. Vishnu, H.-W. Jin, W. Huang, and D. K. Panda, Can Memory-Less Network Adapters Benefit Next-Generation InfiniBand Systems?, Hot Interconnect (HOTI 05), August, 2005. Conference Slides
- R. Noronha, and D. K. Panda, Performance Evaluation of MM5 on Clusters With Modern Interconnects: Scalability and Impact, Euro-Par, August 2005.
- H.-W. Jin, S. Sur, L. Chai, and D. K. Panda, LiMIC: Support for High-Performance MPI Intra-Node Communication on Linux Cluster, International Conference on Parallel Processing (ICPP-05), June 2005. Conference Slides
- L. Chai, S. Sur, H.-W. Jin, D. K. Panda, Analysis of Design Considerations for Optimizing Multi-Channel MPI over InfiniBand, Workshop on Communication Architecture on Clusters (CAC 05) in conjunction with International Parallel and Distributed Processing Symposium (IPDPS 2005). Technical Report. Conference Slides
- W. Huang, G. Santhanaraman, H.-W. Jin, D. K. Panda, Scheduling of MPI-2 One Sided Operations over InfiniBand, Workshop on Communication Architecture on Clusters (CAC 05) in conjunction with International Parallel and Distributed Processing Symposium (IPDPS 2005). Conference Slides
- A. Vishnu, A. R. Mamidala, H.-W. Jin and D. K. Panda, Performance Modeling of Subnet Management on Fat Tree InfiniBand Networks using OpenSM, Workshop on System Management Tools on Large Scale Parallel Systems, Held in Conjunction with IPDPS 2005. Conference Slides
- W. Yu, J. Wu and D. K. Panda, Scalable Startup of Parallel Programs over InfiniBand, International Conference on High Performance Computing, HiPC 2004, Bangalore, India. Conference Slides.
- J. Liu, A. Vishnu, D. K. Panda, Building Multirail InfiniBand Clusters: MPI-Level Design and Performance Evaluation, SuperComputing 2004 Conference (SC 04), November, 2004. Conference Slides
- A. Mamidala, J. Liu, and D. K. Panda, Efficient Barrier and Allreduce on IBA clusters using hardware multicast and adaptive algorithms, IEEE Cluster Computing 2004, Sept. 20-23 2004, San Diego, California. Conference Slides.
- G. Santhanaraman, J. Wu and D. K. Panda, Zero-Copy MPI Derived Datatype Communication over InfiniBand, EuroPVM/MPI 2004, Sept. 19-22 2004, Budapest, Hungary. Conference Slides.
- W. Jiang, J. Liu, H.-W. Jin, D. K. Panda, D. Buntinas, R. Thakur, and W. Gropp, Efficient Implementation of MPI-2 Passive One-Sided Communication on InfiniBand Clusters, EuroPVM/MPI 2004, Sept. 19-22 2004, Budapest, Hungary. Conference Slides.
- J. Liu, A. Mamidala, A. Vishnu, and D. K. Panda, Performance Evaluation of InfiniBand with PCI Express, Hot Interconnect 12 (HOTI 04), August, 2004. Conference Slides
- S. Sur, H.-W. Jin, and D. K. Panda, Efficient and Scalable All-to-All Exchange for InfiniBand-based Clusters. International Conference on Parallel Processing (ICPP-04), Aug. 15-18, 2004, Montreal, Quebec, Canada. Conference Slides
-
J. Liu, W. Jiang, P. Wyckoff, D. K. Panda, D. Ashton, D. Buntinas,
W. Gropp, and B. Toonen.
Design and Implementation of MPICH2 over InfiniBand with RDMA Support.
In Int'l Parallel and Distributed Processing Symposium (IPDPS 04), April, 2004.
Conference Slides
-
J. Liu, A. Mamidala and D. K. Panda.
Fast and Scalable MPI-Level Broadcast using InfiniBand's Hardware Multicast Support.
In Int'l Parallel and Distributed Processing Symposium (IPDPS 04), April, 2004.
Conference Slides
-
Jiesheng Wu, Pete Wyckoff, and Dhabaleswar K. Panda.
High Performance Implementation of MPI Datatype Communication over InfiniBand.
In Int'l Parallel and Distributed Processing Symposium (IPDPS 04), April, 2004.
-
Jiuxing Liu and Dhabaleswar K. Panda.
Implementing Efficient and Scalable Flow Control Schemes in MPI over InfiniBand.
In Workshop on Communication Architecture for Clusters (CAC 04), Held in Conjunction with Int'l Parallel and Distributed Processing Symposium (IPDPS 04), April, 2004.
Conference Slides
-
W. Jiang, J. Liu, H.-W. Jin, D. K. Panda, W. Gropp, and R. Thakur.
High Performance MPI-2 One-Sided Communication over InfiniBand.
In 4th IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid 04), April, 2004.
Conference Slides.
-
Jiuxing Liu, Balasubramanian Chandrasekaran, Jiesheng Wu, Weihang Jiang,
Sushmitha Kini, Weikuan Yu, Darius Buntinas, Pete Wyckoff, and Dhabaleswar K. Panda.
Performance Comparison of MPI Implementations over InfiniBand, Myrinet and Quadrics.
In SuperComputing 2003 Conference, Phoenix, AZ, November, 2003.
- Sushmitha P. Kini, Jiuxing Liu, Jiesheng Wu, Pete Wyckoff, and Dhabaleswar K. Panda. Fast and Scalable Barrier using RDMA and Multicast Mechanisms for InfiniBand-Based Clusters. In Euro PVM/MPI Conference, September 29-Oct 2, 2003, Venice, Italy.
-
Jiuxing Liu, Jiesheng Wu, Sushmitha P. Kini, Peter Wyckoff, and Dhabaleswar K. Panda.
High Performance RDMA-Based MPI Implementation over InfiniBand.
In the Proceedings of 17th Annual ACM International Conference on Supercomputing. San Francisco Bay Area. June, 2003.
PhD Dissertations
- Matthew Koop, High-Performance Multi-Transport MPI Design for Ultra-Scale InfiniBand Clusters, June 2009.
- Gopalakrishnan Santhanaraman, Designing Scalable and High Performance One Sided Communication Middleware for Modern Interconnects, June 2009.
- Lei Chai, High Performance and Scalable MPI Intra-Node Communication Middleware for Multi-core Clusters, February 2009.
- Wei Huang, High Performance Network I/O in Virtual Machines Over Modern Interconnects, August 2008.
- Amith Mamidala, Scalable and High Performance Collective Communication For Next Generation Multicore InfiniBand Clusters, May 2008.
- Abhinav Vishnu, High Performance and Network Fault Tolerant MPI with Multi-Pathing Over InfiniBand, Dec 2007.
- Sayantan Sur, Scalable and High Performance MPI Design for Very Large InfiniBand Clusters, August 2007.
- Jiuxing Liu, Designing High Performance and Scalable MPI over InfiniBand, Sept 2003.
M.S. Thesis
- Karthik Gopalakrishnan, Enhancing Fault Tolerance in MPI for Modern InfiniBand Clusters, August 2009.
- T. Gangadharappa, Designing Support for MPI-2 Programming Interfaces on Modern Interconnects, June 2009.
- R. Kumar, Enhancing MPI Point-to-Point and Collectives for Clusters with Onloaded/Offloaded InfiniBand Adapters, August 2008.
- S. Bhagvat, Designing and Enhancing the Sockets Direct Protocol (SDP) over iWARP and InfiniBand, August 2006.
- W. Jiang, High Performance MPICH2 One-Sided Communication Implementation over InfiniBand, June 2004.
- B. Chandrasekaran, Micro-benchmark Level Performance Evaluation and Comparison of High Speed Cluster Interconnects, September 2003.

