Past Talks

38th IEEE International Parallel & Distributed Processing Symposium - San Francisco, California
(May 27 - 31, 2024)

Time Location Event Speaker(s)

Monday, May 27

8:45AM - 5:00PM None

Heterogeneity in Computing

[Workshop]

DK Panda
H. Subramoni
3:30PM - 5:00PM None

Impact of LLMs and Generative AI on Future Heterogeneous Systems?

[Panel]

DK Panda

Thursday, May 30

8:30AM - 10:30AM None

HINT: Designing Cache-Efficient MPI_Alltoall using Hybrid Memory Copy Ordering and Non-Temporal Instructions

[Talk]

B. Ramesh
N. Contini
N. Alnaasan
K. Suresh
M. Abduljabbar
A. Shafi
H. Subramoni
DK Panda
1:30PM - 2:50PM None

Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference

[Talk]

J. Yao
Q. Anthony
A. Shafi
H. Subramoni
DK Panda

Friday, May 31

11:30AM - 12:00PM None

PML-MPI: A Pre-Trained ML Framework for Efficient Collective Algorithm Selection in MPI

[Talk]

M. Han
G. Kuncham
B. Michalowicz
R. Vaidya
M. Abduljabbar
A. Shafi
H. Subramoni
DK Panda

ISC HIGH PERFORMANCE 2024 - Hamburg Germany
(May 12 - 16, 2024)

Time Location Event Speaker(s)

Sunday, May 12

All Times in Central European Summer Time
9:00AM - 1:00PM Hall Y2 - 2nd floor

High-Performance and Smart Networking Technologies for HPC and AI

[Tutorial]

H. Subramoni
DK Panda

Monday, May 13

All Times in Central European Summer Time
3:00PM - 4:00PM Foyer D-G - 2nd floor

High-Performance Semi-Supervised Learning with HARVEST: A Distributed Computer Vision Framework for Expert Labeling

[Poster Presentation] [Best Poster Finalist]

N. Alnaasan
3:00PM - 4:00PM Foyer D-G - 2nd floor

Profiling, Storing and Monitoring HPC Communication Data at Scale by OSU INAM

[Poster Presentation]

H. Subramoni
3:00PM - 4:00PM Foyer D-G - 2nd floor

A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference

[Poster Presentation]

DK Panda
3:00PM - 4:00PM Foyer D-G - 2nd floor

High Performance & Scalable MPI Library Over Broadcom RoCE

[Poster Presentation]

S. Xu

Tuesday, May 14

All Times in Central European Summer Time
11:10AM - 11:35AM Hall F - 2nd floor

Accelerating MPI AllReduce Communication with Efficient GPU-Based Compression Schemes on Modern GPU Clusters

[Paper Presentation]

H. Subramoni
2:30PM - 3:30PM Hall G1 - 2nd floor

Research Poster Pitch & Awarding

[Poster Presentation]

N. Alnaasan
S. Xu
H. Subramoni
DK Panda

Thursday, May 16

All Times in Central European Summer Time
2:00PM - 6:00PM Hall Y7 - 2nd floor

Ninth International Workshop on Communication Architectures for HPC, Big Data, Deep Learning and Clouds at Extreme Scale

[Workshop]

H. Subramoni
A. Shafi
DK Panda

The 20th annual OpenFabrics Alliance (OFA) Virtual Workshop - Online
(Apr 22 - 23, 2024)

Time Location Event Speaker(s)

Monday, April 22

All Times in PT
10:15AM - 10:45AM Online

Accelerating MPI AllReduce Communication with Efficient GPU-Based Compression Schemes on Modern GPU Clusters

[Talk]

H. Subramoni
Q. Zhou
11:00AM - 11:30AM Online

High Performance & Scalable MPI library over Broadcom RoCE

[Talk]

M. Abduljabbar
H. Shah, Broadcom Inc.
S. Xu
1:00PM - 1:30PM Online

Scaling Large Language Model Training using Hybrid GPU-based Compression in MVAPICH

[Talk]

A. Shafi
L. Xu

Tuesday, April 23

All Times in PT
10:15AM - 10:45AM Online

Optimized All-to-all Connection Establishment for High-Performance MPI Libraries over InfiniBand

[Talk]

M. Abduljabbar
DK Panda
11:15AM - 11:45AM Online

Designing In-Network Computing Aware Reduction Collectives in MPI

[Talk]

DK Panda
B. Ramesh

Nvidia GPU Technology Conference 2024 - San Jose, CA
(Mar 17 - 21, 2024)

Time Location Event Speaker(s)

Monday, March 18

All Times in PST
4:00PM - 6:00PM SJCC West Lobby (L2)

DPU-Bench A New Microbenchmark Suite to Measure the Offload Efficiency of SmartNICs

[Poster Presentation]

B. Michalowicz
4:00PM - 6:00PM SJCC West Lobby (L2)

MCR-DL: Mix and Match Communication Runtime for Deep Learning

[Poster Presentation]

Q. Anthony
4:00PM - 6:00PM SJCC West Lobby (L2)

AccDP: Accelerated Data-Parallel Distributed DNN Training for Modern GPU-Based HPC Clusters

[Poster Presentation]

N. Alnaasan

Tuesday, March 19

All Times in PST
4:00PM - 4:25PM SJCC 210F (L2)

Accelerating HPC and AI Applications with Offloading to BlueField DPUs: Strategies and Benefits

[Presentation-Demo-Discussion]

DK Panda

Wednesday, March 20

All Times in PST
4:00PM - 4:25PM SJCC LL20C (LL)

Accelerating Deep Learning Applications with GPU-Based On-the-Fly Compression

[Presentation-Demo-Discussion]

H. Subramoni

Thursday, March 21

All Times in PST
10:00AM - 10:25AM SJCC 212B (L2)

A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference

[Presentation-Demo-Discussion]

A. Shafi