An In-depth Performance Characterization of CPU- and GPU-based DNN Training on Modern Architectures
A. Awan, H. Subramoni, D. Panda
3rd Workshop on Machine Learning in High Performance Computing Environments, held in conjunction with SC17,
Nov 2017.