Optically connected and reconfigurable GPU architecture for optimized peer-to-peer access

Erik Anderson, Jorge González, Alexander Gazman, Rodolfo Azevedo, Keren Bergman

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Increasing industry interest in the optimization of inter-GPU communication has motivated this work to explore new ways to enable peer-to-peer access. Specifically, this paper investigates how reconfigurable optical links between GPUs in multi-GPU servers can allow for minimized memory transfer latencies for given machine learning applications. Silicon photonics (SiP) is proposed as the enabling technology for such a reconfigurable architecture due to the potential for scalable and cost-efficient production. We evaluated our architecture using traffic obtained from an NVLink-connected 8 GPU server executing a set of machine learning models including AlexNet, DenseNet, NASNet, ResNet, MobileNet, and VGG16. Our results show up to 24.91% reduction of the total relative transmission latency (RTL) between peers.

Original languageEnglish
Title of host publicationMEMSYS 2018 - Proceedings of the International Symposium on Memory Systems
PublisherAssociation for Computing Machinery
ISBN (Electronic)9781450364751
DOIs
StatePublished - 1 Oct 2018
Externally publishedYes
Event2018 International Symposium on Memory Systems, MEMSYS 2018 - Alexandria, United States
Duration: 1 Oct 20184 Oct 2018

Publication series

NameACM International Conference Proceeding Series

Conference

Conference2018 International Symposium on Memory Systems, MEMSYS 2018
Country/TerritoryUnited States
CityAlexandria
Period1/10/184/10/18

Fingerprint

Dive into the research topics of 'Optically connected and reconfigurable GPU architecture for optimized peer-to-peer access'. Together they form a unique fingerprint.

Cite this