Distributed Training Using ChainerMN

ChainerMN enables multi-node distributed deep learning

This blog post provides benchmark results using up to 128 GPUs.

ChainerMN can be used for both inner-node (i.e., multiple GPUs inside a node) and inter-node settings. For inter-node settings, we highly recommend to use high-speed interconnects such as InfiniBand.

ChainerMN examples are available on GitHub. These examples are based on the examples of Chainer and the differences are highlighted.

PreviousDistributed Training using Horovod NextOverview

Last updated 5 years ago