Distributed Training Using ChainerMN
ChainerMN enables multi-node distributed deep learning
This blog post provides benchmark results using up to 128 GPUs.
ChainerMN can be used for both inner-node (i.e., multiple GPUs inside a node) and inter-node settings. For inter-node settings, we highly recommend to use high-speed interconnects such as InfiniBand.
ChainerMN examples are available on GitHub. These examples are based on the examples of Chainer and the differences are highlighted.
Last updated