Unified Collective Communication (UCC) was codesigned with industry partners for PyTorch-based deep learning recommender model training on multi-rail GPU platforms. UCC has been specifically designed and implemented for high-performance PGAS applications and runtimes. It serves as a drop-in replacement for HCOLL and will gradually assume the role of default collective library once UCC fully implements the range of HCOLL's hierarchical algorithms.
For further information on what UCC is and how to use it, please see https://github.com/openucx/ucc
Please see UCC PyTorch integration layer, Torch_UCC at https://github.com/facebookresearch/torch_ucc
Starting with HPC-X v2.50, UCC is the default collective framework for Open MPI and Open SHMEM collectives.
Last updated: