HON’s Wiki # Interconnects

Home / High-Performance Computing (HPC)

Contents

General

Ethernet

Info

RDMA Over Converged Ethernet (RoCE)

Info

InfiniBand

Info

Installation (Debian)

  1. Install RDMA: apt install rdma-core
  2. Install user-space RDMA stuff: apt install ibverbs-providers rdmacm-utils infiniband-diags ibverbs-utils
  3. Install subnet manager (SM): apt install opensm
    • Only one instance is required on the network, but multiple may be used for redundancy.
    • A master SM is selected based on configured priority, with GUID as a tie breaker.
  4. Setup IPoIB:
    • Just like for Ethernet. Just specify the IB interface as the L2 device.
    • Use an appropriate MTU like 2044.
  5. Make sure ping and ping-pong is working (see examples below).

Usage

Testing

See CUDA.


hon.one | HON95/wiki | Edit page