deepseed专题

deepseed 单机多卡程序报错:exits with return code -7

现象:exits with return code -7原因:Setting the shm-size to a large number instead of default 64MB when creating docker container solves the problem in my case. It appears that multi-gpu training relies on