distributed单机多卡 #6

zhangxiaofan-star · 2023-06-09T02:56:47Z

您好，很感谢您这篇论文的工作，我收获了很多。
我在使用distributed方法进行单机多卡执行时，遇到了如下报错

您可以帮忙解答一下吗

parker-sornberger · 2023-11-15T11:59:42Z

That's interesting. Are you using the DataParallel model for multiple GPUs? I have trained on up to 4 GPUs before with no issue other than needing to change some code from model.attr to model.module.attr.

However when I have done this, all the GPUs have been on the same HPC node.

Were you ever able to get this resolved?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

distributed单机多卡 #6

distributed单机多卡 #6

zhangxiaofan-star commented Jun 9, 2023

parker-sornberger commented Nov 15, 2023

distributed单机多卡 #6

distributed单机多卡 #6

Comments

zhangxiaofan-star commented Jun 9, 2023

parker-sornberger commented Nov 15, 2023