From f0d3e87e16326d342a930f01d9c3c6f3543b38d8 Mon Sep 17 00:00:00 2001 From: p04896573 Date: Tue, 14 May 2024 17:57:58 +0800 Subject: [PATCH] Update README_DISTRIBUTED.md --- quick_start_clean/readmes/README_DISTRIBUTED.md | 7 ++++++- 1 file changed, 6 insertions(+), 1 deletion(-) diff --git a/quick_start_clean/readmes/README_DISTRIBUTED.md b/quick_start_clean/readmes/README_DISTRIBUTED.md index 0361481..4043184 100644 --- a/quick_start_clean/readmes/README_DISTRIBUTED.md +++ b/quick_start_clean/readmes/README_DISTRIBUTED.md @@ -114,6 +114,11 @@ for i in {1..3};do done ``` +## dockers上的多机提交任务 +dockers 容器上的多机任务和在主机上是相同的,只需要再其基础上满足两个要求 +- 在每个机器上拉取同样的docker和激活同样的训练环境,在docker共享的路径、数据、代码都一致 +- 在docker启动的时候保障 --network=host,和主机共享网络通信,只要机器之间能通信,在dockers中也可以通信和训练 + #### TODOs -1 完善dockers、K8s集群的分布式多机任务训练 \ No newline at end of file +1 完善K8s集群的分布式多机任务训练 \ No newline at end of file