forked from p04798526/LLaMA-Factory-Mirror
Merge pull request #2903 from khazic/main
Updated README with new information
This commit is contained in:
commit
a773035709
|
@ -520,6 +520,7 @@ use_cpu: false
|
|||
```bash
|
||||
deepspeed --num_gpus 8 src/train_bash.py \
|
||||
--deepspeed ds_config.json \
|
||||
--ddp_timeout 180000000 \
|
||||
... # arguments (same as above)
|
||||
```
|
||||
|
||||
|
|
|
@ -519,7 +519,9 @@ use_cpu: false
|
|||
```bash
|
||||
deepspeed --num_gpus 8 src/train_bash.py \
|
||||
--deepspeed ds_config.json \
|
||||
--ddp_timeout 180000000 \
|
||||
... # 参数同上
|
||||
|
||||
```
|
||||
|
||||
<details><summary>使用 DeepSpeed ZeRO-2 进行全参数训练的 ds_config.json 示例</summary>
|
||||
|
|
Loading…
Reference in New Issue