wql
|
9175f7c9bb
|
fix: change to GPUS_PER_NODE=1
|
2024-08-08 09:18:29 +08:00 |
wql
|
7655337c72
|
fix: change to GPUS_PER_NODE=2
|
2024-08-08 09:03:43 +08:00 |
wql
|
12f7320b51
|
fix: change GPUS_PER_NODE to 8
|
2024-08-07 16:49:33 +08:00 |
wql
|
db532ca4b1
|
fix: modify paras in pretrain_dragonfly
|
2024-08-07 16:00:45 +08:00 |
anrongqiao
|
ed025abba3
|
fix single dataset error with exhaust with 2b models
|
2024-08-01 10:37:57 +08:00 |
anrongqiao
|
441c79f807
|
fix single dataset error with exhaust
|
2024-08-01 10:34:47 +08:00 |
anrongqiao
|
cfd2fca57c
|
add fm9g 2b and 8b models
|
2024-07-15 14:27:10 +08:00 |