CPM-9G-8B/FM_9G/apps/fm9g_8b/train_configs/8b.json

10 lines
151 B
JSON

{
"pretrain": {
"train_iters": 20000,
"batch_size": 1,
"max_length": 4096,
"n_gpus": 8,
"lr": 1e-5
}
}