CPM-9G-8B/FM_9G/apps/fm9g_2b/train_configs/2.4b.json

{
  "pretrain": {
    "train_iters": 1000000000,
    "batch_size": 1,
    "max_length": 4096,
    "n_gpus": 8,
    "lr": 0.01
  }
}
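
A minimal sketch of how these values combine, not taken from the FM_9G training code. It assumes that `batch_size` is a per-GPU value and that there is no gradient accumulation; neither is stated in the config. Under those assumptions it estimates the maximum number of tokens processed per step. The helper name `tokens_per_step` is hypothetical.

```python
import json

# Config text copied from 2.4b.json.
CONFIG_TEXT = """
{
  "pretrain": {
    "train_iters": 1000000000,
    "batch_size": 1,
    "max_length": 4096,
    "n_gpus": 8,
    "lr": 0.01
  }
}
"""


def tokens_per_step(pretrain: dict) -> int:
    """Upper bound on tokens per step.

    ASSUMPTION: batch_size is per GPU and there is no gradient
    accumulation. The config itself does not say either way.
    """
    return pretrain["batch_size"] * pretrain["max_length"] * pretrain["n_gpus"]


pretrain = json.loads(CONFIG_TEXT)["pretrain"]
print(tokens_per_step(pretrain))  # 1 * 4096 * 8 = 32768
```

If `batch_size` is actually a global value in the real trainer, the estimate would drop the `n_gpus` factor.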