Commit Graph

2121 Commits

Author SHA1 Message Date
wql bbabfc674b update saves 2024-08-12 06:39:25 +00:00
wql 533ad569eb test: test commit 2024-08-12 14:34:56 +08:00
wql 5fb466926c fix: remove saves from gitignore 2024-08-12 14:31:19 +08:00
wql 154eecf708 add: add llama2_lora_sft.yaml 2024-08-12 10:49:20 +08:00
hiyouga c93d55bfb0 update readme 2024-08-10 10:17:35 +08:00
hiyouga 576a894f77 update readme 2024-08-09 20:46:02 +08:00
hiyouga c75b5b83c4 add magpie ultra dataset 2024-08-09 20:28:55 +08:00
hiyouga dc770efb14 add qwen2 math models 2024-08-09 20:20:35 +08:00
hiyouga 0a690ada6f update examples 2024-08-09 20:13:46 +08:00
hiyouga e2a28f51c6 add adam_mini to readme 2024-08-09 20:02:03 +08:00
hoshi-hiyouga ef482394f0
Merge pull request #5095 from relic-yuexi/feat-optimizer
Feat optimizer
2024-08-09 19:51:33 +08:00
hiyouga 86f7099fa3 update scripts 2024-08-09 19:16:23 +08:00
hiyouga c87023d539 follow #5115 2024-08-09 18:03:00 +08:00
hoshi-hiyouga 51542cb15f
Merge pull request #5115 from YeQiuO/main
fix: `Train on the last turn only` truncate bug
2024-08-09 17:58:27 +08:00
hoshi-hiyouga 984961c550
Merge pull request #5072 from relic-yuexi/main
fix the deepseekcoder template to avoid repeat problem
2024-08-09 16:35:21 +08:00
hoshi-hiyouga 4f62e1cb24
Update template.py 2024-08-09 16:27:42 +08:00
“Wzw” 2fa1e0b2ad mask_history args verify valid 2024-08-08 10:12:01 +08:00
“Wzw” b5ca86cc07 fix mask_history tiny bug 2024-08-08 10:09:33 +08:00
codingma 18e455c232
Merge pull request #5109 from codemayq/fix-example
fix eval_dataset in example
2024-08-07 18:30:05 +08:00
codingma 9a48f7e957 update wechat.jpg 2024-08-07 18:29:48 +08:00
codingma 823e7c122b fix eval_dataset in example 2024-08-07 18:24:19 +08:00
moontidef 82bc15dc79 feat: add support for adammini 2024-08-07 10:08:22 +08:00
moontidef 40908a36fa fix: rename optimzer to optimizer 2024-08-07 10:05:01 +08:00
moontidef 55f32dfbf9
Merge branch 'hiyouga:main' into main 2024-08-06 00:18:45 +08:00
moontidef b82ecbedd0 fix: fix the deepseekcoder template to avoid repeat problem 2024-08-05 23:55:45 +08:00
hiyouga b7ca6c8dc1 fix #5048 2024-08-05 23:48:19 +08:00
hoshi-hiyouga c2921b9960
Merge pull request #5037 from codemayq/feature-gemma-2-2b
support gemma-2-2b
2024-08-05 23:27:37 +08:00
codingma dc09d454f2 support gemma-2-2b 2024-08-01 13:45:48 +08:00
codingma 1c05b847b2 update wechat.jpg 2024-08-01 09:51:47 +08:00
codingma 3885949a9d update wechat_npu.jpg 2024-07-30 13:45:47 +08:00
hoshi-hiyouga cd420c1938
Merge pull request #5010 from Eruly/main
Add Korean web UI (llamafactory-cli webui)
2024-07-30 01:55:54 +08:00
hoshi-hiyouga 06e17eb462
Merge pull request #4996 from LDLINGLINGLING/main
增加了MiniCPM在页面首页的支持列表,MiniCPM官方github也放了LLama_factory的友情链接
2024-07-30 01:55:30 +08:00
hoshi-hiyouga 3a49c76b65
Update README_zh.md 2024-07-30 01:55:13 +08:00
hoshi-hiyouga 9e409eadb0
Update README.md 2024-07-30 01:53:19 +08:00
hoshi-hiyouga 8d5a41f2cd
Update README.md 2024-07-30 01:52:35 +08:00
hoshi-hiyouga daa62db06f
Merge pull request #4995 from codemayq/fix-pissa
fix pissa callback
2024-07-30 01:47:25 +08:00
eruly 371009e522 Add Korean web UI (llamafactory-cli webui) 2024-07-29 13:47:13 +00:00
liudan b9ed9d45cc 增加了MiniCPM在页面首页的支持列表,MiniCPM官方github也放了LLama_factory的友情链接 2024-07-29 10:58:28 +08:00
codingma 2c1ca9f742 fix pissa save 2024-07-29 10:44:34 +08:00
hiyouga 668654b5ad tiny fix 2024-07-26 11:51:00 +08:00
hoshi-hiyouga 8a2846cfe1
Merge pull request #4892 from piamo/main
update deepseek template
2024-07-26 11:49:34 +08:00
hoshi-hiyouga 9839c6d1f6
Merge pull request #4950 from liuwwang/main and fi
fix: Repair the issue where quantization failed after merging the adapter.
2024-07-26 11:48:56 +08:00
hoshi-hiyouga b8896b9b8b
Merge pull request #4970 from HardAndHeavy/add-rocm
Add ROCm support
2024-07-26 11:41:23 +08:00
hoshi-hiyouga 3c424cf69a
Merge pull request #4961 from khazic/main
Added the reference address for TRL PPO details.
2024-07-26 11:32:29 +08:00
hoshi-hiyouga 77e7bfee79
Update README_zh.md 2024-07-26 11:30:57 +08:00
hoshi-hiyouga 1186ad53d4
Update README.md 2024-07-26 11:29:28 +08:00
hoshi-hiyouga f97beca23a
Update README.md 2024-07-26 11:29:09 +08:00
codemayq 024c49d4e0 update wechat.jpg 2024-07-26 10:01:10 +08:00
HardAndHeavy c8e18a669a Add ROCm support 2024-07-25 21:29:28 +03:00
khazic ceba96f9ed Added the reference address for TRL PPO details. 2024-07-25 09:03:21 +08:00