Commit Graph

372 Commits

Author SHA1 Message Date
hiyouga dc770efb14 add qwen2 math models 2024-08-09 20:20:35 +08:00
hiyouga e2a28f51c6 add adam_mini to readme 2024-08-09 20:02:03 +08:00
hiyouga 86f7099fa3 update scripts 2024-08-09 19:16:23 +08:00
hiyouga b7ca6c8dc1 fix #5048 2024-08-05 23:48:19 +08:00
hoshi-hiyouga 9e409eadb0 Update README.md 2024-07-30 01:53:19 +08:00
hoshi-hiyouga 8d5a41f2cd Update README.md 2024-07-30 01:52:35 +08:00
liudan b9ed9d45cc Added MiniCPM to the supported-models list on the home page; the official MiniCPM GitHub now also carries a friendly link to LLama_factory 2024-07-29 10:58:28 +08:00
hiyouga 668654b5ad tiny fix 2024-07-26 11:51:00 +08:00
hoshi-hiyouga b8896b9b8b Merge pull request #4970 from HardAndHeavy/add-rocm
Add ROCm support
2024-07-26 11:41:23 +08:00
hoshi-hiyouga 1186ad53d4 Update README.md 2024-07-26 11:29:28 +08:00
hoshi-hiyouga f97beca23a Update README.md 2024-07-26 11:29:09 +08:00
HardAndHeavy c8e18a669a Add ROCm support 2024-07-25 21:29:28 +03:00
khazic ceba96f9ed Added the reference address for TRL PPO details. 2024-07-25 09:03:21 +08:00
hiyouga 77cff78863 fix #4959 2024-07-24 23:44:00 +08:00
hoshi-hiyouga 5626bdc56d Update README.md 2024-07-24 21:07:14 +08:00
hiyouga 26533c0604 add llama3.1 2024-07-24 16:20:11 +08:00
hiyouga 87346c0946 update readme 2024-07-03 19:39:05 +08:00
wangzhihong 22da47ba27 add LazyLLM to `Projects using LLaMA Factory` in `README.md` 2024-07-03 11:12:20 +08:00
hiyouga d4e2af1fa4 update readme 2024-07-01 00:22:52 +08:00
hiyouga d74244d568 fix #4398 #4592 2024-06-30 21:28:51 +08:00
hiyouga 0e0d69b77c update readme 2024-06-28 06:55:19 +08:00
hiyouga 6f63050e1b add Gemma2 models 2024-06-28 01:26:50 +08:00
hiyouga e44a4f07f0 tiny fix 2024-06-27 20:14:48 +08:00
hoshi-hiyouga 64b131dcfa Merge pull request #4461 from hzhaoy/feature/support-flash-attn
support flash-attn in Dockerfile
2024-06-27 20:05:26 +08:00
hiyouga ad144c2265 support HQQ/EETQ #4113 2024-06-27 00:29:42 +08:00
hzhaoy e19491b0f0 add flash-attn installation flag in Dockerfile 2024-06-27 00:13:30 +08:00
hiyouga efb81b25ec fix #4419 2024-06-25 01:51:29 +08:00
hiyouga 41086059b1 tiny fix 2024-06-25 01:15:19 +08:00
hoshi-hiyouga 5dc8fa647e Update README.md 2024-06-25 01:03:38 +08:00
MengqingCao d7207e8ad1 update docker files
1. add docker-npu (Dockerfile and docker-compose.yml)
2. move cuda docker to docker-cuda and tiny changes to adapt to the new path
2024-06-24 10:57:36 +00:00
hiyouga e507e60638 update readme 2024-06-24 18:22:12 +08:00
hiyouga 344b9a36b2 tiny fix 2024-06-18 23:32:18 +08:00
hoshi-hiyouga 10316dd8ca Merge pull request #4309 from EliMCosta/patch-1
Add Magpie and Webinstruct dataset samples
2024-06-18 23:30:19 +08:00
hiyouga a233fbc258 add deepseek coder v2 #4346 2024-06-18 22:53:54 +08:00
hiyouga fcb2e8e7b7 update readme 2024-06-17 18:47:24 +08:00
Eli Costa 103664203c Update README.md
Add Magpie and Webinstruct to README
2024-06-16 11:19:25 -03:00
hiyouga 8c1046d78a support pissa 2024-06-16 01:08:12 +08:00
hiyouga acd84ce535 update readme 2024-06-15 05:13:16 +08:00
hiyouga b6e008c152 update examples 2024-06-13 03:15:06 +08:00
hiyouga c7a5620ccc add neo-sft dataset 2024-06-13 01:00:56 +08:00
hiyouga 713fde4259 fix lint 2024-06-13 00:48:44 +08:00
hiyouga 947a34f53b fix docker compose usage 2024-06-13 00:07:48 +08:00
hiyouga 2ce2e5bc47 update readme 2024-06-12 17:39:12 +08:00
hiyouga 949e9908ad fix #4145
Fix the docker image
2024-06-11 00:19:17 +08:00
-.- 483cdd9b6a fix README 2024-06-08 23:51:56 +08:00
hiyouga 12d79f89c5 add ultrafeedback and fineweb #4085 #4132 2024-06-08 02:42:34 +08:00
hiyouga 1c7f0ab519 init unittest 2024-06-08 01:35:58 +08:00
hiyouga 2702d7e952 fix ppo in trl 0.8.6 2024-06-07 04:48:29 +08:00
hiyouga f9e818d79c fix #4120 2024-06-07 04:18:05 +08:00
hiyouga 8e95648850 add qwen2 models 2024-06-07 00:22:57 +08:00