Commit Graph

330 Commits

Author SHA1 Message Date
hiyouga d14edd350d add extra requires 2024-08-27 12:52:12 +08:00
hiyouga 72bc8f0111 support liger kernel 2024-08-27 11:20:14 +08:00
hiyouga 3804ddec9e update readme 2024-08-19 23:32:04 +08:00
codingma 625a0e32c4 add tutorial and doc links 2024-08-13 16:13:10 +08:00
hiyouga c93d55bfb0 update readme 2024-08-10 10:17:35 +08:00
hiyouga 576a894f77 update readme 2024-08-09 20:46:02 +08:00
hiyouga c75b5b83c4 add magpie ultra dataset 2024-08-09 20:28:55 +08:00
hiyouga dc770efb14 add qwen2 math models 2024-08-09 20:20:35 +08:00
hiyouga e2a28f51c6 add adam_mini to readme 2024-08-09 20:02:03 +08:00
hiyouga 86f7099fa3 update scripts 2024-08-09 19:16:23 +08:00
hiyouga b7ca6c8dc1 fix #5048 2024-08-05 23:48:19 +08:00
hoshi-hiyouga 3a49c76b65 Update README_zh.md 2024-07-30 01:55:13 +08:00
liudan b9ed9d45cc Added MiniCPM to the supported-models list on the home page; the official MiniCPM GitHub also added a link to LLama_factory 2024-07-29 10:58:28 +08:00
hiyouga 668654b5ad tiny fix 2024-07-26 11:51:00 +08:00
hoshi-hiyouga 77e7bfee79 Update README_zh.md 2024-07-26 11:30:57 +08:00
khazic ceba96f9ed Added the reference address for TRL PPO details. 2024-07-25 09:03:21 +08:00
hiyouga 77cff78863 fix #4959 2024-07-24 23:44:00 +08:00
hoshi-hiyouga 71d3e60713 Update README_zh.md 2024-07-24 21:08:42 +08:00
hiyouga 26533c0604 add llama3.1 2024-07-24 16:20:11 +08:00
hiyouga 87346c0946 update readme 2024-07-03 19:39:05 +08:00
wangzhihong 6f8f53f879 Update README_zh.md 2024-07-03 14:59:09 +08:00
hiyouga d4e2af1fa4 update readme 2024-07-01 00:22:52 +08:00
hiyouga d74244d568 fix #4398 #4592 2024-06-30 21:28:51 +08:00
hiyouga 0e0d69b77c update readme 2024-06-28 06:55:19 +08:00
hiyouga 6f63050e1b add Gemma2 models 2024-06-28 01:26:50 +08:00
hiyouga e44a4f07f0 tiny fix 2024-06-27 20:14:48 +08:00
hoshi-hiyouga 64b131dcfa Merge pull request #4461 from hzhaoy/feature/support-flash-attn (support flash-attn in Dockerfile) 2024-06-27 20:05:26 +08:00
hiyouga ad144c2265 support HQQ/EETQ #4113 2024-06-27 00:29:42 +08:00
hzhaoy e19491b0f0 add flash-attn installation flag in Dockerfile 2024-06-27 00:13:30 +08:00
hiyouga efb81b25ec fix #4419 2024-06-25 01:51:29 +08:00
hiyouga 41086059b1 tiny fix 2024-06-25 01:15:19 +08:00
hoshi-hiyouga ec95f942d1 Update README_zh.md 2024-06-25 01:06:59 +08:00
MengqingCao d7207e8ad1 update docker files: (1) add docker-npu (Dockerfile and docker-compose.yml); (2) move cuda docker to docker-cuda and tiny changes to adapt to the new path 2024-06-24 10:57:36 +00:00
hiyouga 4ea84a8333 update readme 2024-06-24 18:29:04 +08:00
hiyouga e507e60638 update readme 2024-06-24 18:22:12 +08:00
hiyouga 344b9a36b2 tiny fix 2024-06-18 23:32:18 +08:00
hoshi-hiyouga 10316dd8ca Merge pull request #4309 from EliMCosta/patch-1 (Add Magpie and Webinstruct dataset samples) 2024-06-18 23:30:19 +08:00
hiyouga a233fbc258 add deepseek coder v2 #4346 2024-06-18 22:53:54 +08:00
hiyouga fcb2e8e7b7 update readme 2024-06-17 18:47:24 +08:00
Eli Costa 3ec57ac239 Update README_zh.md (Fix details tag in datasets menus) 2024-06-16 11:34:31 -03:00
Eli Costa 82d5c5c1e8 Update README_zh.md (Add Magpie and WebInstruct to README) 2024-06-16 11:22:06 -03:00
hiyouga 8c1046d78a support pissa 2024-06-16 01:08:12 +08:00
hiyouga acd84ce535 update readme 2024-06-15 05:13:16 +08:00
hiyouga b6e008c152 update examples 2024-06-13 03:15:06 +08:00
hiyouga c7a5620ccc add neo-sft dataset 2024-06-13 01:00:56 +08:00
hiyouga 713fde4259 fix lint 2024-06-13 00:48:44 +08:00
hiyouga 947a34f53b fix docker compose usage 2024-06-13 00:07:48 +08:00
hiyouga 2ce2e5bc47 update readme 2024-06-12 17:39:12 +08:00
hiyouga 949e9908ad fix #4145 (Fix the docker image) 2024-06-11 00:19:17 +08:00
-.- 483cdd9b6a fix README 2024-06-08 23:51:56 +08:00