353 Commits

Author SHA1 Message Date
hiyouga d74244d568 fix #4398 #4592 2024-06-30 21:28:51 +08:00
hiyouga 0e0d69b77c update readme 2024-06-28 06:55:19 +08:00
hiyouga 6f63050e1b add Gemma2 models 2024-06-28 01:26:50 +08:00
hiyouga e44a4f07f0 tiny fix 2024-06-27 20:14:48 +08:00
hoshi-hiyouga 64b131dcfa Merge pull request #4461 from hzhaoy/feature/support-flash-attn
support flash-attn in Dockerfile
2024-06-27 20:05:26 +08:00
hiyouga ad144c2265 support HQQ/EETQ #4113 2024-06-27 00:29:42 +08:00
hzhaoy e19491b0f0 add flash-attn installation flag in Dockerfile 2024-06-27 00:13:30 +08:00
hiyouga efb81b25ec fix #4419 2024-06-25 01:51:29 +08:00
hiyouga 41086059b1 tiny fix 2024-06-25 01:15:19 +08:00
hoshi-hiyouga 5dc8fa647e Update README.md 2024-06-25 01:03:38 +08:00
MengqingCao d7207e8ad1 update docker files
1. add docker-npu (Dockerfile and docker-compose.yml)
2. move cuda docker to docker-cuda and tiny changes to adapt to the new path
2024-06-24 10:57:36 +00:00
hiyouga e507e60638 update readme 2024-06-24 18:22:12 +08:00
hiyouga 344b9a36b2 tiny fix 2024-06-18 23:32:18 +08:00
hoshi-hiyouga 10316dd8ca Merge pull request #4309 from EliMCosta/patch-1
Add Magpie and Webinstruct dataset samples
2024-06-18 23:30:19 +08:00
hiyouga a233fbc258 add deepseek coder v2 #4346 2024-06-18 22:53:54 +08:00
hiyouga fcb2e8e7b7 update readme 2024-06-17 18:47:24 +08:00
Eli Costa 103664203c Update README.md
Add Magpie and Webinstruct to README
2024-06-16 11:19:25 -03:00
hiyouga 8c1046d78a support pissa 2024-06-16 01:08:12 +08:00
hiyouga acd84ce535 update readme 2024-06-15 05:13:16 +08:00
hiyouga b6e008c152 update examples 2024-06-13 03:15:06 +08:00
hiyouga c7a5620ccc add neo-sft dataset 2024-06-13 01:00:56 +08:00
hiyouga 713fde4259 fix lint 2024-06-13 00:48:44 +08:00
hiyouga 947a34f53b fix docker compose usage 2024-06-13 00:07:48 +08:00
hiyouga 2ce2e5bc47 update readme 2024-06-12 17:39:12 +08:00
hiyouga 949e9908ad fix #4145
Fix the docker image
2024-06-11 00:19:17 +08:00
-.- 483cdd9b6a fix README 2024-06-08 23:51:56 +08:00
hiyouga 12d79f89c5 add ultrafeedback and fineweb #4085 #4132 2024-06-08 02:42:34 +08:00
hiyouga 1c7f0ab519 init unittest 2024-06-08 01:35:58 +08:00
hiyouga 2702d7e952 fix ppo in trl 0.8.6 2024-06-07 04:48:29 +08:00
hiyouga f9e818d79c fix #4120 2024-06-07 04:18:05 +08:00
hiyouga 8e95648850 add qwen2 models 2024-06-07 00:22:57 +08:00
hiyouga 53eb2de75e update readme 2024-06-06 16:59:18 +08:00
hiyouga 87a7822b98 update readme 2024-06-06 16:25:42 +08:00
hiyouga cae4737907 lora modules: all by default 2024-06-06 03:53:28 +08:00
hiyouga 946f601136 support image input in api #3971 #4061 2024-06-06 02:29:55 +08:00
hiyouga eef1e542a9 update readme 2024-06-05 16:32:32 +08:00
hiyouga f48f5e646e support glm-4 2024-06-05 15:16:38 +08:00
hiyouga c4f50865ad update readme 2024-05-30 16:40:17 +08:00
hiyouga 89ca832740 update readme 2024-05-29 18:39:11 +08:00
hoshi-hiyouga 880b4a9acf Merge pull request #3930 from MengqingCao/npu
Add Ascend npu doc and dependency
2024-05-29 18:33:38 +08:00
MengqingCao e14f5b37e4 update cann kernels url 2024-05-29 09:53:31 +00:00
hiyouga 087b9faa39 update readme 2024-05-28 19:35:52 +08:00
hiyouga c8765349ba update readme 2024-05-28 16:41:34 +08:00
hiyouga 99ee0dadd9 update readme 2024-05-28 16:19:56 +08:00
hiyouga 5d45adf47d fix #3931 2024-05-28 13:44:22 +08:00
MengqingCao cd67d6eeb5 add Ascend npu doc and dependency 2024-05-28 01:33:54 +00:00
hiyouga 08bd0440b5 add llava 1k datasets 2024-05-27 19:57:33 +08:00
hiyouga efa4b196ca add phi-3 7b/14b, mistral v0.3 models 2024-05-27 18:20:16 +08:00
hiyouga 5581cb2e4e update readme 2024-05-27 18:14:02 +08:00
hiyouga cb63b32986 support SimPO #3900 2024-05-26 23:46:33 +08:00