Commit Graph

2073 Commits

Author SHA1 Message Date
hoshi-hiyouga a9b3d91952 Update test_attention.py 2024-06-24 21:35:34 +08:00
stceum 3ed063f281 Bug Fix: `off` is parsed as `False` in yaml file, changed to `disabled` to avoid this. 2024-06-24 20:39:31 +08:00
MengqingCao 90c74ff251 auto-label npu issue 2024-06-24 12:27:00 +00:00
MengqingCao d7207e8ad1 update docker files
  1. add docker-npu (Dockerfile and docker-compose.yml)
  2. move cuda docker to docker-cuda and tiny changes to adapt to the new path
2024-06-24 10:57:36 +00:00
hiyouga 4ea84a8333 update readme 2024-06-24 18:29:04 +08:00
hiyouga e507e60638 update readme 2024-06-24 18:22:12 +08:00
codemayq 5b897e7c35 update wechat 2024-06-22 11:57:39 +08:00
mMrBun 20e2e6fdcb Add tool_format to overwrite tool formatter template 2024-06-22 02:13:23 +08:00
hiyouga db9a1912e3 remove dup template 2024-06-22 01:31:32 +08:00
hiyouga 3ce44dda99 fix api 2024-06-22 00:00:38 +08:00
Erich Schubert 7d70ba7fb8 Print help if no arguments given 2024-06-21 09:14:21 +02:00
ancv 770f75dc83 move configure_packing to llamafactory.model.patcher and fix constants 2024-06-21 00:45:06 +07:00
hiyouga 8d4f5093cf tiny fix 2024-06-20 22:56:05 +08:00
hoshi-hiyouga a459624474 Merge pull request #4382 from MengqingCao/bugfix
upper bound numpy version to <2.0
2024-06-20 10:19:37 +08:00
MengqingCao 7d4a293033 update dependencies 2024-06-20 02:09:47 +00:00
hiyouga f22d8f9ca4 improve llamaboard 2024-06-19 23:46:03 +08:00
hiyouga 3f84411b5d fix llamaboard abort 2024-06-19 23:22:28 +08:00
hiyouga 3b040e8e0f update patcher 2024-06-19 21:27:00 +08:00
hiyouga 42e69a3c63 set dev version 2024-06-19 21:08:16 +08:00
hiyouga 87e330fee5 Update publish.yml 2024-06-19 20:46:33 +08:00
hiyouga 71327ba85a release v0.8.2 2024-06-19 20:42:09 +08:00
hiyouga 2b596fb55f fix jinja template 2024-06-19 20:03:50 +08:00
hiyouga 4cff6a4ad5 fix templates 2024-06-19 17:44:05 +08:00
codingma c48cbc371d update wechat_npu.jpg 2024-06-19 14:02:24 +08:00
Jonery 5c2ff1b749 Cleaner integration. 2024-06-19 12:29:40 +08:00
hiyouga 6d2bf216ac fix bug 2024-06-19 03:49:23 +08:00
hiyouga 4f22eae8f4 use prefix to replace force system 2024-06-19 03:39:52 +08:00
hiyouga cd75b1fe9d fix tool formatter, allow parallel function #4362 2024-06-19 03:23:51 +08:00
hoshi-hiyouga c0ca42566c Merge pull request #4173 from mMrBun/main
Implemented the tool_formatter and tool_extractor for glm4 and Qwen2 tool_format
2024-06-19 03:18:55 +08:00
hiyouga 9ab0401948 update data 2024-06-19 02:48:43 +08:00
hiyouga 344b9a36b2 tiny fix 2024-06-18 23:32:18 +08:00
hoshi-hiyouga 89a50dbfde Merge pull request #4314 from EliMCosta/patch-2
Fix Dockerfile
2024-06-18 23:30:59 +08:00
hoshi-hiyouga 10316dd8ca Merge pull request #4309 from EliMCosta/patch-1
Add Magpie and Webinstruct dataset samples
2024-06-18 23:30:19 +08:00
hiyouga a233fbc258 add deepseek coder v2 #4346 2024-06-18 22:53:54 +08:00
hiyouga 4bd77d8563 fix #4357 2024-06-18 22:42:45 +08:00
hoshi-hiyouga 078040babd Merge pull request #4334 from zzxzz12345/bugfix/add-pandas-versions
Update requirements.txt
2024-06-18 22:30:35 +08:00
hoshi-hiyouga e8c518c08a Update requirements.txt 2024-06-18 22:27:24 +08:00
hiyouga c96264bc47 fix #4335 2024-06-18 22:08:56 +08:00
Jonery 97c5235160 add example 2024-06-18 13:50:26 +08:00
Jonery 8f7c78b641 fix typo 2024-06-18 12:39:26 +08:00
Jonery 0f72aac8c9 Support distributed BAdam. 2024-06-18 12:27:47 +08:00
hiyouga 24c160df3d lint 2024-06-17 22:35:56 +08:00
hiyouga 7857c0990b update chat engine #4335 2024-06-17 19:07:17 +08:00
hiyouga fcb2e8e7b7 update readme 2024-06-17 18:47:24 +08:00
Jonery ea1f3ba5e0 Merge remote-tracking branch 'upstream/main' 2024-06-17 18:44:51 +08:00
Jonery b2fc9cc15f update gitigore 2024-06-17 18:29:36 +08:00
Jonery 33b4372778 adapt for badam with ds zero3 2024-06-17 18:18:10 +08:00
hiyouga e2665e71c7 fix #4326 2024-06-17 18:17:48 +08:00
hiyouga 72471ee046 Update wechat.jpg 2024-06-17 17:49:03 +08:00
hiyouga 2bf2863a58 tiny fix 2024-06-17 17:47:25 +08:00