hoshi-hiyouga
|
aa60cd8910
|
Update label_issue.yml
|
2024-06-24 21:59:39 +08:00 |
hoshi-hiyouga
|
cc452c32c7
|
Merge pull request #4446 from stceum/bug-fix
Bug Fix: `off` is parsed as `False` in yaml file
|
2024-06-24 21:41:28 +08:00 |
hoshi-hiyouga
|
e90c424f55
|
Update parser.py
|
2024-06-24 21:37:42 +08:00 |
hoshi-hiyouga
|
a9b3d91952
|
Update test_attention.py
|
2024-06-24 21:35:34 +08:00 |
stceum
|
3ed063f281
|
Bug Fix: `off` is parsed as `False` in yaml file, changed to `disabled` to avoid this.
|
2024-06-24 20:39:31 +08:00 |
MengqingCao
|
90c74ff251
|
auto-label npu issue
|
2024-06-24 12:27:00 +00:00 |
MengqingCao
|
d7207e8ad1
|
update docker files
1. add docker-npu (Dockerfile and docker-compose.yml)
2. move cuda docker to docker-cuda and tiny changes to adapt to the new path
|
2024-06-24 10:57:36 +00:00 |
hiyouga
|
4ea84a8333
|
update readme
|
2024-06-24 18:29:04 +08:00 |
hiyouga
|
e507e60638
|
update readme
|
2024-06-24 18:22:12 +08:00 |
codemayq
|
5b897e7c35
|
update wechat
|
2024-06-22 11:57:39 +08:00 |
mMrBun
|
20e2e6fdcb
|
Add tool_format to overwrite tool formatter template
|
2024-06-22 02:13:23 +08:00 |
hiyouga
|
db9a1912e3
|
remove dup template
|
2024-06-22 01:31:32 +08:00 |
hiyouga
|
3ce44dda99
|
fix api
|
2024-06-22 00:00:38 +08:00 |
Erich Schubert
|
7d70ba7fb8
|
Print help if no arguments given
|
2024-06-21 09:14:21 +02:00 |
ancv
|
770f75dc83
|
move configure_packing to llamafactory.model.patcher and fix constants
|
2024-06-21 00:45:06 +07:00 |
hiyouga
|
8d4f5093cf
|
tiny fix
|
2024-06-20 22:56:05 +08:00 |
hoshi-hiyouga
|
a459624474
|
Merge pull request #4382 from MengqingCao/bugfix
upper bound numpy version to <2.0
|
2024-06-20 10:19:37 +08:00 |
MengqingCao
|
7d4a293033
|
update dependencies
|
2024-06-20 02:09:47 +00:00 |
hiyouga
|
f22d8f9ca4
|
improve llamaboard
|
2024-06-19 23:46:03 +08:00 |
hiyouga
|
3f84411b5d
|
fix llamaboard abort
|
2024-06-19 23:22:28 +08:00 |
hiyouga
|
3b040e8e0f
|
update patcher
|
2024-06-19 21:27:00 +08:00 |
hiyouga
|
42e69a3c63
|
set dev version
|
2024-06-19 21:08:16 +08:00 |
hiyouga
|
87e330fee5
|
Update publish.yml
|
2024-06-19 20:46:33 +08:00 |
hiyouga
|
71327ba85a
|
release v0.8.2
|
2024-06-19 20:42:09 +08:00 |
hiyouga
|
2b596fb55f
|
fix jinja template
|
2024-06-19 20:03:50 +08:00 |
hiyouga
|
4cff6a4ad5
|
fix templates
|
2024-06-19 17:44:05 +08:00 |
codingma
|
c48cbc371d
|
update wechat_npu.jpg
|
2024-06-19 14:02:24 +08:00 |
Jonery
|
5c2ff1b749
|
Cleaner integration.
|
2024-06-19 12:29:40 +08:00 |
hiyouga
|
6d2bf216ac
|
fix bug
|
2024-06-19 03:49:23 +08:00 |
hiyouga
|
4f22eae8f4
|
use prefix to replace force system
|
2024-06-19 03:39:52 +08:00 |
hiyouga
|
cd75b1fe9d
|
fix tool formatter, allow parallel function #4362
|
2024-06-19 03:23:51 +08:00 |
hoshi-hiyouga
|
c0ca42566c
|
Merge pull request #4173 from mMrBun/main
Implemented the tool_formatter and tool_extractor for glm4 and Qwen2 tool_format
|
2024-06-19 03:18:55 +08:00 |
hiyouga
|
9ab0401948
|
update data
|
2024-06-19 02:48:43 +08:00 |
hiyouga
|
344b9a36b2
|
tiny fix
|
2024-06-18 23:32:18 +08:00 |
hoshi-hiyouga
|
89a50dbfde
|
Merge pull request #4314 from EliMCosta/patch-2
Fix Dockerfile
|
2024-06-18 23:30:59 +08:00 |
hoshi-hiyouga
|
10316dd8ca
|
Merge pull request #4309 from EliMCosta/patch-1
Add Magpie and Webinstruct dataset samples
|
2024-06-18 23:30:19 +08:00 |
hiyouga
|
a233fbc258
|
add deepseek coder v2 #4346
|
2024-06-18 22:53:54 +08:00 |
hiyouga
|
4bd77d8563
|
fix #4357
|
2024-06-18 22:42:45 +08:00 |
hoshi-hiyouga
|
078040babd
|
Merge pull request #4334 from zzxzz12345/bugfix/add-pandas-versions
Update requirements.txt
|
2024-06-18 22:30:35 +08:00 |
hoshi-hiyouga
|
e8c518c08a
|
Update requirements.txt
|
2024-06-18 22:27:24 +08:00 |
hiyouga
|
c96264bc47
|
fix #4335
|
2024-06-18 22:08:56 +08:00 |
Jonery
|
97c5235160
|
add example
|
2024-06-18 13:50:26 +08:00 |
Jonery
|
8f7c78b641
|
fix typo
|
2024-06-18 12:39:26 +08:00 |
Jonery
|
0f72aac8c9
|
Support distributed BAdam.
|
2024-06-18 12:27:47 +08:00 |
hiyouga
|
24c160df3d
|
lint
|
2024-06-17 22:35:56 +08:00 |
hiyouga
|
7857c0990b
|
update chat engine #4335
|
2024-06-17 19:07:17 +08:00 |
hiyouga
|
fcb2e8e7b7
|
update readme
|
2024-06-17 18:47:24 +08:00 |
Jonery
|
ea1f3ba5e0
|
Merge remote-tracking branch 'upstream/main'
|
2024-06-17 18:44:51 +08:00 |
Jonery
|
b2fc9cc15f
|
update gitigore
|
2024-06-17 18:29:36 +08:00 |
Jonery
|
33b4372778
|
adapt for badam with ds zero3
|
2024-06-17 18:18:10 +08:00 |