hiyouga
|
d4e2af1fa4
|
update readme
|
2024-07-01 00:22:52 +08:00 |
hiyouga
|
d74244d568
|
fix #4398 #4592
|
2024-06-30 21:28:51 +08:00 |
hiyouga
|
0e0d69b77c
|
update readme
|
2024-06-28 06:55:19 +08:00 |
hiyouga
|
6f63050e1b
|
add Gemma2 models
|
2024-06-28 01:26:50 +08:00 |
hiyouga
|
e44a4f07f0
|
tiny fix
|
2024-06-27 20:14:48 +08:00 |
hoshi-hiyouga
|
64b131dcfa
|
Merge pull request #4461 from hzhaoy/feature/support-flash-attn
support flash-attn in Dockerfile
|
2024-06-27 20:05:26 +08:00 |
hiyouga
|
ad144c2265
|
support HQQ/EETQ #4113
|
2024-06-27 00:29:42 +08:00 |
hzhaoy
|
e19491b0f0
|
add flash-attn installation flag in Dockerfile
|
2024-06-27 00:13:30 +08:00 |
hiyouga
|
efb81b25ec
|
fix #4419
|
2024-06-25 01:51:29 +08:00 |
hiyouga
|
41086059b1
|
tiny fix
|
2024-06-25 01:15:19 +08:00 |
hoshi-hiyouga
|
ec95f942d1
|
Update README_zh.md
|
2024-06-25 01:06:59 +08:00 |
MengqingCao
|
d7207e8ad1
|
update docker files
1. add docker-npu (Dockerfile and docker-compose.yml)
2. move cuda docker to docker-cuda and tiny changes to adapt to the new path
|
2024-06-24 10:57:36 +00:00 |
hiyouga
|
4ea84a8333
|
update readme
|
2024-06-24 18:29:04 +08:00 |
hiyouga
|
e507e60638
|
update readme
|
2024-06-24 18:22:12 +08:00 |
hiyouga
|
344b9a36b2
|
tiny fix
|
2024-06-18 23:32:18 +08:00 |
hoshi-hiyouga
|
10316dd8ca
|
Merge pull request #4309 from EliMCosta/patch-1
Add Magpie and Webinstruct dataset samples
|
2024-06-18 23:30:19 +08:00 |
hiyouga
|
a233fbc258
|
add deepseek coder v2 #4346
|
2024-06-18 22:53:54 +08:00 |
hiyouga
|
fcb2e8e7b7
|
update readme
|
2024-06-17 18:47:24 +08:00 |
Eli Costa
|
3ec57ac239
|
Update README_zh.md
Fix details tag in datasets menus
|
2024-06-16 11:34:31 -03:00 |
Eli Costa
|
82d5c5c1e8
|
Update README_zh.md
Add Magpie and WebInstruct to README
|
2024-06-16 11:22:06 -03:00 |
hiyouga
|
8c1046d78a
|
support pissa
|
2024-06-16 01:08:12 +08:00 |
hiyouga
|
acd84ce535
|
update readme
|
2024-06-15 05:13:16 +08:00 |
hiyouga
|
b6e008c152
|
update examples
|
2024-06-13 03:15:06 +08:00 |
hiyouga
|
c7a5620ccc
|
add neo-sft dataset
|
2024-06-13 01:00:56 +08:00 |
hiyouga
|
713fde4259
|
fix lint
|
2024-06-13 00:48:44 +08:00 |
hiyouga
|
947a34f53b
|
fix docker compose usage
|
2024-06-13 00:07:48 +08:00 |
hiyouga
|
2ce2e5bc47
|
update readme
|
2024-06-12 17:39:12 +08:00 |
hiyouga
|
949e9908ad
|
fix #4145
Fix the docker image
|
2024-06-11 00:19:17 +08:00 |
-.-
|
483cdd9b6a
|
fix README
|
2024-06-08 23:51:56 +08:00 |
hiyouga
|
12d79f89c5
|
add ultrafeedback and fineweb #4085 #4132
|
2024-06-08 02:42:34 +08:00 |
hiyouga
|
1c7f0ab519
|
init unittest
|
2024-06-08 01:35:58 +08:00 |
hiyouga
|
2702d7e952
|
fix ppo in trl 0.8.6
|
2024-06-07 04:48:29 +08:00 |
hiyouga
|
f9e818d79c
|
fix #4120
|
2024-06-07 04:18:05 +08:00 |
hiyouga
|
8e95648850
|
add qwen2 models
|
2024-06-07 00:22:57 +08:00 |
hiyouga
|
53eb2de75e
|
update readme
|
2024-06-06 16:59:18 +08:00 |
hiyouga
|
87a7822b98
|
update readme
|
2024-06-06 16:25:42 +08:00 |
hiyouga
|
cae4737907
|
lora modules: all by default
|
2024-06-06 03:53:28 +08:00 |
hiyouga
|
946f601136
|
support image input in api #3971 #4061
|
2024-06-06 02:29:55 +08:00 |
hiyouga
|
eef1e542a9
|
update readme
|
2024-06-05 16:32:32 +08:00 |
hiyouga
|
f48f5e646e
|
support glm-4
|
2024-06-05 15:16:38 +08:00 |
hiyouga
|
c4f50865ad
|
update readme
|
2024-05-30 16:40:17 +08:00 |
hiyouga
|
89ca832740
|
update readme
|
2024-05-29 18:39:11 +08:00 |
hoshi-hiyouga
|
880b4a9acf
|
Merge pull request #3930 from MengqingCao/npu
Add Ascend npu doc and dependency
|
2024-05-29 18:33:38 +08:00 |
MengqingCao
|
e14f5b37e4
|
update cann kernels url
|
2024-05-29 09:53:31 +00:00 |
hiyouga
|
087b9faa39
|
update readme
|
2024-05-28 19:35:52 +08:00 |
hiyouga
|
c8765349ba
|
update readme
|
2024-05-28 16:41:34 +08:00 |
hiyouga
|
99ee0dadd9
|
update readme
|
2024-05-28 16:19:56 +08:00 |
hiyouga
|
5d45adf47d
|
fix #3931
|
2024-05-28 13:44:22 +08:00 |
MengqingCao
|
cd67d6eeb5
|
add Ascend npu doc and dependency
|
2024-05-28 01:33:54 +00:00 |
hiyouga
|
08bd0440b5
|
add llava 1k datasets
|
2024-05-27 19:57:33 +08:00 |