hiyouga
|
dc770efb14
|
add qwen2 math models
|
2024-08-09 20:20:35 +08:00 |
hiyouga
|
e2a28f51c6
|
add adam_mini to readme
|
2024-08-09 20:02:03 +08:00 |
hiyouga
|
86f7099fa3
|
update scripts
|
2024-08-09 19:16:23 +08:00 |
hiyouga
|
b7ca6c8dc1
|
fix #5048
|
2024-08-05 23:48:19 +08:00 |
hoshi-hiyouga
|
9e409eadb0
|
Update README.md
|
2024-07-30 01:53:19 +08:00 |
hoshi-hiyouga
|
8d5a41f2cd
|
Update README.md
|
2024-07-30 01:52:35 +08:00 |
liudan
|
b9ed9d45cc
|
增加了MiniCPM在页面首页的支持列表,MiniCPM官方github也放了LLama_factory的友情链接
|
2024-07-29 10:58:28 +08:00 |
hiyouga
|
668654b5ad
|
tiny fix
|
2024-07-26 11:51:00 +08:00 |
hoshi-hiyouga
|
b8896b9b8b
|
Merge pull request #4970 from HardAndHeavy/add-rocm
Add ROCm support
|
2024-07-26 11:41:23 +08:00 |
hoshi-hiyouga
|
1186ad53d4
|
Update README.md
|
2024-07-26 11:29:28 +08:00 |
hoshi-hiyouga
|
f97beca23a
|
Update README.md
|
2024-07-26 11:29:09 +08:00 |
HardAndHeavy
|
c8e18a669a
|
Add ROCm support
|
2024-07-25 21:29:28 +03:00 |
khazic
|
ceba96f9ed
|
Added the reference address for TRL PPO details.
|
2024-07-25 09:03:21 +08:00 |
hiyouga
|
77cff78863
|
fix #4959
|
2024-07-24 23:44:00 +08:00 |
hoshi-hiyouga
|
5626bdc56d
|
Update README.md
|
2024-07-24 21:07:14 +08:00 |
hiyouga
|
26533c0604
|
add llama3.1
|
2024-07-24 16:20:11 +08:00 |
hiyouga
|
87346c0946
|
update readme
|
2024-07-03 19:39:05 +08:00 |
wangzhihong
|
22da47ba27
|
add LazyLLM to `Projects using LLaMA Factory` in `README.md`
|
2024-07-03 11:12:20 +08:00 |
hiyouga
|
d4e2af1fa4
|
update readme
|
2024-07-01 00:22:52 +08:00 |
hiyouga
|
d74244d568
|
fix #4398 #4592
|
2024-06-30 21:28:51 +08:00 |
hiyouga
|
0e0d69b77c
|
update readme
|
2024-06-28 06:55:19 +08:00 |
hiyouga
|
6f63050e1b
|
add Gemma2 models
|
2024-06-28 01:26:50 +08:00 |
hiyouga
|
e44a4f07f0
|
tiny fix
|
2024-06-27 20:14:48 +08:00 |
hoshi-hiyouga
|
64b131dcfa
|
Merge pull request #4461 from hzhaoy/feature/support-flash-attn
support flash-attn in Dockerfile
|
2024-06-27 20:05:26 +08:00 |
hiyouga
|
ad144c2265
|
support HQQ/EETQ #4113
|
2024-06-27 00:29:42 +08:00 |
hzhaoy
|
e19491b0f0
|
add flash-attn installation flag in Dockerfile
|
2024-06-27 00:13:30 +08:00 |
hiyouga
|
efb81b25ec
|
fix #4419
|
2024-06-25 01:51:29 +08:00 |
hiyouga
|
41086059b1
|
tiny fix
|
2024-06-25 01:15:19 +08:00 |
hoshi-hiyouga
|
5dc8fa647e
|
Update README.md
|
2024-06-25 01:03:38 +08:00 |
MengqingCao
|
d7207e8ad1
|
update docker files
1. add docker-npu (Dockerfile and docker-compose.yml)
2. move cuda docker to docker-cuda and tiny changes to adapt to the new path
|
2024-06-24 10:57:36 +00:00 |
hiyouga
|
e507e60638
|
update readme
|
2024-06-24 18:22:12 +08:00 |
hiyouga
|
344b9a36b2
|
tiny fix
|
2024-06-18 23:32:18 +08:00 |
hoshi-hiyouga
|
10316dd8ca
|
Merge pull request #4309 from EliMCosta/patch-1
Add Magpie and Webinstruct dataset samples
|
2024-06-18 23:30:19 +08:00 |
hiyouga
|
a233fbc258
|
add deepseek coder v2 #4346
|
2024-06-18 22:53:54 +08:00 |
hiyouga
|
fcb2e8e7b7
|
update readme
|
2024-06-17 18:47:24 +08:00 |
Eli Costa
|
103664203c
|
Update README.md
Add Magpie and Webinstruct to README
|
2024-06-16 11:19:25 -03:00 |
hiyouga
|
8c1046d78a
|
support pissa
|
2024-06-16 01:08:12 +08:00 |
hiyouga
|
acd84ce535
|
update readme
|
2024-06-15 05:13:16 +08:00 |
hiyouga
|
b6e008c152
|
update examples
|
2024-06-13 03:15:06 +08:00 |
hiyouga
|
c7a5620ccc
|
add neo-sft dataset
|
2024-06-13 01:00:56 +08:00 |
hiyouga
|
713fde4259
|
fix lint
|
2024-06-13 00:48:44 +08:00 |
hiyouga
|
947a34f53b
|
fix docker compose usage
|
2024-06-13 00:07:48 +08:00 |
hiyouga
|
2ce2e5bc47
|
update readme
|
2024-06-12 17:39:12 +08:00 |
hiyouga
|
949e9908ad
|
fix #4145
Fix the docker image
|
2024-06-11 00:19:17 +08:00 |
-.-
|
483cdd9b6a
|
fix README
|
2024-06-08 23:51:56 +08:00 |
hiyouga
|
12d79f89c5
|
add ultrafeedback and fineweb #4085 #4132
|
2024-06-08 02:42:34 +08:00 |
hiyouga
|
1c7f0ab519
|
init unittest
|
2024-06-08 01:35:58 +08:00 |
hiyouga
|
2702d7e952
|
fix ppo in trl 0.8.6
|
2024-06-07 04:48:29 +08:00 |
hiyouga
|
f9e818d79c
|
fix #4120
|
2024-06-07 04:18:05 +08:00 |
hiyouga
|
8e95648850
|
add qwen2 models
|
2024-06-07 00:22:57 +08:00 |