hiyouga
|
38af076a75
|
support longlora for main branch
|
2024-01-20 19:25:22 +08:00 |
hoshi-hiyouga
|
bb92cdd0db
|
Merge pull request #2201 from liu-zichen/token_embed_resize
support resize embed for zero3
|
2024-01-20 17:45:38 +08:00 |
hiyouga
|
8cbe4e9609
|
add upcast_lmhead option
|
2024-01-19 23:54:25 +08:00 |
hiyouga
|
0ff9a1fb4f
|
set use_reentrant=False
|
2024-01-19 23:29:54 +08:00 |
hiyouga
|
12043aab9c
|
fix #2249
|
2024-01-19 21:44:32 +08:00 |
hiyouga
|
b6ec112beb
|
add bf16 lora option
|
2024-01-19 16:29:03 +08:00 |
hiyouga
|
35aef8b287
|
fix function formatter
|
2024-01-18 16:01:07 +08:00 |
hiyouga
|
ddd48ce8ab
|
Update tuner.py
|
2024-01-18 15:06:02 +08:00 |
hiyouga
|
b5ef993e34
|
update wechat
|
2024-01-18 14:54:38 +08:00 |
hiyouga
|
a73a979afd
|
fix templates
|
2024-01-18 14:49:52 +08:00 |
hiyouga
|
5edf7cce0e
|
fix rm dataset
|
2024-01-18 14:45:37 +08:00 |
hiyouga
|
5ff10fac4f
|
fix pretrain data loader
|
2024-01-18 14:42:52 +08:00 |
hoshi-hiyouga
|
9986cc6dd1
|
Merge pull request #2226 from hiyouga/dev
support function calling
|
2024-01-18 14:31:28 +08:00 |
hiyouga
|
5608a0da8e
|
update readme
|
2024-01-18 14:30:48 +08:00 |
hiyouga
|
2abfe5fbc2
|
add tool hint
|
2024-01-18 13:19:09 +08:00 |
hiyouga
|
487dee066f
|
fix dataset
|
2024-01-18 12:59:30 +08:00 |
hiyouga
|
f1067d2b58
|
enable cutoff len
|
2024-01-18 12:25:42 +08:00 |
hiyouga
|
83dbfce8c3
|
add tool test
|
2024-01-18 10:26:26 +08:00 |
hiyouga
|
d9f1cae351
|
support function calling
|
2024-01-18 09:54:23 +08:00 |
hiyouga
|
28135d787d
|
Update llamafy_internlm2.py
|
2024-01-18 01:12:31 +08:00 |
hiyouga
|
484becae1b
|
Update llamafy_internlm2.py
|
2024-01-18 01:00:16 +08:00 |
hiyouga
|
c84a387c2c
|
Update llamafy_internlm2.py
|
2024-01-18 00:49:31 +08:00 |
hiyouga
|
f99140d5e8
|
fix llamafy scripts
|
2024-01-18 00:37:37 +08:00 |
hiyouga
|
7ff4c874d2
|
fix llamafy_internlm2
|
2024-01-18 00:26:14 +08:00 |
hiyouga
|
f1d7ca77b1
|
add llamafy_internlm2
|
2024-01-18 00:17:41 +08:00 |
hiyouga
|
42859f0734
|
support export push_to_hub #2183
|
2024-01-16 23:59:42 +08:00 |
hiyouga
|
a83fb6d3ff
|
fix #2195
|
2024-01-16 23:53:50 +08:00 |
liuzc
|
a5f6a7f4fb
|
support resize embed for zero3
|
2024-01-16 15:16:20 +08:00 |
hiyouga
|
5a207bb723
|
tiny fix
|
2024-01-15 23:34:23 +08:00 |
hoshi-hiyouga
|
3aa8901994
|
Merge pull request #2194 from junuMoon/patch-1
fix: typo on README.md
|
2024-01-15 20:21:28 +08:00 |
Junu Moon(Fran)
|
7a320de097
|
fix: typo on README.md
|
2024-01-15 19:50:35 +09:00 |
hiyouga
|
bf73224f33
|
support solar 10.7B #1907
|
2024-01-14 00:30:30 +08:00 |
hiyouga
|
3c8e72f585
|
Update README_zh.md
|
2024-01-14 00:17:28 +08:00 |
hiyouga
|
ca3933dc52
|
support deepseek moe
|
2024-01-14 00:14:49 +08:00 |
hiyouga
|
d1a73fe26c
|
fix phi modules
|
2024-01-13 23:12:47 +08:00 |
hiyouga
|
9aa1a2fc17
|
fix #2147
|
2024-01-12 03:30:56 +08:00 |
hiyouga
|
4b2d11ec28
|
fix #2164
|
2024-01-12 00:27:57 +08:00 |
hoshi-hiyouga
|
7bf6612f4a
|
Merge pull request #2163 from JessyTsu1/main
请求添加"Projects using LLaMA Factory"
|
2024-01-11 23:33:29 +08:00 |
JessyTsu1
|
8c5e4a8896
|
Update README.md
|
2024-01-11 23:18:29 +08:00 |
JessyTsu1
|
cdeca0cabc
|
Update README_zh.md
|
2024-01-11 23:17:48 +08:00 |
JessyTsu1
|
d72aff5ae6
|
Update README.md
|
2024-01-11 23:17:00 +08:00 |
hiyouga
|
898ec3696a
|
fix #2161
|
2024-01-11 17:04:13 +08:00 |
hiyouga
|
1653c22438
|
improve web ui
|
2024-01-10 12:37:45 +08:00 |
hiyouga
|
05ed4e8028
|
improve model export
|
2024-01-09 22:26:24 +08:00 |
hiyouga
|
6b0705bed8
|
Update wechat.jpg
|
2024-01-09 22:10:41 +08:00 |
hiyouga
|
919acc2b0b
|
modify weight name
|
2024-01-09 20:22:47 +08:00 |
hiyouga
|
4571068e1e
|
fix #1789
|
2024-01-09 18:31:27 +08:00 |
hiyouga
|
ebee4f6a2a
|
fix #2127
|
2024-01-09 14:49:13 +08:00 |
hiyouga
|
3ae735ffe8
|
fix #2125
|
2024-01-08 21:42:25 +08:00 |
hiyouga
|
0ed526cedf
|
Merge branch 'main' of https://github.com/hiyouga/LLaMA-Factory
|
2024-01-08 14:31:04 +08:00 |