Commit Graph

797 Commits

Author SHA1 Message Date
hiyouga 38af076a75 support longlora for main branch 2024-01-20 19:25:22 +08:00
hoshi-hiyouga bb92cdd0db Merge pull request #2201 from liu-zichen/token_embed_resize: support resize embed for zero3 2024-01-20 17:45:38 +08:00
hiyouga 8cbe4e9609 add upcast_lmhead option 2024-01-19 23:54:25 +08:00
hiyouga 0ff9a1fb4f set use_reentrant=False 2024-01-19 23:29:54 +08:00
hiyouga 12043aab9c fix #2249 2024-01-19 21:44:32 +08:00
hiyouga b6ec112beb add bf16 lora option 2024-01-19 16:29:03 +08:00
hiyouga 35aef8b287 fix function formatter 2024-01-18 16:01:07 +08:00
hiyouga ddd48ce8ab Update tuner.py 2024-01-18 15:06:02 +08:00
hiyouga b5ef993e34 update wechat 2024-01-18 14:54:38 +08:00
hiyouga a73a979afd fix templates 2024-01-18 14:49:52 +08:00
hiyouga 5edf7cce0e fix rm dataset 2024-01-18 14:45:37 +08:00
hiyouga 5ff10fac4f fix pretrain data loader 2024-01-18 14:42:52 +08:00
hoshi-hiyouga 9986cc6dd1 Merge pull request #2226 from hiyouga/dev: support function calling 2024-01-18 14:31:28 +08:00
hiyouga 5608a0da8e update readme 2024-01-18 14:30:48 +08:00
hiyouga 2abfe5fbc2 add tool hint 2024-01-18 13:19:09 +08:00
hiyouga 487dee066f fix dataset 2024-01-18 12:59:30 +08:00
hiyouga f1067d2b58 enable cutoff len 2024-01-18 12:25:42 +08:00
hiyouga 83dbfce8c3 add tool test 2024-01-18 10:26:26 +08:00
hiyouga d9f1cae351 support function calling 2024-01-18 09:54:23 +08:00
hiyouga 28135d787d Update llamafy_internlm2.py 2024-01-18 01:12:31 +08:00
hiyouga 484becae1b Update llamafy_internlm2.py 2024-01-18 01:00:16 +08:00
hiyouga c84a387c2c Update llamafy_internlm2.py 2024-01-18 00:49:31 +08:00
hiyouga f99140d5e8 fix llamafy scripts 2024-01-18 00:37:37 +08:00
hiyouga 7ff4c874d2 fix llamafy_internlm2 2024-01-18 00:26:14 +08:00
hiyouga f1d7ca77b1 add llamafy_internlm2 2024-01-18 00:17:41 +08:00
hiyouga 42859f0734 support export push_to_hub #2183 2024-01-16 23:59:42 +08:00
hiyouga a83fb6d3ff fix #2195 2024-01-16 23:53:50 +08:00
liuzc a5f6a7f4fb support resize embed for zero3 2024-01-16 15:16:20 +08:00
hiyouga 5a207bb723 tiny fix 2024-01-15 23:34:23 +08:00
hoshi-hiyouga 3aa8901994 Merge pull request #2194 from junuMoon/patch-1: fix: typo on README.md 2024-01-15 20:21:28 +08:00
Junu Moon(Fran) 7a320de097 fix: typo on README.md 2024-01-15 19:50:35 +09:00
hiyouga bf73224f33 support solar 10.7B #1907 2024-01-14 00:30:30 +08:00
hiyouga 3c8e72f585 Update README_zh.md 2024-01-14 00:17:28 +08:00
hiyouga ca3933dc52 support deepseek moe 2024-01-14 00:14:49 +08:00
hiyouga d1a73fe26c fix phi modules 2024-01-13 23:12:47 +08:00
hiyouga 9aa1a2fc17 fix #2147 2024-01-12 03:30:56 +08:00
hiyouga 4b2d11ec28 fix #2164 2024-01-12 00:27:57 +08:00
hoshi-hiyouga 7bf6612f4a Merge pull request #2163 from JessyTsu1/main: Request to add "Projects using LLaMA Factory" 2024-01-11 23:33:29 +08:00
JessyTsu1 8c5e4a8896 Update README.md 2024-01-11 23:18:29 +08:00
JessyTsu1 cdeca0cabc Update README_zh.md 2024-01-11 23:17:48 +08:00
JessyTsu1 d72aff5ae6 Update README.md 2024-01-11 23:17:00 +08:00
hiyouga 898ec3696a fix #2161 2024-01-11 17:04:13 +08:00
hiyouga 1653c22438 improve web ui 2024-01-10 12:37:45 +08:00
hiyouga 05ed4e8028 improve model export 2024-01-09 22:26:24 +08:00
hiyouga 6b0705bed8 Update wechat.jpg 2024-01-09 22:10:41 +08:00
hiyouga 919acc2b0b modify weight name 2024-01-09 20:22:47 +08:00
hiyouga 4571068e1e fix #1789 2024-01-09 18:31:27 +08:00
hiyouga ebee4f6a2a fix #2127 2024-01-09 14:49:13 +08:00
hiyouga 3ae735ffe8 fix #2125 2024-01-08 21:42:25 +08:00
hiyouga 0ed526cedf Merge branch 'main' of https://github.com/hiyouga/LLaMA-Factory 2024-01-08 14:31:04 +08:00