hoshi-hiyouga
|
d0daaa01f9
|
Merge pull request #2423 from mayflower/main
Support for German sft and dpo
|
2024-02-07 15:58:20 +08:00 |
Johann-Peter Hartmann
|
49c69ea4b9
|
WS fix
|
2024-02-06 20:13:04 +01:00 |
Johann-Peter Hartmann
|
1126563505
|
add ranking to dpo dataset
|
2024-02-06 20:12:36 +01:00 |
hiyouga
|
d6775c6522
|
Update tests.yml
|
2024-02-07 01:18:22 +08:00 |
hiyouga
|
88a1bc9773
|
lint
|
2024-02-07 01:10:04 +08:00 |
hiyouga
|
aacc823b0f
|
Update pyproject.toml
|
2024-02-07 00:45:58 +08:00 |
hiyouga
|
0ae9a16b9d
|
update gc kwargs
|
2024-02-07 00:38:24 +08:00 |
hiyouga
|
ebf31b62eb
|
fix #2438
|
2024-02-06 15:23:08 +08:00 |
hiyouga
|
85622ae757
|
add models
|
2024-02-06 14:57:23 +08:00 |
hiyouga
|
ccabb5b04a
|
support qwen1.5
|
2024-02-06 00:10:51 +08:00 |
hoshi-hiyouga
|
7f48c599eb
|
fix #2436
Update test_toolcall.py
|
2024-02-05 22:55:28 +08:00 |
hoshi-hiyouga
|
b5007afb19
|
Update test_toolcall.py
|
2024-02-05 22:51:03 +08:00 |
hoshi-hiyouga
|
d4994235f0
|
Update test_toolcall.py
|
2024-02-05 22:50:43 +08:00 |
tao.jun
|
4081c88723
|
Update test_toolcall.py
Add openai version notes
|
2024-02-05 20:49:23 +08:00 |
Johann-Peter Hartmann
|
1ecea9de63
|
Merge branch 'hiyouga:main' into main
|
2024-02-04 13:55:00 +00:00 |
hiyouga
|
3dc86c4af9
|
fix #2421
|
2024-02-04 21:02:55 +08:00 |
Johann-Peter Hartmann
|
63da6294dd
|
Merge branch 'hiyouga:main' into main
|
2024-02-04 12:51:25 +00:00 |
hiyouga
|
db0ab4d601
|
fix reserved label len
|
2024-02-04 17:54:26 +08:00 |
hiyouga
|
19d33ede13
|
fix #2420
|
2024-02-04 15:51:47 +08:00 |
hiyouga
|
b988ce0a0c
|
fix #2189
|
2024-02-04 00:47:37 +08:00 |
hiyouga
|
38e63bfd28
|
bump up transformers version
|
2024-02-04 00:01:16 +08:00 |
hiyouga
|
51df865734
|
fix #2397
|
2024-02-03 23:45:31 +08:00 |
hiyouga
|
6545c02790
|
add hint for freeze #2412
|
2024-02-03 23:38:56 +08:00 |
hiyouga
|
4ecadc3512
|
fix #2376
|
2024-02-03 23:14:31 +08:00 |
hiyouga
|
901faa16cc
|
support minicpm #2404
|
2024-02-03 22:36:46 +08:00 |
Johann-Peter Hartmann
|
b0ffde6e98
|
add simple German chatml template chatml_de
|
2024-02-03 09:01:15 +01:00 |
Johann-Peter Hartmann
|
870182c3a9
|
remove comma
|
2024-02-03 08:48:39 +01:00 |
Johann-Peter Hartmann
|
5318cc650e
|
Merge branch 'hiyouga:main' into main
|
2024-02-03 08:43:12 +01:00 |
hoshi-hiyouga
|
2eaaa9522f
|
Merge pull request #2411 from lxsyz/main
fix eos_token_id=0 bug
|
2024-02-02 17:38:16 +08:00 |
Fallen Angel
|
3399c0d645
|
fix eos_token_id=0 bug
when eos_token_id=0, will never add eos_token
|
2024-02-02 17:34:48 +08:00 |
Johann-Peter Hartmann
|
a5d559a89e
|
Merge branch 'hiyouga:main' into main
|
2024-02-01 07:13:31 +01:00 |
hiyouga
|
ec70eb5d8d
|
Update wechat.jpg
|
2024-02-01 11:01:12 +08:00 |
Johann-Peter Hartmann
|
4e27950acb
|
Merge branch 'hiyouga:main' into main
|
2024-01-31 14:05:52 +01:00 |
hiyouga
|
39bd5bd524
|
fix #2388
|
2024-01-31 17:23:56 +08:00 |
hiyouga
|
521ad76552
|
fix autoset attn impl, update data readme
|
2024-01-31 11:58:07 +08:00 |
Johann-Peter Hartmann
|
d9a8301ed4
|
Add support for German datasets
|
2024-01-30 10:18:01 +01:00 |
hiyouga
|
60733275f8
|
Update wechat.jpg
|
2024-01-25 13:02:49 +08:00 |
hiyouga
|
2bc30763e9
|
fix #2320
|
2024-01-24 16:19:18 +08:00 |
hoshi-hiyouga
|
51ad35b3c7
|
Merge pull request #2319 from ftgreat/main
Add patch_mixtral_replace_moe_impl for full training Mixtral using DeepSpeed Zero3
|
2024-01-24 15:32:26 +08:00 |
ldwang
|
c284665425
|
Add patch_mixtral_replace_moe_impl for full training Mixtral using DeepSpeed Zero3.
Signed-off-by: ldwang <ftgreat@gmail.com>
|
2024-01-24 15:25:31 +08:00 |
ldwang
|
18923b1402
|
Add patch_mixtral_replace_moe_impl for full training Mixtral using DeepSpeed Zero3.
Signed-off-by: ldwang <ftgreat@gmail.com>
|
2024-01-24 14:43:16 +08:00 |
hiyouga
|
dbaaa4546e
|
Update dataset_info.json
|
2024-01-23 00:10:32 +08:00 |
hiyouga
|
e4ba1deedf
|
add hint
|
2024-01-22 23:32:01 +08:00 |
hoshi-hiyouga
|
8e4b3a959a
|
Merge pull request #2283 from A-Cepheus/main
fix: ZeRO3 does not work with MoE models
|
2024-01-22 23:28:45 +08:00 |
hoshi-hiyouga
|
bdc9eff635
|
Update patcher.py
|
2024-01-22 23:27:39 +08:00 |
hoshi-hiyouga
|
bb5a3b85a6
|
Update tests.yml
|
2024-01-22 23:22:15 +08:00 |
hoshi-hiyouga
|
087476da99
|
Create tests.yml
|
2024-01-22 23:13:04 +08:00 |
hiyouga
|
b2fb0eca56
|
fix #2282 and update tool prompt
|
2024-01-22 22:27:30 +08:00 |
hiyouga
|
6fc2d5cc03
|
add orion models
|
2024-01-22 21:26:53 +08:00 |
A-Cepheus
|
b06a31e76a
|
🐞 fix: typo
|
2024-01-22 16:04:39 +08:00 |