codemayq
|
026e88ab74
|
update wechat
|
2024-05-27 10:04:51 +08:00 |
hiyouga
|
cb63b32986
|
support SimPO #3900
|
2024-05-26 23:46:33 +08:00 |
BUAADreamer
|
60170a1da4
|
Merge branch 'hiyouga:main' into main
|
2024-05-25 14:18:49 +08:00 |
hiyouga
|
063f91cc80
|
fix #3853
|
2024-05-24 23:29:45 +08:00 |
seanzhang-zhichen
|
27cb51f7f8
|
Merge branch 'main' into add_dataset_sample_num
|
2024-05-24 15:57:47 +08:00 |
BUAADreamer
|
047a06a1e5
|
Merge branch 'hiyouga:main' into main
|
2024-05-24 09:50:00 +08:00 |
hiyouga
|
3a023bca2a
|
refactor data preprocessing, fix mllm rlhf
|
2024-05-24 04:08:25 +08:00 |
hoshi-hiyouga
|
a506f3628b
|
Merge pull request #3876 from dongdongqiang2018/main
added adapted to 910B image
|
2024-05-24 01:54:30 +08:00 |
hiyouga
|
de0e67aff1
|
fix paligemma sft
requires transformers>=4.41.1
|
2024-05-24 00:23:40 +08:00 |
hiyouga
|
67ebc7b388
|
fix oom issues in export
|
2024-05-23 23:32:45 +08:00 |
donggang
|
2f68a71fc0
|
adapted to 910B image
|
2024-05-23 09:48:22 +00:00 |
BUAADreamer
|
8d53ec2b5f
|
Merge branch 'hiyouga:main' into main
|
2024-05-21 22:18:20 +08:00 |
hiyouga
|
7134fb02bb
|
fix paligemma sft
|
2024-05-21 20:03:09 +08:00 |
hiyouga
|
4d647ddba5
|
Update README_zh.md
|
2024-05-21 18:30:59 +08:00 |
hiyouga
|
2670f6fb3d
|
update wechat
|
2024-05-21 18:22:32 +08:00 |
hiyouga
|
335501e228
|
fix #3847
|
2024-05-21 17:53:06 +08:00 |
hiyouga
|
789e73b0f4
|
Update wechat.jpg
|
2024-05-21 17:09:43 +08:00 |
BUAADreamer
|
29a6d5bdb8
|
support pretraining of llava
|
2024-05-21 08:57:14 +08:00 |
hiyouga
|
2a67457e39
|
support paligemma
|
2024-05-21 00:01:22 +08:00 |
hiyouga
|
e55c85ac72
|
fix paligemma data preprocess
|
2024-05-20 23:51:32 +08:00 |
hiyouga
|
542229abb3
|
fix paligemma inference
|
2024-05-20 23:36:43 +08:00 |
hiyouga
|
7262679666
|
fix #3818
|
2024-05-20 21:43:19 +08:00 |
hiyouga
|
9b0f4d7602
|
add kto to webui
|
2024-05-20 21:20:25 +08:00 |
zhangzc
|
d956041640
|
fix conflict
|
2024-05-20 17:10:01 +08:00 |
hiyouga
|
d52fae2fa8
|
fix chat engines
do not use pop(key, default) since api assigns None to dict values
|
2024-05-20 00:36:43 +08:00 |
hoshi-hiyouga
|
aa0bca49e9
|
Merge pull request #3812 from ycjcl868/feat/chat-support-system-prompt
feat: cli chat support system_message
|
2024-05-20 00:31:32 +08:00 |
hoshi-hiyouga
|
a0e8d3d159
|
Update vllm_engine.py
|
2024-05-20 00:31:04 +08:00 |
hoshi-hiyouga
|
a943a1034b
|
Update hf_engine.py
|
2024-05-20 00:30:45 +08:00 |
hoshi-hiyouga
|
a1fa7aa63b
|
Update generating_args.py
|
2024-05-20 00:29:31 +08:00 |
hoshi-hiyouga
|
896c656185
|
Update chat_model.py
|
2024-05-20 00:29:12 +08:00 |
hiyouga
|
10573e1639
|
fix jinja template
|
2024-05-19 23:38:30 +08:00 |
ycjcl868
|
a08ba254c8
|
feat: cli chat support system_message
|
2024-05-19 23:17:46 +08:00 |
hiyouga
|
31a0564d4f
|
fix zero2 high ram usage
|
2024-05-19 21:53:54 +08:00 |
hiyouga
|
70214b71b1
|
fix hf gen args
|
2024-05-19 19:39:32 +08:00 |
hiyouga
|
8ee8ac6eba
|
fix envs
|
2024-05-19 18:27:18 +08:00 |
hiyouga
|
1ebc890a5f
|
fix #3807
|
2024-05-19 17:07:57 +08:00 |
hiyouga
|
2bec28e328
|
update readme
|
2024-05-18 23:09:03 +08:00 |
hiyouga
|
3c2a992caa
|
safe output path in webui
|
2024-05-18 22:42:28 +08:00 |
hiyouga
|
d43822fcc2
|
fix jetmoe z3 block
|
2024-05-18 22:28:45 +08:00 |
hiyouga
|
a851056229
|
improve data process logger
|
2024-05-18 22:02:42 +08:00 |
hiyouga
|
ca48f90f1e
|
update data readme
|
2024-05-18 21:37:38 +08:00 |
hiyouga
|
18cbf8561d
|
update data readme
|
2024-05-18 21:15:20 +08:00 |
hiyouga
|
0edc16769f
|
fix #3803
|
2024-05-18 16:13:14 +08:00 |
hoshi-hiyouga
|
73d4a8e655
|
Merge pull request #3799 from hiyouga/dev
improve KTO impl, replace datasets
|
2024-05-18 03:49:13 +08:00 |
hiyouga
|
c450ee87a3
|
improve KTO impl., replace datasets
|
2024-05-18 03:44:56 +08:00 |
hoshi-hiyouga
|
33a354548e
|
Merge pull request #3785 from enji-zhou/feature/add_kto
add kto
|
2024-05-18 03:07:18 +08:00 |
hoshi-hiyouga
|
d7ff49f245
|
Merge pull request #3794 from jue-jue-zi/main
feat: pass the `max_lora_rank` parameter to vLLM backend
|
2024-05-17 16:17:30 +08:00 |
hoshi-hiyouga
|
9646727453
|
Update model_args.py
|
2024-05-17 16:16:41 +08:00 |
juejuezi
|
b20d62ba3c
|
feat: pass the `max_lora_rank` parameter to vLLM backend
|
2024-05-17 16:07:39 +08:00 |
hiyouga
|
8af9817605
|
add deepseek v2 lite model
|
2024-05-17 13:25:36 +08:00 |