hiyouga
|
12d79f89c5
|
add ultrafeedback and fineweb #4085 #4132
|
2024-06-08 02:42:34 +08:00 |
hiyouga
|
1c7f0ab519
|
init unittest
|
2024-06-08 01:35:58 +08:00 |
hiyouga
|
2702d7e952
|
fix ppo in trl 0.8.6
|
2024-06-07 04:48:29 +08:00 |
hiyouga
|
f9e818d79c
|
fix #4120
|
2024-06-07 04:18:05 +08:00 |
hiyouga
|
8e95648850
|
add qwen2 models
|
2024-06-07 00:22:57 +08:00 |
hiyouga
|
53eb2de75e
|
update readme
|
2024-06-06 16:59:18 +08:00 |
hiyouga
|
87a7822b98
|
update readme
|
2024-06-06 16:25:42 +08:00 |
hiyouga
|
cae4737907
|
lora modules: all by default
|
2024-06-06 03:53:28 +08:00 |
hiyouga
|
946f601136
|
support image input in api #3971 #4061
|
2024-06-06 02:29:55 +08:00 |
hiyouga
|
eef1e542a9
|
update readme
|
2024-06-05 16:32:32 +08:00 |
hiyouga
|
f48f5e646e
|
support glm-4
|
2024-06-05 15:16:38 +08:00 |
hiyouga
|
c4f50865ad
|
update readme
|
2024-05-30 16:40:17 +08:00 |
hiyouga
|
89ca832740
|
update readme
|
2024-05-29 18:39:11 +08:00 |
hoshi-hiyouga
|
880b4a9acf
|
Merge pull request #3930 from MengqingCao/npu
Add Ascend npu doc and dependency
|
2024-05-29 18:33:38 +08:00 |
MengqingCao
|
e14f5b37e4
|
update cann kernels url
|
2024-05-29 09:53:31 +00:00 |
hiyouga
|
087b9faa39
|
update readme
|
2024-05-28 19:35:52 +08:00 |
hiyouga
|
c8765349ba
|
update readme
|
2024-05-28 16:41:34 +08:00 |
hiyouga
|
99ee0dadd9
|
update readme
|
2024-05-28 16:19:56 +08:00 |
hiyouga
|
5d45adf47d
|
fix #3931
|
2024-05-28 13:44:22 +08:00 |
MengqingCao
|
cd67d6eeb5
|
add Ascend npu doc and dependency
|
2024-05-28 01:33:54 +00:00 |
hiyouga
|
08bd0440b5
|
add llava 1k datasets
|
2024-05-27 19:57:33 +08:00 |
hiyouga
|
efa4b196ca
|
add phi-3 7b/14b, mistral v0.3 models
|
2024-05-27 18:20:16 +08:00 |
hiyouga
|
5581cb2e4e
|
update readme
|
2024-05-27 18:14:02 +08:00 |
hiyouga
|
cb63b32986
|
support SimPO #3900
|
2024-05-26 23:46:33 +08:00 |
donggang
|
2f68a71fc0
|
adapted to 910B image
|
2024-05-23 09:48:22 +00:00 |
hiyouga
|
2670f6fb3d
|
update wechat
|
2024-05-21 18:22:32 +08:00 |
hiyouga
|
335501e228
|
fix #3847
|
2024-05-21 17:53:06 +08:00 |
hiyouga
|
2a67457e39
|
support paligemma
|
2024-05-21 00:01:22 +08:00 |
hiyouga
|
2bec28e328
|
update readme
|
2024-05-18 23:09:03 +08:00 |
hiyouga
|
c450ee87a3
|
improve KTO impl., replace datasets
|
2024-05-18 03:44:56 +08:00 |
hiyouga
|
d77bed4091
|
add falcon 11b
|
2024-05-17 00:08:33 +08:00 |
hiyouga
|
308edbc426
|
rename package
|
2024-05-16 18:39:08 +08:00 |
hiyouga
|
b2fc7aeb03
|
set dev version
|
2024-05-16 02:17:31 +08:00 |
hiyouga
|
a388cadfc0
|
add Yi-VL-34B model
|
2024-05-15 22:58:19 +08:00 |
hiyouga
|
73845fcc46
|
add yi-vl 6b model
|
2024-05-15 20:02:41 +08:00 |
hiyouga
|
e1f4e53915
|
add NPU docker images
|
2024-05-15 19:20:11 +08:00 |
hiyouga
|
b96d84835f
|
update readme
|
2024-05-14 23:57:08 +08:00 |
hiyouga
|
fc547ee591
|
update readme
|
2024-05-14 23:55:49 +08:00 |
hiyouga
|
c27afa296b
|
fix #3702
|
2024-05-13 18:24:35 +08:00 |
hiyouga
|
d12b8f866a
|
support Yi 1.5
|
2024-05-13 16:51:20 +08:00 |
hiyouga
|
58c522cd5c
|
remove checksum and fix ui args
|
2024-05-12 01:10:30 +08:00 |
hiyouga
|
638043ced4
|
update readme
|
2024-05-12 00:33:49 +08:00 |
hoshi-hiyouga
|
1049b29253
|
Update README_zh.md
|
2024-05-11 22:44:51 +08:00 |
BUAADreamer
|
508d474754
|
Merge branch 'hiyouga:main' into main
|
2024-05-10 20:34:41 +08:00 |
hiyouga
|
75aec4cf8e
|
resolve python 3.8 package
|
2024-05-09 16:52:27 +08:00 |
BUAADreamer
|
fdb3955448
|
add mllm processor save and Chinese-LLaVA-Med show
|
2024-05-09 13:53:39 +08:00 |
hiyouga
|
10ab83f4c4
|
add deepseek moe 236B
|
2024-05-08 16:37:54 +08:00 |
hiyouga
|
b3a9ae4085
|
update readme
|
2024-05-07 22:17:04 +08:00 |
hiyouga
|
92e9195b3c
|
update readme
|
2024-05-07 21:17:31 +08:00 |
hiyouga
|
5177f3ba90
|
update readme
|
2024-05-07 19:03:47 +08:00 |
Katehuuh
|
19a85bf52d
|
Update README_zh.md
Add Projects Nekochu/Luminia-13B-v3
|
2024-05-07 06:28:48 +02:00 |
hiyouga
|
8e09e20ece
|
update readme
|
2024-05-07 06:19:29 +08:00 |
hiyouga
|
f50c365871
|
update readme
|
2024-05-06 23:34:59 +08:00 |
hiyouga
|
f02f87c6fb
|
update example docs
|
2024-05-06 22:51:02 +08:00 |
hiyouga
|
34d33e2257
|
update docs
|
2024-05-06 21:47:00 +08:00 |
hiyouga
|
57a39783d1
|
update readme
|
2024-05-04 17:01:21 +08:00 |
hiyouga
|
d4283bb6bf
|
update readme
|
2024-05-04 00:43:53 +08:00 |
hiyouga
|
9d2ce57345
|
update readme and webui launch
|
2024-05-04 00:43:02 +08:00 |
hiyouga
|
1409654cef
|
update readme
|
2024-05-04 00:31:02 +08:00 |
hiyouga
|
245fe47ece
|
update webui and add CLIs
|
2024-05-03 02:58:23 +08:00 |
hiyouga
|
32347901d4
|
fix setup
|
2024-04-28 03:49:13 +08:00 |
hiyouga
|
5ee04d418c
|
update readme
|
2024-04-26 23:39:19 +08:00 |
hiyouga
|
031775ade8
|
update readme
|
2024-04-26 20:09:14 +08:00 |
hiyouga
|
375b25131b
|
support Qwen1.5 110B
|
2024-04-26 19:59:22 +08:00 |
hiyouga
|
e83e2fa897
|
update readme
|
2024-04-26 05:49:26 +08:00 |
hiyouga
|
27ba1b63ce
|
update readme
|
2024-04-26 05:44:30 +08:00 |
hiyouga
|
44a43ee152
|
add olmo 1.7
|
2024-04-24 05:50:50 +08:00 |
hiyouga
|
07737a3d2d
|
reenable sdpa and fast tok by default
|
2024-04-24 02:18:44 +08:00 |
hiyouga
|
1a13f05555
|
support phi-3
|
2024-04-24 00:28:53 +08:00 |
hiyouga
|
db7f3b9784
|
update readme
|
2024-04-22 17:09:17 +08:00 |
hiyouga
|
836ca05586
|
update readme
|
2024-04-22 00:51:35 +08:00 |
hiyouga
|
34d66a3a85
|
update readme
|
2024-04-22 00:42:25 +08:00 |
hiyouga
|
a1f1fac33b
|
update readme and examples
|
2024-04-22 00:37:32 +08:00 |
hiyouga
|
a83e7587a0
|
update readme
|
2024-04-22 00:21:01 +08:00 |
hiyouga
|
f58425ab45
|
fix mod stuff
|
2024-04-21 18:11:10 +08:00 |
Marco
|
620add7b9f
|
Added Mixture of Depths
|
2024-04-18 20:31:24 +02:00 |
hoshi-hiyouga
|
2aaaede247
|
support llama3
|
2024-04-19 01:13:50 +08:00 |
hiyouga
|
3b43a3b7c5
|
tiny fix
|
2024-04-18 00:22:17 +08:00 |
hiyouga
|
e2f1c6fc6a
|
update readme
|
2024-04-17 23:40:49 +08:00 |
hiyouga
|
cab0598fd0
|
add mixtral 8x22B models
|
2024-04-17 23:35:59 +08:00 |
hiyouga
|
5d62a51c12
|
update readme and gradio version
|
2024-04-16 18:09:16 +08:00 |
hiyouga
|
e3d8fc75eb
|
support badam for all stages
|
2024-04-16 17:44:48 +08:00 |
hiyouga
|
cf52911fed
|
update readme
|
2024-04-16 02:36:54 +08:00 |
hiyouga
|
6084eb7cf1
|
update readme
|
2024-04-16 02:35:36 +08:00 |
hiyouga
|
6543f3d449
|
add codegemma
|
2024-04-16 00:11:15 +08:00 |
hiyouga
|
e0dbac2845
|
support cohere commandR #3184
|
2024-04-15 23:26:42 +08:00 |
hiyouga
|
9d4c949461
|
release v0.6.2
|
2024-04-11 20:08:51 +08:00 |
hiyouga
|
a88fe8c1af
|
update readme
|
2024-04-07 00:48:24 +08:00 |
hiyouga
|
7f6e412604
|
fix requires for windows
|
2024-04-03 21:56:43 +08:00 |
hiyouga
|
49a2dfaf90
|
update vllm example
|
2024-04-02 22:45:20 +08:00 |
hiyouga
|
66b0fe4e96
|
update readme
|
2024-04-02 22:17:48 +08:00 |
hiyouga
|
7765f337c7
|
add zh readme
|
2024-04-02 20:58:45 +08:00 |
hiyouga
|
11a6c1bad6
|
update readme
|
2024-04-02 20:37:37 +08:00 |
hiyouga
|
949e5fe638
|
update readme
|
2024-04-02 20:22:11 +08:00 |
hiyouga
|
92dab8a90b
|
simplify readme
|
2024-04-02 20:07:43 +08:00 |
hiyouga
|
54b7d34908
|
add qwen1.5 moe
|
2024-04-01 21:49:40 +08:00 |
hiyouga
|
aee634cd20
|
fix #3077
|
2024-04-01 21:35:18 +08:00 |
hiyouga
|
099db6acc0
|
update readme
|
2024-03-31 18:46:34 +08:00 |
hiyouga
|
17bf8a2c3a
|
support ORPO
|
2024-03-31 18:29:50 +08:00 |
hiyouga
|
c1fe6ce782
|
update readme
|
2024-03-28 22:02:32 +08:00 |