hiyouga
|
047313f48e
|
update examples
|
2024-05-06 23:07:55 +08:00 |
hiyouga
|
f02f87c6fb
|
update example docs
|
2024-05-06 22:51:02 +08:00 |
hiyouga
|
34d33e2257
|
update docs
|
2024-05-06 21:47:00 +08:00 |
Oscar
|
eeb415f6fa
|
Fix badam example outdated argument
|
2024-05-05 23:35:19 +08:00 |
hiyouga
|
245fe47ece
|
update webui and add CLIs
|
2024-05-03 02:58:23 +08:00 |
hiyouga
|
39e964a97a
|
Update prepare.sh
|
2024-05-02 17:16:02 +08:00 |
hiyouga
|
fc67b736ba
|
fix llava qlora
|
2024-04-26 18:00:23 +08:00 |
hiyouga
|
cd3a960f81
|
add llava to llamaboard
|
2024-04-26 06:41:35 +08:00 |
hiyouga
|
e057c8de48
|
support mllm hf inference
|
2024-04-26 05:34:58 +08:00 |
BUAADreamer
|
68cdd9a020
|
Merge branch 'hiyouga:main' into main
|
2024-04-25 20:02:50 +08:00 |
BUAADreamer
|
c6dd89918f
|
merge data part to the text stream
|
2024-04-25 19:19:59 +08:00 |
hiyouga
|
3a7c1286ce
|
add export_device in webui #3333
|
2024-04-25 19:02:32 +08:00 |
BUAADreamer
|
cfb485eddf
|
add llava and instructblip
|
2024-04-25 00:22:43 +08:00 |
BUAADreamer
|
e1afbea68f
|
add multimodal LLM BLIP-2 and InstructBLIP
|
2024-04-23 19:22:42 +08:00 |
BUAADreamer
|
4f3d558f67
|
add multimodal LLM BLIP-2 and InstructBLIP
|
2024-04-23 18:47:03 +08:00 |
BUAADreamer
|
cde4dfe569
|
Merge branch 'hiyouga:main' into main
|
2024-04-23 18:46:12 +08:00 |
BUAADreamer
|
4dcb11eab7
|
add multimodal LLM BLIP-2 and InstructBLIP
|
2024-04-23 18:45:43 +08:00 |
hiyouga
|
2efd9b6ba0
|
update examples
|
2024-04-23 18:29:46 +08:00 |
hiyouga
|
a1f1fac33b
|
update readme and examples
|
2024-04-22 00:37:32 +08:00 |
hiyouga
|
ddbd29d777
|
remove extras
|
2024-04-22 00:35:41 +08:00 |
hiyouga
|
5c62881c5a
|
fix bug in galore optimizer
|
2024-04-21 18:53:22 +08:00 |
hiyouga
|
f58425ab45
|
fix mod stuff
|
2024-04-21 18:11:10 +08:00 |
Marco
|
620add7b9f
|
Added Mixture of Depths
|
2024-04-18 20:31:24 +02:00 |
hiyouga
|
e3d8fc75eb
|
support badam for all stages
|
2024-04-16 17:44:48 +08:00 |
hoshi-hiyouga
|
57dcd91e17
|
Update sft.sh
|
2024-04-16 17:25:40 +08:00 |
Jonery
|
7ecb61822b
|
resolve gradient checkpointing issue.
|
2024-04-16 12:05:27 +08:00 |
Jonery
|
06c8908d3f
|
Feature BAdam
|
2024-04-15 23:15:27 +08:00 |
hiyouga
|
cce52351b5
|
update examples
|
2024-04-15 22:14:34 +08:00 |
khazic
|
fe5d3bb8f0
|
Upgrade README.md
|
2024-04-13 20:50:49 +08:00 |
khazic
|
47111ce506
|
Added specimens for single-card full parameter prediction
|
2024-04-13 20:45:19 +08:00 |
hiyouga
|
b87f8f1519
|
update examples
|
2024-04-04 14:48:21 +08:00 |
hiyouga
|
fc7f1cc365
|
update examples
|
2024-04-02 21:09:25 +08:00 |
hiyouga
|
7765f337c7
|
add zh readme
|
2024-04-02 20:58:45 +08:00 |
hiyouga
|
f22eaeb5bc
|
update examples
|
2024-04-02 20:51:21 +08:00 |
hiyouga
|
31ffbde24d
|
update examples
|
2024-04-02 20:41:49 +08:00 |
hiyouga
|
11a6c1bad6
|
update readme
|
2024-04-02 20:37:37 +08:00 |
hiyouga
|
92dab8a90b
|
simplify readme
|
2024-04-02 20:07:43 +08:00 |
hiyouga
|
d0842f6828
|
update webui
|
2024-04-01 16:23:28 +08:00 |
hiyouga
|
17bf8a2c3a
|
support ORPO
|
2024-03-31 18:29:50 +08:00 |
hiyouga
|
8c77b10912
|
update trainers
|
2024-03-28 18:16:27 +08:00 |
hiyouga
|
b29d5560f1
|
fix #2981
|
2024-03-26 17:53:04 +08:00 |
hiyouga
|
8408225162
|
support fsdp + qlora
|
2024-03-21 00:36:06 +08:00 |
hiyouga
|
72367307df
|
improve lora+ impl.
|
2024-03-13 23:32:51 +08:00 |
齐保元
|
a0965cd62c
|
[FEATURE]: ADD LORA+ ALGORITHM
|
2024-03-13 19:43:27 +08:00 |
hiyouga
|
8664262cde
|
support layerwise galore
|
2024-03-10 00:24:11 +08:00 |
hiyouga
|
4c00bcdcae
|
update examples
|
2024-03-09 02:30:37 +08:00 |
hiyouga
|
10be2f0ecc
|
fix aqlm version
|
2024-03-09 00:09:09 +08:00 |
hiyouga
|
8a45213440
|
fix example params
|
2024-03-08 20:41:43 +08:00 |
hiyouga
|
33a4c24a8a
|
fix galore
|
2024-03-08 00:44:51 +08:00 |
hiyouga
|
7230e1177d
|
add galore examples
|
2024-03-07 22:53:45 +08:00 |