Commit Graph

72 Commits

Author SHA1 Message Date
hiyouga cae4737907 lora modules: all by default 2024-06-06 03:53:28 +08:00
hiyouga dc4a00dd63 update train hparams 2024-06-06 01:49:20 +08:00
hiyouga 5a13b3baa6 tiny fix 2024-06-04 00:31:10 +08:00
hiyouga eed33862bc fix #4005 #4013 2024-06-03 19:12:29 +08:00
hiyouga 89ca832740 update readme 2024-05-29 18:39:11 +08:00
hiyouga 7c016b22aa support DDP in webui 2024-05-28 19:24:22 +08:00
hiyouga 30e1c8e745 update dpo examples 2024-05-27 19:56:04 +08:00
hiyouga cb63b32986 support SimPO #3900 2024-05-26 23:46:33 +08:00
hiyouga c450ee87a3 improve KTO impl., replace datasets 2024-05-18 03:44:56 +08:00
hiyouga e5bba7cf1b update badam example #3764 2024-05-17 02:21:10 +08:00
hiyouga ddec9e1b84 update examples 2024-05-17 01:02:00 +08:00
hiyouga 3df986c679 fix examples #3769 2024-05-16 19:12:09 +08:00
hiyouga 2a67ab3925 fix #3694 2024-05-16 00:35:28 +08:00
hiyouga 7e69e71a52 fix examples 2024-05-15 00:26:10 +08:00
hiyouga 5bdad46387 update examples 2024-05-15 00:05:17 +08:00
hiyouga af343034dd add npu examples 2024-05-14 23:32:53 +08:00
hiyouga dae83f4199 update examples 2024-05-13 20:39:36 +08:00
hiyouga b0888262e3 fix #3602 2024-05-07 17:50:27 +08:00
hiyouga 047313f48e update examples 2024-05-06 23:07:55 +08:00
hiyouga f02f87c6fb update example docs 2024-05-06 22:51:02 +08:00
hiyouga 34d33e2257 update docs 2024-05-06 21:47:00 +08:00
Oscar eeb415f6fa Fix badam example outdated argument 2024-05-05 23:35:19 +08:00
hiyouga 245fe47ece update webui and add CLIs 2024-05-03 02:58:23 +08:00
hiyouga 39e964a97a Update prepare.sh 2024-05-02 17:16:02 +08:00
hiyouga fc67b736ba fix llava qlora 2024-04-26 18:00:23 +08:00
hiyouga cd3a960f81 add llava to llamaboard 2024-04-26 06:41:35 +08:00
hiyouga e057c8de48 support mllm hf inference 2024-04-26 05:34:58 +08:00
BUAADreamer 68cdd9a020
Merge branch 'hiyouga:main' into main 2024-04-25 20:02:50 +08:00
BUAADreamer c6dd89918f merge data part to the text stream 2024-04-25 19:19:59 +08:00
hiyouga 3a7c1286ce add export_device in webui #3333 2024-04-25 19:02:32 +08:00
BUAADreamer cfb485eddf add llava and instructblip 2024-04-25 00:22:43 +08:00
BUAADreamer e1afbea68f add multimodal LLM BLIP-2 and InstructBLIP 2024-04-23 19:22:42 +08:00
BUAADreamer 4f3d558f67 add multimodal LLM BLIP-2 and InstructBLIP 2024-04-23 18:47:03 +08:00
BUAADreamer cde4dfe569
Merge branch 'hiyouga:main' into main 2024-04-23 18:46:12 +08:00
BUAADreamer 4dcb11eab7 add multimodal LLM BLIP-2 and InstructBLIP 2024-04-23 18:45:43 +08:00
hiyouga 2efd9b6ba0 update examples 2024-04-23 18:29:46 +08:00
hiyouga a1f1fac33b update readme and examples 2024-04-22 00:37:32 +08:00
hiyouga ddbd29d777 remove extras 2024-04-22 00:35:41 +08:00
hiyouga 5c62881c5a fix bug in galore optimizer 2024-04-21 18:53:22 +08:00
hiyouga f58425ab45 fix mod stuff 2024-04-21 18:11:10 +08:00
Marco 620add7b9f Added Mixture of Depths 2024-04-18 20:31:24 +02:00
hiyouga e3d8fc75eb support badam for all stages 2024-04-16 17:44:48 +08:00
hoshi-hiyouga 57dcd91e17
Update sft.sh 2024-04-16 17:25:40 +08:00
Jonery 7ecb61822b resolve gradient checkpointing issue. 2024-04-16 12:05:27 +08:00
Jonery 06c8908d3f Feature BAdam 2024-04-15 23:15:27 +08:00
hiyouga cce52351b5 update examples 2024-04-15 22:14:34 +08:00
khazic fe5d3bb8f0 Upgrade README.md 2024-04-13 20:50:49 +08:00
khazic 47111ce506 Added specimens for single-card full parameter prediction 2024-04-13 20:45:19 +08:00
hiyouga b87f8f1519 update examples 2024-04-04 14:48:21 +08:00
hiyouga fc7f1cc365 update examples 2024-04-02 21:09:25 +08:00