Commit Graph

375 Commits

Author SHA1 Message Date
hiyouga 2702d7e952 fix ppo in trl 0.8.6 2024-06-07 04:48:29 +08:00
hiyouga f9e818d79c fix #4120 2024-06-07 04:18:05 +08:00
hiyouga 8e95648850 add qwen2 models 2024-06-07 00:22:57 +08:00
hiyouga 53eb2de75e update readme 2024-06-06 16:59:18 +08:00
hiyouga 87a7822b98 update readme 2024-06-06 16:25:42 +08:00
hiyouga cae4737907 lora modules: all by default 2024-06-06 03:53:28 +08:00
hiyouga 946f601136 support image input in api #3971 #4061 2024-06-06 02:29:55 +08:00
hiyouga eef1e542a9 update readme 2024-06-05 16:32:32 +08:00
hiyouga f48f5e646e support glm-4 2024-06-05 15:16:38 +08:00
hiyouga c4f50865ad update readme 2024-05-30 16:40:17 +08:00
hiyouga 89ca832740 update readme 2024-05-29 18:39:11 +08:00
hoshi-hiyouga 880b4a9acf
Merge pull request #3930 from MengqingCao/npu
Add Ascend npu doc and dependency
2024-05-29 18:33:38 +08:00
MengqingCao e14f5b37e4 update cann kernels url 2024-05-29 09:53:31 +00:00
hiyouga 087b9faa39 update readme 2024-05-28 19:35:52 +08:00
hiyouga c8765349ba update readme 2024-05-28 16:41:34 +08:00
hiyouga 99ee0dadd9 update readme 2024-05-28 16:19:56 +08:00
hiyouga 5d45adf47d fix #3931 2024-05-28 13:44:22 +08:00
MengqingCao cd67d6eeb5 add Ascend npu doc and dependency 2024-05-28 01:33:54 +00:00
hiyouga 08bd0440b5 add llava 1k datasets 2024-05-27 19:57:33 +08:00
hiyouga efa4b196ca add phi-3 7b/14b, mistral v0.3 models 2024-05-27 18:20:16 +08:00
hiyouga 5581cb2e4e update readme 2024-05-27 18:14:02 +08:00
hiyouga cb63b32986 support SimPO #3900 2024-05-26 23:46:33 +08:00
donggang 2f68a71fc0 adapted to 910B image 2024-05-23 09:48:22 +00:00
hiyouga 2670f6fb3d update wechat 2024-05-21 18:22:32 +08:00
hiyouga 335501e228 fix #3847 2024-05-21 17:53:06 +08:00
hiyouga 2a67457e39 support paligemma 2024-05-21 00:01:22 +08:00
hiyouga 2bec28e328 update readme 2024-05-18 23:09:03 +08:00
hiyouga c450ee87a3 improve KTO impl., replace datasets 2024-05-18 03:44:56 +08:00
hiyouga d77bed4091 add falcon 11b 2024-05-17 00:08:33 +08:00
hiyouga 308edbc426 rename package 2024-05-16 18:39:08 +08:00
hiyouga b2fc7aeb03 set dev version 2024-05-16 02:17:31 +08:00
hiyouga a388cadfc0 add Yi-VL-34B model 2024-05-15 22:58:19 +08:00
hiyouga 73845fcc46 add yi-vl 6b model 2024-05-15 20:02:41 +08:00
hiyouga e1f4e53915 add NPU docker images 2024-05-15 19:20:11 +08:00
hiyouga b96d84835f update readme 2024-05-14 23:57:08 +08:00
hiyouga fc547ee591 update readme 2024-05-14 23:55:49 +08:00
hiyouga c27afa296b fix #3702 2024-05-13 18:24:35 +08:00
hiyouga d12b8f866a support Yi 1.5 2024-05-13 16:51:20 +08:00
hiyouga 58c522cd5c remove checksum and fix ui args 2024-05-12 01:10:30 +08:00
hiyouga 638043ced4 update readme 2024-05-12 00:33:49 +08:00
hoshi-hiyouga b8d5d9c8ef
Update README.md 2024-05-11 22:43:04 +08:00
BUAADreamer 508d474754
Merge branch 'hiyouga:main' into main 2024-05-10 20:34:41 +08:00
hiyouga 75aec4cf8e resolve python 3.8 package 2024-05-09 16:52:27 +08:00
BUAADreamer fdb3955448 add mllm processor save and Chinese-LLaVA-Med show 2024-05-09 13:53:39 +08:00
hiyouga 10ab83f4c4 add deepseek moe 236B 2024-05-08 16:37:54 +08:00
hiyouga b3a9ae4085 update readme 2024-05-07 22:17:04 +08:00
hiyouga 92e9195b3c update readme 2024-05-07 21:17:31 +08:00
hiyouga 5177f3ba90 update readme 2024-05-07 19:03:47 +08:00
Katehuuh 984f7fbbf7
Update README.md
Add Projects Nekochu/Luminia-13B-v3
2024-05-07 06:23:36 +02:00
hiyouga 8e09e20ece update readme 2024-05-07 06:19:29 +08:00
hiyouga f50c365871 update readme 2024-05-06 23:34:59 +08:00
hiyouga f02f87c6fb update example docs 2024-05-06 22:51:02 +08:00
hiyouga 34d33e2257 update docs 2024-05-06 21:47:00 +08:00
hiyouga 57a39783d1 update readme 2024-05-04 17:01:21 +08:00
hiyouga d4283bb6bf update readme 2024-05-04 00:43:53 +08:00
hiyouga 9d2ce57345 update readme and webui launch 2024-05-04 00:43:02 +08:00
hiyouga 1409654cef update readme 2024-05-04 00:31:02 +08:00
hiyouga 245fe47ece update webui and add CLIs 2024-05-03 02:58:23 +08:00
hiyouga 32347901d4 fix setup 2024-04-28 03:49:13 +08:00
hiyouga 5ee04d418c update readme 2024-04-26 23:39:19 +08:00
hiyouga 031775ade8 update readme 2024-04-26 20:09:14 +08:00
hiyouga 375b25131b support Qwen1.5 110B 2024-04-26 19:59:22 +08:00
hiyouga e83e2fa897 update readme 2024-04-26 05:49:26 +08:00
hiyouga 27ba1b63ce update readme 2024-04-26 05:44:30 +08:00
hiyouga 44a43ee152 add olmo 1.7 2024-04-24 05:50:50 +08:00
hiyouga 07737a3d2d reenable sdpa and fast tok by default 2024-04-24 02:18:44 +08:00
hiyouga 1a13f05555 support phi-3 2024-04-24 00:28:53 +08:00
hiyouga db7f3b9784 update readme 2024-04-22 17:09:17 +08:00
hiyouga 836ca05586 update readme 2024-04-22 00:51:35 +08:00
hiyouga 34d66a3a85 update readme 2024-04-22 00:42:25 +08:00
hiyouga a1f1fac33b update readme and examples 2024-04-22 00:37:32 +08:00
hiyouga a83e7587a0 update readme 2024-04-22 00:21:01 +08:00
hiyouga f58425ab45 fix mod stuff 2024-04-21 18:11:10 +08:00
Marco 620add7b9f Added Mixture of Depths 2024-04-18 20:31:24 +02:00
hoshi-hiyouga 2aaaede247 support llama3 2024-04-19 01:13:50 +08:00
hiyouga 942362d008 fix #3324 2024-04-18 15:34:45 +08:00
hiyouga 3b43a3b7c5 tiny fix 2024-04-18 00:22:17 +08:00
hiyouga e2f1c6fc6a update readme 2024-04-17 23:40:49 +08:00
hiyouga cab0598fd0 add mixtral 8x22B models 2024-04-17 23:35:59 +08:00
hiyouga 5d62a51c12 update readme and gradio version 2024-04-16 18:09:16 +08:00
hiyouga e3d8fc75eb support badam for all stages 2024-04-16 17:44:48 +08:00
hiyouga cf52911fed update readme 2024-04-16 02:36:54 +08:00
hiyouga 6084eb7cf1 update readme 2024-04-16 02:35:36 +08:00
hiyouga 6543f3d449 add codegemma 2024-04-16 00:11:15 +08:00
hiyouga e0dbac2845 support cohere commandR #3184 2024-04-15 23:26:42 +08:00
hiyouga 9d4c949461 release v0.6.2 2024-04-11 20:08:51 +08:00
hiyouga a88fe8c1af update readme 2024-04-07 00:48:24 +08:00
hiyouga 7f6e412604 fix requires for windows 2024-04-03 21:56:43 +08:00
hiyouga 49a2dfaf90 update vllm example 2024-04-02 22:45:20 +08:00
hiyouga 66b0fe4e96 update readme 2024-04-02 22:17:48 +08:00
hiyouga 7765f337c7 add zh readme 2024-04-02 20:58:45 +08:00
hiyouga 11a6c1bad6 update readme 2024-04-02 20:37:37 +08:00
hiyouga 949e5fe638 update readme 2024-04-02 20:22:11 +08:00
hiyouga 92dab8a90b simplify readme 2024-04-02 20:07:43 +08:00
hiyouga 54b7d34908 add qwen1.5 moe 2024-04-01 21:49:40 +08:00
hiyouga aee634cd20 fix #3077 2024-04-01 21:35:18 +08:00
hiyouga 099db6acc0 update readme 2024-03-31 18:46:34 +08:00
hiyouga 17bf8a2c3a support ORPO 2024-03-31 18:29:50 +08:00
hiyouga c1fe6ce782 update readme 2024-03-28 22:02:32 +08:00
hiyouga 1e43319f9c add project 2024-03-28 20:24:27 +08:00