Commit Graph

379 Commits

Author SHA1 Message Date
hiyouga 92e9195b3c update readme 2024-05-07 21:17:31 +08:00
hiyouga 5177f3ba90 update readme 2024-05-07 19:03:47 +08:00
Katehuuh 984f7fbbf7
Update README.md
Add Projects Nekochu/Luminia-13B-v3
2024-05-07 06:23:36 +02:00
hiyouga 8e09e20ece update readme 2024-05-07 06:19:29 +08:00
hiyouga f50c365871 update readme 2024-05-06 23:34:59 +08:00
hiyouga f02f87c6fb update example docs 2024-05-06 22:51:02 +08:00
hiyouga 34d33e2257 update docs 2024-05-06 21:47:00 +08:00
hiyouga 57a39783d1 update readme 2024-05-04 17:01:21 +08:00
hiyouga d4283bb6bf update readme 2024-05-04 00:43:53 +08:00
hiyouga 9d2ce57345 update readme and webui launch 2024-05-04 00:43:02 +08:00
hiyouga 1409654cef update readme 2024-05-04 00:31:02 +08:00
hiyouga 245fe47ece update webui and add CLIs 2024-05-03 02:58:23 +08:00
hiyouga 32347901d4 fix setup 2024-04-28 03:49:13 +08:00
hiyouga 5ee04d418c update readme 2024-04-26 23:39:19 +08:00
hiyouga 031775ade8 update readme 2024-04-26 20:09:14 +08:00
hiyouga 375b25131b support Qwen1.5 110B 2024-04-26 19:59:22 +08:00
hiyouga e83e2fa897 update readme 2024-04-26 05:49:26 +08:00
hiyouga 27ba1b63ce update readme 2024-04-26 05:44:30 +08:00
hiyouga 44a43ee152 add olmo 1.7 2024-04-24 05:50:50 +08:00
hiyouga 07737a3d2d reenable sdpa and fast tok by default 2024-04-24 02:18:44 +08:00
hiyouga 1a13f05555 support phi-3 2024-04-24 00:28:53 +08:00
hiyouga db7f3b9784 update readme 2024-04-22 17:09:17 +08:00
hiyouga 836ca05586 update readme 2024-04-22 00:51:35 +08:00
hiyouga 34d66a3a85 update readme 2024-04-22 00:42:25 +08:00
hiyouga a1f1fac33b update readme and examples 2024-04-22 00:37:32 +08:00
hiyouga a83e7587a0 update readme 2024-04-22 00:21:01 +08:00
hiyouga f58425ab45 fix mod stuff 2024-04-21 18:11:10 +08:00
Marco 620add7b9f Added Mixture of Depths 2024-04-18 20:31:24 +02:00
hoshi-hiyouga 2aaaede247 support llama3 2024-04-19 01:13:50 +08:00
hiyouga 942362d008 fix #3324 2024-04-18 15:34:45 +08:00
hiyouga 3b43a3b7c5 tiny fix 2024-04-18 00:22:17 +08:00
hiyouga e2f1c6fc6a update readme 2024-04-17 23:40:49 +08:00
hiyouga cab0598fd0 add mixtral 8x22B models 2024-04-17 23:35:59 +08:00
hiyouga 5d62a51c12 update readme and gradio version 2024-04-16 18:09:16 +08:00
hiyouga e3d8fc75eb support badam for all stages 2024-04-16 17:44:48 +08:00
hiyouga cf52911fed update readme 2024-04-16 02:36:54 +08:00
hiyouga 6084eb7cf1 update readme 2024-04-16 02:35:36 +08:00
hiyouga 6543f3d449 add codegemma 2024-04-16 00:11:15 +08:00
hiyouga e0dbac2845 support cohere commandR #3184 2024-04-15 23:26:42 +08:00
hiyouga 9d4c949461 release v0.6.2 2024-04-11 20:08:51 +08:00
hiyouga a88fe8c1af update readme 2024-04-07 00:48:24 +08:00
hiyouga 7f6e412604 fix requires for windows 2024-04-03 21:56:43 +08:00
hiyouga 49a2dfaf90 update vllm example 2024-04-02 22:45:20 +08:00
hiyouga 66b0fe4e96 update readme 2024-04-02 22:17:48 +08:00
hiyouga 7765f337c7 add zh readme 2024-04-02 20:58:45 +08:00
hiyouga 11a6c1bad6 update readme 2024-04-02 20:37:37 +08:00
hiyouga 949e5fe638 update readme 2024-04-02 20:22:11 +08:00
hiyouga 92dab8a90b simplify readme 2024-04-02 20:07:43 +08:00
hiyouga 54b7d34908 add qwen1.5 moe 2024-04-01 21:49:40 +08:00
hiyouga aee634cd20 fix #3077 2024-04-01 21:35:18 +08:00
hiyouga 099db6acc0 update readme 2024-03-31 18:46:34 +08:00
hiyouga 17bf8a2c3a support ORPO 2024-03-31 18:29:50 +08:00
hiyouga c1fe6ce782 update readme 2024-03-28 22:02:32 +08:00
hiyouga 1e43319f9c add project 2024-03-28 20:24:27 +08:00
hiyouga 6c94305e47 update readme 2024-03-28 18:35:11 +08:00
hiyouga 8c77b10912 update trainers 2024-03-28 18:16:27 +08:00
hiyouga 7b3d8188f5 update readme 2024-03-25 23:06:13 +08:00
hoshi-hiyouga f633ac6646
Merge pull request #2967 from Tsumugii24/main
Update README_zh.md
2024-03-25 23:02:22 +08:00
Tsumugii24 1704599503 Update README.md 2024-03-25 22:54:38 +08:00
hiyouga 6f2b563f12 release v0.6.0 2024-03-25 22:38:56 +08:00
hiyouga a1c8c98c5f fix #2941 2024-03-24 00:28:44 +08:00
0xez 675ba41562
Update README.md, fix the release date of the paper 2024-03-21 22:14:48 +08:00
hiyouga 5eaa50fa01 add citation 2024-03-21 17:04:10 +08:00
hiyouga 0581bfdbc7 paper release 2024-03-21 13:49:17 +08:00
hiyouga bfe7a91289 update readme 2024-03-21 00:48:42 +08:00
hiyouga 8408225162 support fsdp + qlora 2024-03-21 00:36:06 +08:00
hiyouga 9bec3c98a2 fix #2777 #2895 2024-03-20 17:59:45 +08:00
khazic 0531dac30d Updated README with new information 2024-03-20 14:21:16 +08:00
刘一博 df9b4fb90a Updated README with new information 2024-03-20 14:11:28 +08:00
hiyouga 72367307df improve lora+ impl. 2024-03-13 23:32:51 +08:00
hiyouga b3247d6a16 support olmo 2024-03-12 18:30:38 +08:00
hoshi-hiyouga c901aa63ff
Merge pull request #2743 from S3Studio/DockerizeSupport
Add dockerize support
2024-03-12 00:05:49 +08:00
hiyouga 8664262cde support layerwise galore 2024-03-10 00:24:11 +08:00
hiyouga 818726e9bc add GaLore results 2024-03-09 04:11:55 +08:00
hiyouga 393c2de27c update hardware requirements 2024-03-09 03:58:18 +08:00
hiyouga 10be2f0ecc fix aqlm version 2024-03-09 00:09:09 +08:00
S3Studio 3d911ae713 Add dockerize support
Already tested with the model of Qwen:1.8B and the dataset of alpaca_data_zh. Some python libraries are added to the Dockerfile as a result of the exception messages displayed throughout test procedure.
2024-03-08 10:47:28 +08:00
hiyouga 4a2cc60b94 update readme 2024-03-08 03:06:21 +08:00
hiyouga 33a4c24a8a fix galore 2024-03-08 00:44:51 +08:00
hiyouga 57452a4aa1 add Yi-9B model 2024-03-07 23:11:57 +08:00
hiyouga 7230e1177d add galore examples 2024-03-07 22:53:45 +08:00
hiyouga 28f7862188 support galore 2024-03-07 22:41:36 +08:00
hiyouga 725f7cd70f update readme 2024-03-07 20:34:49 +08:00
hiyouga 77211d9843 tiny fix 2024-03-07 20:29:34 +08:00
hiyouga d07ad5cc1c support vllm 2024-03-07 20:26:31 +08:00
hiyouga 0048a2021e tiny fix 2024-03-06 17:25:08 +08:00
hiyouga 9658c63cd9 fix add tokens 2024-03-06 15:04:02 +08:00
hiyouga 3016e65657 fix version checking 2024-03-06 14:51:51 +08:00
hiyouga df9e6bb063 update readme 2024-03-05 03:20:23 +08:00
hiyouga 24a79bd50f update readme 2024-03-04 19:29:26 +08:00
hiyouga 7c227e07dd update readme 2024-03-03 01:41:07 +08:00
hiyouga 894d183214 update readme, add starcoder2, cosmopedia 2024-03-03 01:01:46 +08:00
hoshi-hiyouga 4bf7eb72e0
Update README.md 2024-03-03 00:48:47 +08:00
hoshi-hiyouga 585c884ea9
Update README.md 2024-03-03 00:48:06 +08:00
hiyouga 318315c76d add colab demo 2024-03-02 19:58:21 +08:00
hiyouga bb16502c33 add twitter 2024-02-29 17:45:30 +08:00
hiyouga fa5ab21ebc release v0.5.3 2024-02-29 00:34:19 +08:00
hiyouga 804c1e7083 add examples 2024-02-28 23:19:25 +08:00
hiyouga 38d8b2cef8 update chatglm3 template 2024-02-28 21:11:23 +08:00
hiyouga a2dccce06a update readme 2024-02-28 20:50:01 +08:00
hiyouga cfefacaa37 support DoRA, AWQ, AQLM #2512 2024-02-28 19:53:28 +08:00
hiyouga 3ba1054593 update readme 2024-02-26 17:25:47 +08:00
hiyouga 261f631a1c update readme 2024-02-25 16:26:08 +08:00
hiyouga aca948da8f add papers 2024-02-25 15:34:47 +08:00
hiyouga ad76482cf9 add papers 2024-02-25 15:18:58 +08:00
hiyouga c99e19641a support gemma 2024-02-21 23:27:36 +08:00
hiyouga daa3185350 tiny fix 2024-02-21 18:30:29 +08:00
hoshi-hiyouga 869fd208a8
Update README.md 2024-02-20 16:07:55 +08:00
codemayq d47e40633a 1. update the version of pre-built bitsandbytes library
2. add pre-built flash-attn library
2024-02-20 11:28:25 +08:00
codemayq 95f53a46bd 1. update the version of pre-built bitsandbytes library
2. add pre-built flash-attn library
2024-02-20 11:26:22 +08:00
hiyouga 7924ffc55d support llama pro #2338 , add rslora 2024-02-15 02:27:36 +08:00
hiyouga 7d2dc83c5e improve aligner 2024-02-10 16:39:19 +08:00
hiyouga 54ea9684ed improve fix tokenizer 2024-02-09 14:53:14 +08:00
hoshi-hiyouga d0daaa01f9
Merge pull request #2423 from mayflower/main
Support for german sft and dpo
2024-02-07 15:58:20 +08:00
hiyouga ccabb5b04a support qwen1.5 2024-02-06 00:10:51 +08:00
Johann-Peter Hartmann d9a8301ed4 Add support for german datasets 2024-01-30 10:18:01 +01:00
hiyouga a0d59aa4ec release v0.5.0 (real) 2024-01-21 01:54:49 +08:00
hiyouga 5608a0da8e update readme 2024-01-18 14:30:48 +08:00
hiyouga 5a207bb723 tiny fix 2024-01-15 23:34:23 +08:00
Junu Moon(Fran) 7a320de097
fix: typo on README.md 2024-01-15 19:50:35 +09:00
hiyouga ca3933dc52 support deepseek moe 2024-01-14 00:14:49 +08:00
hiyouga d1a73fe26c fix phi modules 2024-01-13 23:12:47 +08:00
JessyTsu1 8c5e4a8896
Update README.md 2024-01-11 23:18:29 +08:00
JessyTsu1 d72aff5ae6
Update README.md 2024-01-11 23:17:00 +08:00
hiyouga 4571068e1e fix #1789 2024-01-09 18:31:27 +08:00
hiyouga c7ea17d616 add yuan model 2023-12-29 13:50:24 +08:00
hiyouga 65c5b0477c fix args 2023-12-28 18:47:19 +08:00
hiyouga 5b93d545e2 tiny update 2023-12-25 18:29:34 +08:00
hiyouga e44b82ee24 update patcher 2023-12-23 15:24:27 +08:00
hiyouga 0ad86a4f62 update readme 2023-12-23 02:17:41 +08:00
hiyouga 7aad0b889d support unsloth 2023-12-23 00:14:33 +08:00
hiyouga edb7d177c2 update readme 2023-12-18 22:29:45 +08:00
hiyouga 2b4e5f0d32 update readme 2023-12-18 15:46:45 +08:00
hiyouga 71389be37c support autogptq in llama board #246 2023-12-16 16:31:30 +08:00
hiyouga 3524aa1e58 support quantization in export model 2023-12-15 23:44:50 +08:00
hiyouga 87ef3f47b5 update dc link 2023-12-15 22:11:31 +08:00
hiyouga 0716f5e470 refactor adapter hparam 2023-12-15 20:53:11 +08:00
hiyouga 3a8a50d4d4 remove loftq 2023-12-13 01:53:46 +08:00
hiyouga 28cc07868c update readme 2023-12-12 23:30:29 +08:00
hiyouga 6219dfbd93 support loftq 2023-12-12 22:47:06 +08:00
hiyouga 0a9c6e0146 support system column #1765 2023-12-12 19:45:59 +08:00
hiyouga 8cace77808 update readme 2023-12-12 11:44:30 +08:00
hiyouga 96380f5e18 support mixtral 2023-12-12 11:39:04 +08:00
hiyouga 997b65f291 update readme 2023-12-04 11:22:01 +08:00
hiyouga 8ede3128df update readme 2023-12-04 11:02:29 +08:00
hiyouga 5b78e269b6 add logo 2023-12-02 01:31:24 +08:00
hiyouga 0cb260f453 update readme 2023-12-01 22:58:29 +08:00
hiyouga bd42c229b0 patch modelscope 2023-12-01 22:53:15 +08:00
hoshi-hiyouga 00f5c9ee16
Merge branch 'main' into feat/support_ms 2023-12-01 20:23:46 +08:00
yuze.zyz 5aa6751e52 add readme 2023-12-01 16:11:30 +08:00