hiyouga
|
4c45a3a884
|
release v0.1.2
|
2023-07-20 22:33:59 +08:00 |
hiyouga
|
8333795fcb
|
fix api
|
2023-07-20 22:14:54 +08:00 |
hoshi-hiyouga
|
6022adb72e
|
Merge pull request #213 from Ehco1996/patch-1
feat: support pass args before init web app
|
2023-07-20 22:12:07 +08:00 |
hiyouga
|
4d1641c1bf
|
update UI, fix #212
|
2023-07-20 22:09:06 +08:00 |
Ehco
|
b2874a72e4
|
feat: support pass args before init web app
as title
|
2023-07-20 21:49:26 +08:00 |
hiyouga
|
67a2773074
|
simplify code
|
2023-07-20 15:08:57 +08:00 |
hiyouga
|
d1d8e8bae1
|
tiny fix
|
2023-07-19 22:53:46 +08:00 |
hiyouga
|
d111e658a2
|
fix #199
|
2023-07-19 22:51:29 +08:00 |
hiyouga
|
7159bc54ed
|
add datasets
|
2023-07-19 20:59:15 +08:00 |
hiyouga
|
925a790bc9
|
fix #196
|
2023-07-19 17:35:38 +08:00 |
hiyouga
|
8f7819fcaa
|
fix #194
|
2023-07-19 17:07:33 +08:00 |
hiyouga
|
7a3ade8c69
|
support LLaMA-2
|
2023-07-19 16:42:14 +08:00 |
hiyouga
|
38eb1aaf55
|
add LLaMA2 template
|
2023-07-19 00:44:49 +08:00 |
hiyouga
|
29af67b015
|
fix API
|
2023-07-19 00:01:14 +08:00 |
hiyouga
|
fe2887ca13
|
support dev set in web ui
|
2023-07-18 20:40:49 +08:00 |
hiyouga
|
b447fa85aa
|
add web demo
|
2023-07-18 17:21:16 +08:00 |
hiyouga
|
bdf91846da
|
update baichuan template
|
2023-07-18 16:43:51 +08:00 |
hiyouga
|
d1ae428c6e
|
fix template
|
2023-07-18 16:37:23 +08:00 |
hiyouga
|
cadeac0f44
|
fix #176
|
2023-07-18 16:36:24 +08:00 |
hiyouga
|
6f9360c0bd
|
fix webUI, fix #171 #177
|
2023-07-18 15:51:48 +08:00 |
hiyouga
|
12d8a8633f
|
update webUI, fix #179
|
2023-07-18 15:35:17 +08:00 |
hiyouga
|
b9fe83fb75
|
tiny fix
|
2023-07-18 00:52:31 +08:00 |
hiyouga
|
262252d67b
|
a monkey patch for lora_target
|
2023-07-18 00:31:40 +08:00 |
hiyouga
|
f8193e8009
|
release v0.1.0
|
2023-07-18 00:18:25 +08:00 |
hiyouga
|
85c2210452
|
fix #175
|
2023-07-17 18:07:17 +08:00 |
hiyouga
|
1e1358431d
|
fix saving custom code
|
2023-07-16 18:04:41 +08:00 |
hiyouga
|
552d773dad
|
fix callback
|
2023-07-15 22:01:43 +08:00 |
hiyouga
|
8528a84e74
|
update stream_chat
|
2023-07-15 19:51:02 +08:00 |
hiyouga
|
657cf0f55a
|
create chat model
|
2023-07-15 19:26:20 +08:00 |
hiyouga
|
d640c5545f
|
Update callbacks.py
|
2023-07-15 17:39:16 +08:00 |
hiyouga
|
22d9a9c2af
|
fix callback
|
2023-07-15 17:18:16 +08:00 |
hiyouga
|
f751376613
|
modity code structure
|
2023-07-15 16:54:28 +08:00 |
hiyouga
|
c30db9f1f0
|
fix eval and pred loss
|
2023-07-14 13:11:57 +08:00 |
hiyouga
|
a04115ec27
|
fix pretrain
|
2023-07-13 23:41:54 +08:00 |
hiyouga
|
08439d29b2
|
fix Baichuan-13B
|
2023-07-13 23:08:45 +08:00 |
Jinghuan Shang
|
dc1e8b7181
|
Fix typo in common.py
lastest -> latest
|
2023-07-11 18:03:53 -04:00 |
hiyouga
|
b2f7cb4465
|
fix sft encode
|
2023-07-11 19:50:33 +08:00 |
hiyouga
|
1af031c02b
|
add baichuan template
|
2023-07-11 18:57:50 +08:00 |
hiyouga
|
84a06318d4
|
update api to match langchain
|
2023-07-07 20:35:39 +08:00 |
hiyouga
|
a2f507c562
|
support InternLM
|
2023-07-07 11:02:28 +08:00 |
hiyouga
|
caa00d3ac2
|
fix rouge score
|
2023-07-06 14:28:34 +08:00 |
hiyouga
|
4abd2485e1
|
fix streaming response in API
|
2023-07-05 22:42:31 +08:00 |
hiyouga
|
e6603977f6
|
fix freeze tuning
|
2023-07-05 21:18:28 +08:00 |
hiyouga
|
a2ba69183b
|
fix bug in PPO stage
|
2023-07-05 19:14:10 +08:00 |
hiyouga
|
8e3540c62d
|
fix compute dtype
|
2023-07-05 15:13:00 +08:00 |
hiyouga
|
c136f362c1
|
support falcon model #72
|
2023-07-05 15:00:06 +08:00 |
hiyouga
|
cac87fd553
|
fix bleu score
|
2023-07-05 00:11:21 +08:00 |
hiyouga
|
395ed1cf1b
|
set use_cache before saving model
|
2023-07-04 23:18:20 +08:00 |
hiyouga
|
65e9ce2cdd
|
fix seq2seq predictions
|
2023-07-04 22:56:51 +08:00 |
hiyouga
|
92fa515e97
|
fix typo
|
2023-06-30 10:09:59 +08:00 |
hiyouga
|
4d0fddba21
|
update loading logic
|
2023-06-28 12:07:16 +08:00 |
hiyouga
|
0a46313cca
|
fix loading best model
|
2023-06-28 01:55:12 +08:00 |
hiyouga
|
7826a8ca77
|
fix RM accuracy
|
2023-06-28 01:40:13 +08:00 |
hiyouga
|
450910c1db
|
tiny fix
|
2023-06-27 23:54:24 +08:00 |
hiyouga
|
18f87c1b25
|
fix initializing data arguments
|
2023-06-27 22:50:23 +08:00 |
hiyouga
|
2e01abfda5
|
support save full model, replace BOS token
|
2023-06-27 21:40:11 +08:00 |
hiyouga
|
1c732e2537
|
fix decoding in seq2seq
|
2023-06-27 19:33:08 +08:00 |
hiyouga
|
1175948029
|
fix generation in seq2seq.py
|
2023-06-26 18:07:06 +08:00 |
hiyouga
|
cec9760eb8
|
support prefixes, loading multiple local files
|
2023-06-26 15:32:40 +08:00 |
hiyouga
|
f030b09924
|
update api
|
2023-06-26 13:39:57 +08:00 |
hiyouga
|
0697643358
|
update readme
|
2023-06-23 00:17:05 +08:00 |
hiyouga
|
614d3a996c
|
update API
|
2023-06-22 20:46:24 +08:00 |
hiyouga
|
76ecb8c222
|
match api with OpenAI format
|
2023-06-22 20:27:00 +08:00 |
Bun
|
6e4db0903f
|
Compatible with OpenAI API.
|
2023-06-21 14:45:04 +08:00 |
hiyouga
|
f621f7631a
|
add default template
|
2023-06-16 21:12:17 +08:00 |
hiyouga
|
a6c4b141cd
|
fix freeze layers
|
2023-06-16 17:38:21 +08:00 |
hiyouga
|
fc4d8155b3
|
add source prefix
|
2023-06-16 16:32:17 +08:00 |
hiyouga
|
0574b590ef
|
support loading lora from hub
|
2023-06-16 00:02:17 +08:00 |
hiyouga
|
0cee6ad67f
|
support baichuan model
|
2023-06-15 16:02:01 +08:00 |
hiyouga
|
c527399424
|
fix bug in template vanilla
|
2023-06-15 14:36:55 +08:00 |
hiyouga
|
d668f8b501
|
add BOS token in pre-training
|
2023-06-15 01:46:17 +08:00 |
hiyouga
|
b6faf0207d
|
support multiturn training like FastChat
|
2023-06-14 22:27:39 +08:00 |
hiyouga
|
875e8e2349
|
fix loading valuehead
|
2023-06-13 11:13:06 +08:00 |
hiyouga
|
531a3764d9
|
fix generating args
|
2023-06-13 01:33:56 +08:00 |
hiyouga
|
cec6524d6b
|
support RM metrics, add generating Args
|
2023-06-12 15:48:48 +08:00 |
BUAADreamer
|
a2af9df5a9
|
add code for reading from multi files in one directory
|
2023-06-10 16:27:30 +08:00 |
BUAADreamer
|
3dd5f9a874
|
add code for reading from multi files in one directory
|
2023-06-10 15:53:47 +08:00 |
hiyouga
|
2ba5d69c7f
|
tiny fix
|
2023-06-07 16:42:31 +08:00 |
hiyouga
|
16c2860d56
|
tiny fix
|
2023-06-07 16:02:07 +08:00 |
hiyouga
|
edafb97733
|
tiny fix
|
2023-06-07 12:58:14 +08:00 |
hiyouga
|
3875b19a34
|
add templates
|
2023-06-07 12:40:44 +08:00 |
hiyouga
|
17acf3a3eb
|
add belle template
|
2023-06-07 12:30:11 +08:00 |
hiyouga
|
ce43386080
|
tiny fix
|
2023-06-07 12:08:39 +08:00 |
hiyouga
|
909af8f496
|
add prompt template class
|
2023-06-07 11:55:25 +08:00 |
hiyouga
|
5d021d4ad5
|
fix inference, add prompt template
|
2023-06-07 10:52:35 +08:00 |
hiyouga
|
13d1f0709c
|
recover logging
|
2023-06-06 21:36:37 +08:00 |
hiyouga
|
4eb17bcf6c
|
support distributed quantized training
|
2023-06-06 17:39:41 +08:00 |
hiyouga
|
3d8d5ee5d5
|
add API demo from #1
|
2023-06-05 21:32:18 +08:00 |
hoshi-hiyouga
|
06e1b120e1
|
Merge pull request #11 from hiyouga/api
Api
|
2023-06-05 20:58:02 +08:00 |
hiyouga
|
a38d57ddd7
|
fix bug in web demo
|
2023-06-05 17:58:29 +08:00 |
hiyouga
|
56eb99106a
|
increase max length in cli demo
|
2023-06-05 16:49:14 +08:00 |
hiyouga
|
fe1d930816
|
implement stream generating
|
2023-06-05 16:43:44 +08:00 |
hiyouga
|
44298c1235
|
tiny fix
|
2023-06-05 15:25:22 +08:00 |
hiyouga
|
38b83533a4
|
tiny fix
|
2023-06-04 16:35:50 +08:00 |
hiyouga
|
eac9921e5c
|
tiny fix
|
2023-06-04 12:55:40 +08:00 |
hiyouga
|
3b9eee8cd2
|
support QLoRA
|
2023-06-04 00:08:56 +08:00 |
hiyouga
|
1bd13d7ca1
|
fix int8 inference
|
2023-06-03 23:22:05 +08:00 |
hiyouga
|
926291940d
|
reduce repetition penalty
|
2023-06-03 21:57:39 +08:00 |
hiyouga
|
0f69a0c19e
|
fix int8 inference
|
2023-06-03 21:17:47 +08:00 |
hiyouga
|
de09ee1315
|
add ziya prompt template
|
2023-06-03 19:05:51 +08:00 |
hiyouga
|
771f454ff1
|
use low_cpu_mem_usage to speed up loading
|
2023-06-03 18:19:01 +08:00 |
hiyouga
|
dca27b4412
|
add logits processor
|
2023-06-03 16:34:54 +08:00 |
hiyouga
|
ed6161fa6a
|
remove unused code
|
2023-06-03 00:10:54 +08:00 |
hiyouga
|
b8a034807e
|
tiny fix
|
2023-06-02 19:02:25 +08:00 |
hiyouga
|
e3aaef7d4a
|
fix layer norm name in PPO
|
2023-06-02 17:30:01 +08:00 |
hiyouga
|
bd565af370
|
fix #1
|
2023-06-02 14:25:00 +08:00 |
hiyouga
|
50d9a20f81
|
alter rewards data type
|
2023-06-02 14:19:51 +08:00 |
hiyouga
|
e6126244c1
|
fix possibly OOM error
|
2023-06-01 23:54:44 +08:00 |
hiyouga
|
fd709eacff
|
fix bug at inference
|
2023-05-31 18:11:53 +08:00 |
hiyouga
|
740a5daf56
|
support BLOOM models
|
2023-05-31 16:54:06 +08:00 |
hoshi-hiyouga
|
c36620ece4
|
Merge pull request #1 from mMrBun/main
Support conversation via API.
|
2023-05-30 16:34:00 +08:00 |
hiyouga
|
a72492e649
|
remove dummy code
|
2023-05-30 16:28:00 +08:00 |
mMrBun
|
748b804bac
|
Support conversation via API.
|
2023-05-30 15:00:28 +08:00 |
mMrBun
|
e821682430
|
Support conversation via API.
|
2023-05-30 14:46:22 +08:00 |
hiyouga
|
8ff96509fa
|
add pre-training script
|
2023-05-29 21:37:22 +08:00 |
hiyouga
|
c0e5df92d6
|
fix checkpoint loading
|
2023-05-29 17:43:16 +08:00 |
hiyouga
|
ce71cc8b6d
|
tiny fix
|
2023-05-29 09:42:29 +08:00 |
hiyouga
|
166c837b95
|
tiny fix
|
2023-05-28 21:48:33 +08:00 |
hiyouga
|
0c9fda01e3
|
use fp16 model, add logcallback
|
2023-05-28 21:30:28 +08:00 |
hiyouga
|
769c6ab56b
|
Initial commit
|
2023-05-28 18:09:04 +08:00 |