LLaMA-Factory-310P3

Commit Graph

4abd2485e1 fix streaming response in API hiyouga 2023-07-05 22:42:31 +0800
e6603977f6 fix freeze tuning hiyouga 2023-07-05 21:18:28 +0800
a2ba69183b fix bug in PPO stage hiyouga 2023-07-05 19:14:10 +0800
8e3540c62d fix compute dtype hiyouga 2023-07-05 15:13:00 +0800
c136f362c1 support falcon model #72 hiyouga 2023-07-05 15:00:06 +0800
966b5c70fc Update wechat.jpg hiyouga 2023-07-05 00:22:22 +0800
cac87fd553 fix bleu score hiyouga 2023-07-05 00:11:21 +0800
395ed1cf1b set use_cache before saving model hiyouga 2023-07-04 23:18:20 +0800
65e9ce2cdd fix seq2seq predictions hiyouga 2023-07-04 22:56:51 +0800
cb26f78923 Merge pull request #119 from codemayq/main hoshi-hiyouga 2023-07-03 19:51:46 +0800
d3b30ecde3 add the pre-built version of bitsandbytes library for windows user codemayq 2023-07-03 13:58:10 +0800
0db9d29111 Update auto_gptq.py hiyouga 2023-07-02 20:56:11 +0800
cf6d57fd3e add autogptq hiyouga 2023-07-02 20:36:37 +0800
b8e1f09a2e Update wechat.jpg hiyouga 2023-06-30 15:45:20 +0800
92fa515e97 fix typo hiyouga 2023-06-30 10:09:59 +0800
021b035c1e Update README.md hiyouga 2023-06-29 19:36:22 +0800
f14bd729a8 rename evaluate.py hiyouga 2023-06-29 15:40:39 +0800
23a7266272 Update evaluate.py hiyouga 2023-06-29 15:40:03 +0800
70592035b8 Update README.md hiyouga 2023-06-29 15:37:19 +0800
3154fec979 add open assistant dataset hiyouga 2023-06-28 23:09:33 +0800
4d0fddba21 update loading logic hiyouga 2023-06-28 12:07:16 +0800
0a46313cca fix loading best model hiyouga 2023-06-28 01:55:12 +0800
7826a8ca77 fix RM accuracy hiyouga 2023-06-28 01:40:13 +0800
9cb1af71f3 add star history hiyouga 2023-06-27 23:56:29 +0800
450910c1db tiny fix hiyouga 2023-06-27 23:54:24 +0800
18f87c1b25 fix initializing data arguments hiyouga 2023-06-27 22:50:23 +0800
2e01abfda5 support save full model, replace BOS token hiyouga 2023-06-27 21:40:11 +0800
1c732e2537 fix decoding in seq2seq hiyouga 2023-06-27 19:33:08 +0800
33f2141507 Update wechat.jpg hiyouga 2023-06-27 16:41:09 +0800
4f3772b342 Update evaluate.py hiyouga 2023-06-26 23:41:33 +0800
5a0a9daf74 Create evaluate.py hiyouga 2023-06-26 23:30:18 +0800
907e065454 Merge pull request #86 from Jingsong-Yan/main hoshi-hiyouga 2023-06-26 20:14:40 +0800
90bb5b6f37 Update README.md with baichuan-7b-rtx3090 Jingsong-Yan 2023-06-26 19:45:41 +0800
993cabdd4c Merge branch 'main' of https://github.com/hiyouga/LLaMA-Efficient-Tuning hiyouga 2023-06-26 18:07:09 +0800
1175948029 fix generation in seq2seq.py hiyouga 2023-06-26 18:07:06 +0800
95b057f5af Merge pull request #84 from wu-yy/patch-1 hoshi-hiyouga 2023-06-26 15:39:08 +0800
e2a16d549e Update requirements.txt 蓝鲸123 2023-06-26 15:36:19 +0800
cec9760eb8 support prefixes, loading multiple local files hiyouga 2023-06-26 15:32:40 +0800
785fdc4b3a Support baichuan-7B training on RTX3090 yixue 2023-06-26 06:33:27 +0000
f030b09924 update api hiyouga 2023-06-26 13:39:57 +0800
d21cc71750 Update wechat.jpg hiyouga 2023-06-25 23:41:11 +0800
0697643358 update readme hiyouga 2023-06-23 00:17:05 +0800
614d3a996c update API hiyouga 2023-06-22 20:46:24 +0800
76ecb8c222 match api with OpenAI format hiyouga 2023-06-22 20:27:00 +0800
9324940b76 Merge pull request #68 from mMrBun/main hoshi-hiyouga 2023-06-22 15:52:34 +0800
6e4db0903f Compatible with OpenAI API. Bun 2023-06-21 14:45:04 +0800
ded5aa3c3d Update wechat.jpg hiyouga 2023-06-19 19:46:04 +0800
2f6c64b839 Update README.md Ikko Eltociear Ashimine 2023-06-17 21:49:42 +0900
f621f7631a add default template hiyouga 2023-06-16 21:12:17 +0800
334d1a6d26 add belle multiturn dataset hiyouga 2023-06-16 20:01:16 +0800
a6c4b141cd fix freeze layers hiyouga 2023-06-16 17:38:21 +0800
fc4d8155b3 add source prefix hiyouga 2023-06-16 16:32:17 +0800
0574b590ef support loading lora from hub hiyouga 2023-06-16 00:02:17 +0800
0cee6ad67f support baichuan model hiyouga 2023-06-15 16:02:01 +0800
c527399424 fix bug in template vanilla hiyouga 2023-06-15 14:36:55 +0800
0a36658bb6 Update wechat.jpg hiyouga 2023-06-15 13:48:53 +0800
d668f8b501 add BOS token in pre-training hiyouga 2023-06-15 01:46:17 +0800
b6faf0207d support multiturn training like FastChat hiyouga 2023-06-14 22:27:39 +0800
7e4170f2f5 add server.py to support stream generator api BUAADreamer 2023-06-14 08:59:04 +0800
875e8e2349 fix loading valuehead hiyouga 2023-06-13 11:13:06 +0800
531a3764d9 fix generating args hiyouga 2023-06-13 01:33:56 +0800
cec6524d6b support RM metrics, add generating Args hiyouga 2023-06-12 15:48:48 +0800
e3f380c1be Merge pull request #26 from BUAADreamer/main hoshi-hiyouga 2023-06-11 19:06:29 +0800
e3b53a67c7 update json line file to .jsonl BUAADreamer 2023-06-11 18:59:19 +0800
676d910260 add some BUAADreamer 2023-06-11 18:55:53 +0800
a2af9df5a9 add code for reading from multi files in one directory BUAADreamer 2023-06-10 16:27:30 +0800
3dd5f9a874 add code for reading from multi files in one directory BUAADreamer 2023-06-10 15:53:47 +0800
6c46a25ff4 add missing package 少年 2023-06-08 19:34:07 +0800
2ba5d69c7f tiny fix hiyouga 2023-06-07 16:42:31 +0800
16c2860d56 tiny fix hiyouga 2023-06-07 16:02:07 +0800
edafb97733 tiny fix hiyouga 2023-06-07 12:58:14 +0800
3875b19a34 add templates hiyouga 2023-06-07 12:40:44 +0800
17acf3a3eb add belle template hiyouga 2023-06-07 12:30:11 +0800
ce43386080 tiny fix hiyouga 2023-06-07 12:08:39 +0800
909af8f496 add prompt template class hiyouga 2023-06-07 11:55:25 +0800
5d021d4ad5 fix inference, add prompt template hiyouga 2023-06-07 10:52:35 +0800
13d1f0709c recover logging hiyouga 2023-06-06 21:36:37 +0800
4eb17bcf6c support distributed quantized training hiyouga 2023-06-06 17:39:41 +0800
3d8d5ee5d5 add API demo from #1 hiyouga 2023-06-05 21:32:18 +0800
06e1b120e1 Merge pull request #11 from hiyouga/api hoshi-hiyouga 2023-06-05 20:58:02 +0800
a38d57ddd7 fix bug in web demo hiyouga 2023-06-05 17:58:29 +0800
56eb99106a increase max length in cli demo hiyouga 2023-06-05 16:49:14 +0800
fe1d930816 implement stream generating hiyouga 2023-06-05 16:43:44 +0800
44298c1235 tiny fix hiyouga 2023-06-05 15:25:22 +0800
38b83533a4 tiny fix hiyouga 2023-06-04 16:35:50 +0800
eac9921e5c tiny fix hiyouga 2023-06-04 12:55:40 +0800
3b9eee8cd2 support QLoRA hiyouga 2023-06-04 00:08:56 +0800
1bd13d7ca1 fix int8 inference hiyouga 2023-06-03 23:22:05 +0800
926291940d reduce repetition penalty hiyouga 2023-06-03 21:57:39 +0800
0f69a0c19e fix int8 inference hiyouga 2023-06-03 21:17:47 +0800
de09ee1315 add ziya prompt template hiyouga 2023-06-03 19:05:51 +0800
771f454ff1 use low_cpu_mem_usage to speed up loading hiyouga 2023-06-03 18:19:01 +0800
dca27b4412 add logits processor hiyouga 2023-06-03 16:34:54 +0800
ed6161fa6a remove unused code hiyouga 2023-06-03 00:10:54 +0800
72a85ccc39 add wechat hiyouga 2023-06-02 21:47:10 +0800
b8a034807e tiny fix hiyouga 2023-06-02 19:02:25 +0800
e3aaef7d4a fix layer norm name in PPO hiyouga 2023-06-02 17:30:01 +0800
bd565af370 fix #1 hiyouga 2023-06-02 14:25:00 +0800
50d9a20f81 alter rewards data type hiyouga 2023-06-02 14:19:51 +0800
e6126244c1 fix possibly OOM error hiyouga 2023-06-01 23:54:44 +0800