LLaMA-Factory-Mirror

Commit Graph

Author	SHA1	Message	Date
hiyouga	577de2fa07	fix #4242	2024-06-12 16:50:11 +08:00
hoshi-hiyouga	656b2bbdaf	Merge pull request #4234 from kimdwkimdw/patch-1 Support vllm==0.5.0	2024-06-12 16:39:09 +08:00
Arthur Kim	d65a3f7cb6	Support vllm==0.5.0	2024-06-12 16:49:12 +09:00
codingma	557891debb	update wechat_npu.jpg	2024-06-12 10:39:05 +08:00
hoshi-hiyouga	9049aab911	Merge pull request #4204 from dignfei/main fixbug：llama3在增量预训练时应该使用<\|end_of_text\|>标识文本的结束	2024-06-11 17:06:10 +08:00
hoshi-hiyouga	0c29233237	Update pretrain.py	2024-06-11 17:02:14 +08:00
hiyouga	cca6f35108	fix deepspeed version	2024-06-11 16:52:36 +08:00
d	6979f3f848	经过大量的增量预训练，进行对比试验，发现这个bug：llama3在预训练时使用的tokenizer.eos_toke是'<\|end_of_text\|>' ，这里在每条数据后面也得用这个，而不是'<\|eot_id\|>'，否则很容易导致严重的性能下降	2024-06-11 16:23:40 +08:00
hiyouga	53b74361d3	Update bug-report.yml	2024-06-11 15:40:21 +08:00
hiyouga	89f2bd8c8c	fix #4198	2024-06-11 15:38:38 +08:00
hiyouga	90e14a960d	tiny fix	2024-06-11 12:48:53 +08:00
hoshi-hiyouga	796699f867	Merge pull request #4191 from iamthebot/al--add_manifest_for_reqs Add MANIFEST.in so requirements.txt is present in sdist	2024-06-11 10:41:15 +08:00
Alfredo Luque	5f7b3b3ff6	add manifest so requirements.txt in sdist	2024-06-11 00:07:06 +00:00
hiyouga	3f24337a8a	tiny fix	2024-06-11 01:04:16 +08:00
hiyouga	91e62a098f	set dev version	2024-06-11 00:50:53 +08:00
hiyouga	2b6ebd6b51	release v0.8.1	2024-06-11 00:44:26 +08:00
hiyouga	a793e8456b	fix #4160 The split heads should be concatenated in dim=2	2024-06-11 00:37:17 +08:00
hiyouga	949e9908ad	fix #4145 Fix the docker image	2024-06-11 00:19:17 +08:00
hiyouga	0012762b04	update evaluator	2024-06-10 23:56:00 +08:00
hiyouga	c907d81667	fix #2666	2024-06-10 21:24:15 +08:00
hoshi-hiyouga	ef4afdaf0e	Merge pull request #4167 from yzoaim/branch fix README	2024-06-10 16:24:33 +08:00
-.-	483cdd9b6a	fix README	2024-06-08 23:51:56 +08:00
hiyouga	b61d25cd70	add pr ci	2024-06-08 21:25:35 +08:00
hiyouga	25c635ef28	Update tests.yml	2024-06-08 21:15:36 +08:00
hiyouga	4c4f950f39	update git workflows	2024-06-08 21:11:32 +08:00
hiyouga	972ec9c668	fix llamafactory-cli env	2024-06-08 07:15:45 +08:00
hiyouga	3ac11e77cc	set dev version	2024-06-08 06:46:09 +08:00
hiyouga	5aa4ce4756	release v0.8.0	2024-06-08 05:20:54 +08:00
hiyouga	12d79f89c5	add ultrafeedback and fineweb #4085 #4132	2024-06-08 02:42:34 +08:00
hiyouga	42d9b26fc8	fix ci	2024-06-08 02:00:44 +08:00
hiyouga	7f20e4722a	fix ci	2024-06-08 01:57:36 +08:00
hiyouga	aa2578bea0	add ci	2024-06-08 01:48:30 +08:00
hiyouga	1c7f0ab519	init unittest	2024-06-08 01:35:58 +08:00
hiyouga	4b55f35662	Delete .readthedocs.yaml	2024-06-08 00:58:10 +08:00
hiyouga	54cd743ebf	reorganize adapter code	2024-06-08 00:47:23 +08:00
hoshi-hiyouga	cfd62283a9	fix #4139	2024-06-08 00:45:02 +08:00
hiyouga	06e5d136a4	add resume args in webui	2024-06-08 00:22:16 +08:00
hiyouga	8bf9da659c	fix #4137	2024-06-07 19:16:06 +08:00
hiyouga	cce0fad91c	Update wechat.jpg	2024-06-07 19:04:16 +08:00
hiyouga	f8d8690bf4	tiny fix	2024-06-07 05:19:21 +08:00
hiyouga	4489d73ac7	fix ppo trainer save zero3 model accelerator.get_state_dict(ds_model) should be called at all ranks	2024-06-07 05:14:19 +08:00
hiyouga	2702d7e952	fix ppo in trl 0.8.6	2024-06-07 04:48:29 +08:00
hiyouga	f9e818d79c	fix #4120	2024-06-07 04:18:05 +08:00
hiyouga	ccc8b64cc2	update data processors	2024-06-07 04:15:40 +08:00
hoshi-hiyouga	181dbb0d05	Merge pull request #4009 from AlongWY/main supervised packing with greedy knapsack algorithm	2024-06-07 03:48:46 +08:00
hoshi-hiyouga	c09ad8bab3	Update supervised.py	2024-06-07 03:42:08 +08:00
hoshi-hiyouga	788e8232fc	Update supervised.py	2024-06-07 03:38:23 +08:00
hoshi-hiyouga	8cecade708	Update supervised.py	2024-06-07 03:38:04 +08:00
hiyouga	8e95648850	add qwen2 models	2024-06-07 00:22:57 +08:00
hiyouga	74f96efef9	rename files	2024-06-07 00:09:06 +08:00

1 2 3 4 5 ...

1677 Commits All Branches Search

1677 Commits

All Branches