Commit Graph

132 Commits

Author SHA1 Message Date
wql d1718878af fix: rerun jsonl_to_json.py 2024-08-12 07:39:07 +00:00
wql 0da139a06b add: add alpaca_zh.json 2024-08-12 07:26:30 +00:00
hiyouga c75b5b83c4 add magpie ultra dataset 2024-08-09 20:28:55 +08:00
hiyouga 608de799a2 add unittest 2024-07-19 01:06:27 +08:00
hiyouga 29ebcd75d5 fix up 2024-07-15 01:04:56 +08:00
hoshi-hiyouga 9d64507bd5
Update README.md 2024-07-14 21:27:04 +08:00
codingma 76f3bbcfc0 1. add custom eval dataset support
2. merge load dataset and split dataset function
2024-07-05 15:52:10 +08:00
hiyouga 9ab0401948 update data 2024-06-19 02:48:43 +08:00
hiyouga 344b9a36b2 tiny fix 2024-06-18 23:32:18 +08:00
Eli Costa 74e49cca95
Add Magpie and Webinstruct dataset samples
Adds two dataset samples claimed superior performance: Magpie (from Allen AI) and Webinstruct (from TIGER-Lab).
2024-06-15 19:31:56 -03:00
hiyouga c7a5620ccc add neo-sft dataset 2024-06-13 01:00:56 +08:00
hiyouga 12d79f89c5 add ultrafeedback and fineweb #4085 #4132 2024-06-08 02:42:34 +08:00
hoshi-hiyouga 483eb47e5d
Merge pull request #3829 from seanzhang-zhichen/add_dataset_sample_num
Add dataset sample num
2024-05-30 00:25:45 +08:00
hoshi-hiyouga c8ae7e0e65
Update README_zh.md 2024-05-30 00:04:47 +08:00
hoshi-hiyouga 3761d7d5dd
Update README.md 2024-05-30 00:04:26 +08:00
hiyouga 08564838bd fix full/freeze tuning for mllm 2024-05-27 20:37:57 +08:00
BUAADreamer 576b0206c2 Merge branch 'main' of https://github.com/BUAADreamer/LLaMA-Factory 2024-05-27 20:11:23 +08:00
BUAADreamer e2022ce4e9
Merge branch 'hiyouga:main' into main 2024-05-27 20:10:58 +08:00
BUAADreamer f665342a27 remove mllm_pt_demo.json 2024-05-27 20:10:31 +08:00
hiyouga 08bd0440b5 add llava 1k datasets 2024-05-27 19:57:33 +08:00
seanzhang-zhichen 27cb51f7f8
Merge branch 'main' into add_dataset_sample_num 2024-05-24 15:57:47 +08:00
BUAADreamer 8d53ec2b5f
Merge branch 'hiyouga:main' into main 2024-05-21 22:18:20 +08:00
hiyouga 4d647ddba5 Update README_zh.md 2024-05-21 18:30:59 +08:00
BUAADreamer 29a6d5bdb8 support pretraining of llava 2024-05-21 08:57:14 +08:00
hiyouga 7262679666 fix #3818 2024-05-20 21:43:19 +08:00
zhangzc d956041640 fix conflict 2024-05-20 17:10:01 +08:00
hiyouga ca48f90f1e update data readme 2024-05-18 21:37:38 +08:00
hiyouga 18cbf8561d update data readme 2024-05-18 21:15:20 +08:00
hiyouga c450ee87a3 improve KTO impl., replace datasets 2024-05-18 03:44:56 +08:00
enji.zhou db1d5a4f51 add kto 2024-05-17 13:09:17 +08:00
hiyouga 58c522cd5c remove checksum and fix ui args 2024-05-12 01:10:30 +08:00
codingma d5520b6017 fix sha1 of glaive_toolcall dataset 2024-05-09 16:33:45 +08:00
hiyouga 1ccbfe562d remove big file 2024-05-07 22:14:06 +08:00
hiyouga 09f3ef1de4 fix stop param 2024-05-07 00:41:04 +08:00
hoshi-hiyouga d6ca7853fa
Merge pull request #3588 from ZeyuTeng96/patch-1
update hf_hub_url for nectar_rm in dataset_info
2024-05-07 00:06:11 +08:00
hoshi-hiyouga c3910ab98a
Update dataset_info.json 2024-05-07 00:05:45 +08:00
hiyouga f02f87c6fb update example docs 2024-05-06 22:51:02 +08:00
ZeyuTeng96 044af36442
update hf_hub_url for nectar_rm in dataset_info
Hi there,

I cannot find the "mlinmg/RLAIF-Nectar" on hf, seems like it changed as "AstraMindAI/RLAIF-Nectar". So, making a PR for updating.

See: https://huggingface.co/datasets/AstraMindAI/RLAIF-Nectar
2024-05-06 16:44:50 +08:00
hoshi-hiyouga d4d9180c40
Update README_zh.md 2024-05-02 02:14:55 +08:00
hoshi-hiyouga b072ec9d1b
Update README.md 2024-05-02 02:13:46 +08:00
Lao ce17eccf45
Update README_zh.md 2024-04-28 23:31:37 +08:00
khazic 288911fc7b Upgrade the second sharegpt format 2024-04-28 14:30:05 +08:00
khazic d1ba32e4bb added the second sharegpt format 2024-04-28 14:27:45 +08:00
hiyouga 5ee04d418c update readme 2024-04-26 23:39:19 +08:00
hoshi-hiyouga 8f91420223
Merge pull request #3471 from BUAADreamer/main
add llava_150k en/zh mllm sft data
2024-04-26 23:36:41 +08:00
hoshi-hiyouga c29b257007
Update dataset_info.json 2024-04-26 23:34:34 +08:00
BUAADreamer a177872010 add llava_150k en/zh mllm sft data 2024-04-26 23:18:58 +08:00
hiyouga 168f56683a release v0.7.0 2024-04-26 23:18:00 +08:00
hiyouga e057c8de48 support mllm hf inference 2024-04-26 05:34:58 +08:00
hoshi-hiyouga f8c26e6a34
Update dataset_info.json 2024-04-26 03:03:36 +08:00