1984 Commits

Author SHA1 Message Date
hoshi-hiyouga daa62db06f
Merge pull request #4995 from codemayq/fix-pissa
fix pissa callback
2024-07-30 01:47:25 +08:00
codingma 2c1ca9f742 fix pissa save 2024-07-29 10:44:34 +08:00
hiyouga 668654b5ad tiny fix 2024-07-26 11:51:00 +08:00
hoshi-hiyouga 8a2846cfe1
Merge pull request #4892 from piamo/main
update deepseek template
2024-07-26 11:49:34 +08:00
hoshi-hiyouga 9839c6d1f6
Merge pull request #4950 from liuwwang/main and fi
fix: Repair the issue where quantization failed after merging the adapter.
2024-07-26 11:48:56 +08:00
hoshi-hiyouga b8896b9b8b
Merge pull request #4970 from HardAndHeavy/add-rocm
Add ROCm support
2024-07-26 11:41:23 +08:00
hoshi-hiyouga 3c424cf69a
Merge pull request #4961 from khazic/main
Added the reference address for TRL PPO details.
2024-07-26 11:32:29 +08:00
hoshi-hiyouga 77e7bfee79 Update README_zh.md 2024-07-26 11:30:57 +08:00
hoshi-hiyouga 1186ad53d4 Update README.md 2024-07-26 11:29:28 +08:00
hoshi-hiyouga f97beca23a Update README.md 2024-07-26 11:29:09 +08:00
codemayq 024c49d4e0 update wechat.jpg 2024-07-26 10:01:10 +08:00
HardAndHeavy c8e18a669a Add ROCm support 2024-07-25 21:29:28 +03:00
khazic ceba96f9ed Added the reference address for TRL PPO details. 2024-07-25 09:03:21 +08:00
hiyouga 77cff78863 fix #4959 2024-07-24 23:44:00 +08:00
hiyouga 30f8149d11 update webui 2024-07-24 21:11:51 +08:00
hoshi-hiyouga 71d3e60713 Update README_zh.md 2024-07-24 21:08:42 +08:00
hoshi-hiyouga 5626bdc56d Update README.md 2024-07-24 21:07:14 +08:00
hiyouga ace1d44857 tiny fix 2024-07-24 18:33:39 +08:00
hiyouga 091010492b fix #4928 2024-07-24 17:00:29 +08:00
hiyouga 935b22d93e fix #4925 2024-07-24 16:56:58 +08:00
hiyouga 1bbd49faae fix #4944 2024-07-24 16:42:51 +08:00
hiyouga 1550fe7331 add mistral nemo model 2024-07-24 16:25:53 +08:00
hiyouga 26533c0604 add llama3.1 2024-07-24 16:20:11 +08:00
Liuww f91a9a250a fix: Repair the issue where quantization failed after merging the adapter. 2024-07-24 14:31:29 +08:00
hiyouga bb0a37dc06 Update wechat_npu.jpg 2024-07-22 21:17:22 +08:00
hiyouga 5665062ca0 tiny fix 2024-07-22 21:10:15 +08:00
hoshi-hiyouga 26082fc6c9 fix #4917 2024-07-22 11:28:31 +08:00
hiyouga c333e2f49d tiny fix 2024-07-22 00:06:03 +08:00
hiyouga 4135e69406 fix flashattn + packing 2024-07-21 17:07:45 +08:00
hiyouga ad71296a7c update wechat 2024-07-20 22:00:44 +08:00
huangpan.foo 44e48e2b82 update deepseek template 2024-07-19 15:02:54 +08:00
hiyouga 88c7fc1599 set dev version 2024-07-19 02:01:46 +08:00
hiyouga 8f6995081c update parser 2024-07-19 01:36:39 +08:00
hiyouga bbd5a64423 release v0.8.3 2024-07-19 01:21:18 +08:00
hiyouga cdb0f34f10 fix test 2024-07-19 01:17:37 +08:00
hiyouga e80006795f fix unittest 2024-07-19 01:10:30 +08:00
hiyouga 608de799a2 add unittest 2024-07-19 01:06:27 +08:00
hiyouga 779aae83d2 follow #4878 fix #4684 2024-07-18 22:06:12 +08:00
hoshi-hiyouga 2516763d69
Merge pull request #4878 from ly863/main
Train the last turn of the conversation.
2024-07-18 22:03:41 +08:00
Shiyu Zhang 1e7b396ff2 Train only the last turn of the conversation 2024-07-18 15:30:25 +08:00
hiyouga beec77a089 fix metrics #4786 2024-07-17 00:47:00 +08:00
hiyouga d774b94f12 support batch_eval_metrics, fix #4826 2024-07-17 00:33:00 +08:00
hiyouga bda302fbfb tiny fix 2024-07-15 23:09:50 +08:00
hoshi-hiyouga f2aaebdbde
Merge pull request #4822 from codemayq/test-ci
add github action check to ignore some test cases
2024-07-15 23:07:55 +08:00
hoshi-hiyouga 10289eab15 Update test_template.py 2024-07-15 23:04:39 +08:00
hoshi-hiyouga da990f76b8 Update test_template.py 2024-07-15 23:00:27 +08:00
hoshi-hiyouga 38bc411d42
Merge pull request #4821 from codemayq/feature-eval-split
add "split" as suffix in eval task name
2024-07-15 22:59:44 +08:00
hoshi-hiyouga 91ba083f37 Update llama3_lora_eval.yaml 2024-07-15 22:55:12 +08:00
hoshi-hiyouga 33420bab81 Update test_template.py 2024-07-15 22:55:05 +08:00
hoshi-hiyouga 52a4256ad9 Update test_template.py 2024-07-15 22:52:25 +08:00