Commit Graph

  • 40908a36fa fix: rename optimzer to optimizer moontidef 2024-08-07 10:05:01 +0800
  • f4f9659c86 free buffer qianhao0713 2024-08-02 11:35:48 +0800
  • 55f32dfbf9
    Merge branch 'hiyouga:main' into main moontidef 2024-08-06 00:18:45 +0800
  • b82ecbedd0 fix: fix the deepseekcoder template to avoid repeat problem moontidef 2024-08-05 23:55:45 +0800
  • b7ca6c8dc1 fix #5048 hiyouga 2024-08-05 23:48:19 +0800
  • c2921b9960
    Merge pull request #5037 from codemayq/feature-gemma-2-2b hoshi-hiyouga 2024-08-05 23:27:37 +0800
  • 9e72106391 fix #3998 steven 2024-08-05 10:54:10 +0800
  • dc09d454f2 support gemma-2-2b codingma 2024-08-01 13:45:48 +0800
  • 1c05b847b2 update wechat.jpg codingma 2024-08-01 09:51:47 +0800
  • cfe0652545
    Merge branch 'main' into feature/Support-Qwenvl marko1616 2024-07-31 21:05:15 +0800
  • 7812090363 Support for training lm_head in freeze finetuning_type Sangchun Ha (Patrick) 2024-07-31 21:20:13 +0900
  • 8f43fc1749 arange launch shell qianhao0713 2024-07-30 16:03:37 +0800
  • 3885949a9d update wechat_npu.jpg codingma 2024-07-30 13:45:47 +0800
  • 41d0dfc797
    overwrite training_step for CustomDPOTrainer zzc 2024-07-30 13:43:12 +0800
  • cd420c1938
    Merge pull request #5010 from Eruly/main hoshi-hiyouga 2024-07-30 01:55:54 +0800
  • 06e17eb462
    Merge pull request #4996 from LDLINGLINGLING/main hoshi-hiyouga 2024-07-30 01:55:30 +0800
  • 3a49c76b65
    Update README_zh.md hoshi-hiyouga 2024-07-30 01:55:13 +0800
  • 9e409eadb0
    Update README.md hoshi-hiyouga 2024-07-30 01:53:19 +0800
  • 8d5a41f2cd
    Update README.md hoshi-hiyouga 2024-07-30 01:52:35 +0800
  • daa62db06f
    Merge pull request #4995 from codemayq/fix-pissa hoshi-hiyouga 2024-07-30 01:47:25 +0800
  • 371009e522 Add Korean web UI (llamafactory-cli webui) eruly 2024-07-29 13:47:13 +0000
  • b9ed9d45cc 增加了MiniCPM在页面首页的支持列表,MiniCPM官方github也放了LLama_factory的友情链接 liudan 2024-07-29 10:58:28 +0800
  • 2c1ca9f742 fix pissa save codingma 2024-07-29 10:44:34 +0800
  • 668654b5ad tiny fix hiyouga 2024-07-26 11:51:00 +0800
  • 8a2846cfe1
    Merge pull request #4892 from piamo/main hoshi-hiyouga 2024-07-26 11:49:34 +0800
  • 9839c6d1f6
    Merge pull request #4950 from liuwwang/main and fi hoshi-hiyouga 2024-07-26 11:48:56 +0800
  • b8896b9b8b
    Merge pull request #4970 from HardAndHeavy/add-rocm hoshi-hiyouga 2024-07-26 11:41:23 +0800
  • 3c424cf69a
    Merge pull request #4961 from khazic/main hoshi-hiyouga 2024-07-26 11:32:29 +0800
  • 77e7bfee79
    Update README_zh.md hoshi-hiyouga 2024-07-26 11:30:57 +0800
  • 1186ad53d4
    Update README.md hoshi-hiyouga 2024-07-26 11:29:28 +0800
  • f97beca23a
    Update README.md hoshi-hiyouga 2024-07-26 11:29:09 +0800
  • 024c49d4e0 update wechat.jpg codemayq 2024-07-26 10:01:10 +0800
  • c8e18a669a Add ROCm support HardAndHeavy 2024-07-25 21:29:28 +0300
  • ceba96f9ed Added the reference address for TRL PPO details. khazic 2024-07-25 09:03:21 +0800
  • ac709da285
    Merge a12a2f1bcf into 77cff78863 khazzz1c 2024-07-25 00:44:20 +0000
  • a12a2f1bcf
    Merge branch 'hiyouga:main' into main khazzz1c 2024-07-25 08:44:17 +0800
  • 77cff78863 fix #4959 hiyouga 2024-07-24 23:44:00 +0800
  • 30f8149d11 update webui hiyouga 2024-07-24 21:11:51 +0800
  • 71d3e60713
    Update README_zh.md hoshi-hiyouga 2024-07-24 21:08:42 +0800
  • 5626bdc56d
    Update README.md hoshi-hiyouga 2024-07-24 21:07:14 +0800
  • ace1d44857 tiny fix hiyouga 2024-07-24 18:33:39 +0800
  • 79b186b2a1 docs: add Japanese README Ikko Ashimine 2024-07-24 18:42:39 +0900
  • 091010492b fix #4928 hiyouga 2024-07-24 17:00:29 +0800
  • 935b22d93e fix #4925 hiyouga 2024-07-24 16:56:58 +0800
  • 1bbd49faae fix #4944 hiyouga 2024-07-24 16:42:51 +0800
  • 1550fe7331 add mistral nemo model hiyouga 2024-07-24 16:25:53 +0800
  • 26533c0604 add llama3.1 hiyouga 2024-07-24 16:20:11 +0800
  • f91a9a250a
    fix: Repair the issue where quantization failed after merging the adapter. Liuww 2024-07-24 14:31:29 +0800
  • f56dd37f08 Update some data :) WeepingDogel 2024-07-23 22:35:44 +0800
  • 6fbb338a90 add Kira-Pgr 2024-07-23 20:49:13 +0800
  • ac4a31e19e
    Merge branch 'hiyouga:main' into main khazzz1c 2024-07-23 18:36:32 +0800
  • 85a6e0fce9
    Merge pull request #4 from richardodliu/main khazzz1c 2024-07-23 18:36:17 +0800
  • 0526052717 try ppov2 khazic 2024-07-23 16:31:20 +0800
  • bb0a37dc06 Update wechat_npu.jpg hiyouga 2024-07-22 21:17:22 +0800
  • 5665062ca0 tiny fix hiyouga 2024-07-22 21:10:15 +0800
  • 26082fc6c9
    fix #4917 hoshi-hiyouga 2024-07-22 11:28:31 +0800
  • 7b5b32ffb5
    Merge branch 'main' into main Zhangchi Feng 2024-07-22 09:24:30 +0800
  • c333e2f49d tiny fix hiyouga 2024-07-22 00:06:03 +0800
  • 4135e69406 fix flashattn + packing hiyouga 2024-07-21 17:07:45 +0800
  • ad71296a7c update wechat hiyouga 2024-07-20 22:00:44 +0800
  • 9c6587e303 glm4v pairwise dataset support marko1616 2024-07-20 04:11:24 +0800
  • 3f9ccb321c RLHF support. marko1616 2024-07-19 18:55:53 +0800
  • 44e48e2b82 update deepseek template huangpan.foo 2024-07-19 15:02:54 +0800
  • 3c2ecbab75 Conflict fix marko1616 2024-07-19 03:53:04 +0800
  • 88c7fc1599 set dev version hiyouga 2024-07-19 02:01:46 +0800
  • 8f6995081c update parser v0.8.3 hiyouga 2024-07-19 01:36:39 +0800
  • bbd5a64423 release v0.8.3 hiyouga 2024-07-19 01:21:18 +0800
  • cdb0f34f10 fix test hiyouga 2024-07-19 01:17:37 +0800
  • e80006795f fix unittest hiyouga 2024-07-19 01:10:30 +0800
  • 608de799a2 add unittest hiyouga 2024-07-19 01:06:27 +0800
  • 36932ddb55
    Merge branch 'main' into feature/Support-Qwenvl marko1616 2024-07-18 23:40:19 +0800
  • 779aae83d2 follow #4878 fix #4684 hiyouga 2024-07-18 22:06:12 +0800
  • 2516763d69
    Merge pull request #4878 from ly863/main hoshi-hiyouga 2024-07-18 22:03:41 +0800
  • 545e64afe6 Update metric.py 01WarpDrive 2024-07-18 16:10:15 +0800
  • 1e7b396ff2 仅仅训练最后一轮对话 Shiyu Zhang 2024-07-18 15:30:25 +0800
  • beec77a089 fix metrics #4786 hiyouga 2024-07-17 00:47:00 +0800
  • d774b94f12 support batch_eval_metrics, fix #4826 hiyouga 2024-07-17 00:33:00 +0800
  • 0543306b1c
    Merge pull request #5 from ZJLab-DataHub-Security/qianhao qianhao 2024-07-16 18:25:02 +0800
  • bda302fbfb tiny fix hiyouga 2024-07-15 23:09:50 +0800
  • abdc2fa1f1
    Merge branch 'hiyouga:main' into main Zhangchi Feng 2024-07-15 23:09:02 +0800
  • f2aaebdbde
    Merge pull request #4822 from codemayq/test-ci hoshi-hiyouga 2024-07-15 23:07:55 +0800
  • 10289eab15
    Update test_template.py hoshi-hiyouga 2024-07-15 23:04:39 +0800
  • da990f76b8
    Update test_template.py hoshi-hiyouga 2024-07-15 23:00:27 +0800
  • 38bc411d42
    Merge pull request #4821 from codemayq/feature-eval-split hoshi-hiyouga 2024-07-15 22:59:44 +0800
  • 91ba083f37
    Update llama3_lora_eval.yaml hoshi-hiyouga 2024-07-15 22:55:12 +0800
  • 33420bab81
    Update test_template.py hoshi-hiyouga 2024-07-15 22:55:05 +0800
  • 52a4256ad9
    Update test_template.py hoshi-hiyouga 2024-07-15 22:52:25 +0800
  • fd8cc49008 fix #4820 hiyouga 2024-07-15 22:32:07 +0800
  • b0aa321a4a update wechat hiyouga 2024-07-15 22:02:52 +0800
  • 89c28fb65f fix compute_loss for cpt qianhao0713 2024-07-15 20:30:09 +0800
  • 70bd600d8c add cpt test launch shell qianhao0713 2024-07-15 19:39:25 +0800
  • 34f70cec65 add dp&sp hybrid for cpt qianhao0713 2024-07-15 19:08:29 +0800
  • ca44c8dde8 solve the predict problem of llava-next-video and the multi-gpu finetuning problem of idefics2 BUAADreamer 2024-07-15 17:27:37 +0800
  • 92554c2d42
    Merge pull request #4 from ZJLab-DataHub-Security/qianhao_dev qianhao 2024-07-15 17:15:46 +0800
  • d5563d3030
    Merge branch 'hiyouga:main' into main Zhangchi Feng 2024-07-15 16:22:59 +0800
  • 9b360dde49
    Merge pull request #3 from ZJLab-DataHub-Security/qianhao luckyqsz 2024-07-15 16:10:30 +0800
  • 4a4ea30960 fix bug qianhao0713 2024-07-15 16:08:37 +0800
  • d31c8f764c fix 70b launch shell qianhao0713 2024-07-15 15:49:58 +0800
  • d5e513528e rename variables qianhao0713 2024-07-15 15:39:24 +0800
  • 32c3afdfa1 add IN_GITHUB_ACTIONS codingma 2024-07-15 10:28:07 +0800