From 7ca0eb0bcc32c0925f35429a1e65d9cd034797c0 Mon Sep 17 00:00:00 2001 From: "chaoyu@qiyuanlab.com" Date: Tue, 16 Jul 2024 18:41:27 +0800 Subject: [PATCH] =?UTF-8?q?=E4=BF=AE=E6=94=B9=E4=B8=80=E4=BA=9B=E6=A0=BC?= =?UTF-8?q?=E5=BC=8F=E9=97=AE=E9=A2=98?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit --- quick_start_clean/readmes/quick_start.md | 1 + 1 file changed, 1 insertion(+) diff --git a/quick_start_clean/readmes/quick_start.md b/quick_start_clean/readmes/quick_start.md index eebe879..8e75ee2 100644 --- a/quick_start_clean/readmes/quick_start.md +++ b/quick_start_clean/readmes/quick_start.md @@ -148,6 +148,7 @@ cat pretrain.txt | python convert_txt2jsonl.py > pretrain.jsonl ``` 2. jsonl格式转index。脚本位于./quick_start_clean/convert_json2index.py,应用方法如下: + ```shell python convert_json2index.py \ --path ../data_process/data \ #存放jsonl文件的目录