diff --git a/README.md b/README.md index 28c1422..b71c6f3 100644 --- a/README.md +++ b/README.md @@ -120,7 +120,7 @@ LawGPT 系列模型的训练过程分为两个阶段: ### 二次训练流程 1. 参考 `data/example_instruction_train.json` 构造指令微调数据集 -2. 运行 `src/scripts/train.sh` +2. 运行 `src/scripts/train_lora.sh` ### 指令精调步骤 @@ -135,9 +135,21 @@ LawGPT 系列模型的训练过程分为两个阶段: ### 输出示例 -
问题:欠了信用卡的钱还不上要坐牢吗? +
问题:请给出判决意见。 -![](assets/demo/example-01.jpeg) +![](assets/demo/example-05.jpeg) + +
+ +
问题:请介绍赌博罪的定义。 + +![](assets/demo/example-06.jpeg) + +
+ +
问题:请问加班工资怎么算? + +![](assets/demo/example-04.jpeg)
@@ -147,12 +159,19 @@ LawGPT 系列模型的训练过程分为两个阶段:
-
问题:你能否写一段抢劫罪罪名的案情描述? +
问题:欠了信用卡的钱还不上要坐牢吗? -![](assets/demo/example-03.jpg) +![](assets/demo/example-01.jpeg)
+
问题:你能否写一段抢劫罪罪名的案情描述? + +![](assets/demo/example-03.jpeg) + +
+ + ### 局限性 由于计算资源、数据规模等因素限制,当前阶段 LawGPT 存在诸多局限性: diff --git a/assets/demo/example-03.jpg b/assets/demo/example-03.jpeg similarity index 100% rename from assets/demo/example-03.jpg rename to assets/demo/example-03.jpeg diff --git a/assets/demo/example-04.jpeg b/assets/demo/example-04.jpeg new file mode 100644 index 0000000..9119130 Binary files /dev/null and b/assets/demo/example-04.jpeg differ diff --git a/assets/demo/example-05.jpeg b/assets/demo/example-05.jpeg new file mode 100644 index 0000000..0880f39 Binary files /dev/null and b/assets/demo/example-05.jpeg differ diff --git a/assets/demo/example-06.jpeg b/assets/demo/example-06.jpeg new file mode 100644 index 0000000..d3a43ea Binary files /dev/null and b/assets/demo/example-06.jpeg differ diff --git a/src/scripts/train.sh b/src/scripts/train.sh index 3826276..56532a2 100644 --- a/src/scripts/train.sh +++ b/src/scripts/train.sh @@ -1,6 +1,6 @@ #!/bin/bash -WORLD_SIZE=8 CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 torchrun --nproc_per_node=8 --master_port=1235 train.py \ +WORLD_SIZE=8 CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 torchrun --nproc_per_node=8 --master_port=1235 train_lora.py \ --base_model '../models/base_models/chinese_llama_7b' \ --data_path '' \ --output_dir '../models/lora_weights' \ diff --git a/src/train.py b/src/train_lora.py similarity index 100% rename from src/train.py rename to src/train_lora.py