Files changed (3)
  1. .gitattributes +1 -0
  2. README.md +32 -30
  3. distill_orpo.png +3 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ distill_orpo.png filter=lfs diff=lfs merge=lfs -text
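The added rule is the kind of line that `git lfs track "distill_orpo.png"` usually writes, though the diff does not say how it was produced. As a minimal sketch, the effect of the rule can be checked with plain `git check-attr` in a throwaway repository. The temporary directory and the `attrs` variable are illustrative and not part of the commit.

```bash
# Sketch: confirm that the new .gitattributes rule routes the image through LFS.
# Assumption: run in a scratch repo; only plain git is required for check-attr.
tmp=$(mktemp -d)
git init -q "$tmp"
cd "$tmp"

# Reproduce the single rule added by this commit.
printf '%s\n' 'distill_orpo.png filter=lfs diff=lfs merge=lfs -text' > .gitattributes

# Ask git which attributes apply to the path (the file itself need not exist).
attrs=$(git check-attr filter diff merge text -- distill_orpo.png)
echo "$attrs"
```

Each output line has the form `path: attribute: value`. The `filter`, `diff`, and `merge` attributes should resolve to `lfs`, and `text` should report `unset` because of the `-text` entry.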
README.md CHANGED
@@ -1,30 +1,32 @@
- ---
- license: mit
- ---
- ## Dataset
- ### Download
- ```bash
- git lfs install
- git clone https://huggingface.co/datasets/juneup/PKU-SafeRLHF-orpo
- ```
- **To skip cloning the large files:
- ```bash
- GIT_LFS_SKIP_SMUDGE=1 git clone https://huggingface.co/datasets/juneup/PKU-SafeRLHF-orpo
- ```
-
- ## Model
-
- ### Download
- ```bash
- git lfs install
- git clone https://huggingface.co/juneup/internlm2.5_7b_distill_orpo
- ```
- **To skip cloning the large files
- ```bash
- GIT_LFS_SKIP_SMUDGE=1 git clone https://huggingface.co/juneup/internlm2.5_7b_distill_orpo
- ```
-
- ### Download from Ollama
- ```bash
- ollama run Juneup/internlm2.5_7b_distill:orpo_q4_k_m
- ```
+ ---
+ license: mit
+ ---
+ # internlm2.5_7b_distill_orpo
+ ## Architecture diagram
+ <div align="center">
+ <img src="distill_orpo.png" width="800"/>
+ </div>
+
+ ## Base model
+ https://huggingface.co/internlm/internlm2_5-7b-chat
+
+ ## Dataset
+ ### Dataset composition
+ The source dataset, PKU-SafeRLHF (https://huggingface.co/datasets/PKU-Alignment/PKU-SafeRLHF-single-dimension), was processed into the final training dataset (https://huggingface.co/datasets/juneup/PKU-SafeRLHF-orpo-72k).
+
+ ### Training method
+ ORPO, λ=0.2, lr=5e-6
+
+ ## Download the model
+ ```bash
+ git lfs install
+ git clone https://huggingface.co/juneup/internlm2.5_7b_distill_orpo
+ ```
+ To skip cloning the large files:
+ ```bash
+ GIT_LFS_SKIP_SMUDGE=1 git clone https://huggingface.co/juneup/internlm2.5_7b_distill_orpo
+ ```
+
+ ### Download from Ollama
+ ```bash
+ ollama run Juneup/internlm2.5_7b_distill:orpo_q4_k_m
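The README above gives the training settings only as a method name and two numbers. For context, the ORPO objective as defined in the original ORPO paper is sketched below, on the assumption (not confirmed by this commit) that the README's λ is the weight on the odds-ratio term. Here $y_w$ and $y_l$ denote the chosen and rejected responses for a prompt $x$, and $\sigma$ is the logistic sigmoid.

$$
\mathcal{L}_{\mathrm{ORPO}} = \mathbb{E}_{(x,\,y_w,\,y_l)}\left[\mathcal{L}_{\mathrm{SFT}} + \lambda \cdot \mathcal{L}_{\mathrm{OR}}\right],
\qquad
\mathcal{L}_{\mathrm{OR}} = -\log \sigma\!\left(\log \frac{\mathrm{odds}_\theta(y_w \mid x)}{\mathrm{odds}_\theta(y_l \mid x)}\right),
\qquad
\mathrm{odds}_\theta(y \mid x) = \frac{P_\theta(y \mid x)}{1 - P_\theta(y \mid x)}
$$

Under that reading, λ=0.2 would scale the preference penalty relative to the supervised fine-tuning loss, and lr=5e-6 would be the optimizer learning rate.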
distill_orpo.png ADDED

Git LFS Details

  • SHA256: 27042ec056d766758d10c487484cf282ee54b0f6904733872b2104842005dc16
  • Pointer size: 131 Bytes
  • Size of remote file: 325 kB