cherry0328 commited on
Commit
d7c6442
·
verified ·
1 Parent(s): b5cae03

Update README.md

Browse files

The model is fantastic! We would like to contribute by updating the model README to include the base_model information. This is to address the missing model card. Thank you for your consideration!

Files changed (1) hide show
  1. README.md +9 -8
README.md CHANGED
@@ -1,16 +1,17 @@
1
-
2
  ---
3
  license: apache-2.0
4
  language:
5
- - en
6
  library_name: transformers
7
  tags:
8
- - llama
9
- - dpo
10
- - preference-optimization
11
- - PEFT
12
- - instruction-tuning
13
  pipeline_tag: text-generation
 
 
14
  ---
15
 
16
  # DPO Fine-Tuned Adapter - PairRM Dataset
@@ -37,4 +38,4 @@ pipeline_tag: text-generation
37
  - Size: 500 instructions with `prompt`, `chosen`, and `rejected` columns
38
 
39
  ## 📂 Output
40
- - Adapter saved and uploaded as `Likhith003/dpo-pairrm-lora-adapter`
 
 
1
  ---
2
  license: apache-2.0
3
  language:
4
+ - en
5
  library_name: transformers
6
  tags:
7
+ - llama
8
+ - dpo
9
+ - preference-optimization
10
+ - PEFT
11
+ - instruction-tuning
12
  pipeline_tag: text-generation
13
+ base_model:
14
+ - meta-llama/Llama-3.2-1B-Instruct
15
  ---
16
 
17
  # DPO Fine-Tuned Adapter - PairRM Dataset
 
38
  - Size: 500 instructions with `prompt`, `chosen`, and `rejected` columns
39
 
40
  ## 📂 Output
41
+ - Adapter saved and uploaded as `Likhith003/dpo-pairrm-lora-adapter`