Safetensors
llama
zkshan2002 committed on
Commit 71c49be · verified · 1 Parent(s): 2e81574

Create README.md

Files changed (1)
  1. README.md +10 -0
README.md ADDED
@@ -0,0 +1,10 @@
+ ---
+ datasets:
+ - weqweasdas/ultra_train
+ base_model:
+ - OpenRLHF/Llama-3-8b-sft-mixture
+ reward_model:
+ - zkshan2002/r1B-sft_tokenizer
+ dpo_model:
+ - zkshan2002/DPO-uf-llama3-8B-OpenRLHF
+ ---