Muhammed164 committed on
Commit 97db53a · verified · 1 Parent(s): 7a13ffa

Training in progress, step 100

Files changed (3)
  1. README.md +1 -1
  2. adapter_model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -28,7 +28,7 @@ print(output["generated_text"])
 
 ## Training procedure
 
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/maymn0535-none/huggingface/runs/rnwpzpnd)
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/maymn0535-none/huggingface/runs/ic3pxgaa)
 
 
 This model was trained with DPO, a method introduced in [Direct Preference Optimization: Your Language Model is Secretly a Reward Model](https://huggingface.co/papers/2305.18290).
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6578cecfe50e7d3a20d040c9b1cdda32c58f9cfcb54fdc5334ee95fe69c2eddf
+oid sha256:96db5154f98f00e2835e99ed5a8fbdf293d8f63243a6f707c81db39c6c06f0a2
 size 204500912
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fa5979d784b3be5f03398730b0db9a0aaad24ae1fdea10accf8ecc4f7c831b44
+oid sha256:d0e9844d94b2ddeacda52d988f72cd6b4206ea325ad209511ab311437d2b42ef
 size 6289
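
Review note: both binary files in this commit are stored as Git LFS pointers, so the diff shows only a new `oid` with an unchanged `size`. A minimal sketch of how a downloaded file could be checked against such a pointer is given below. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of this repository or of any Git LFS tooling; the sketch assumes the pointer uses the `sha256` oid type shown in the diff.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid line has the form '<algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and SHA-256 digest named by the pointer."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported oid type: {pointer['algo']}")
    # Cheap check first: a size mismatch means the hash cannot match.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.sha256()
    with path.open("rb") as handle:
        # Read in 1 MiB chunks so large adapter files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

For example, the new `training_args.bin` pointer text from this commit could be passed to `parse_lfs_pointer`, and the result handed to `matches_pointer` together with the path of a locally downloaded copy. Because the size is identical before and after the commit, only the digest comparison distinguishes the two versions.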