Amir337 committed on
Commit a5e6e2d · verified · 1 Parent(s): 9b4be19

Amir337/llm-course-hw2-reward-model

Files changed (3)
  1. README.md +2 -2
  2. model.safetensors +1 -1
  3. training_args.bin +2 -2
README.md CHANGED
@@ -37,8 +37,8 @@ This model was trained with Reward.
 
 - TRL: 0.25.1
 - Transformers: 4.57.1
-- Pytorch: 2.8.0+cu126
-- Datasets: 4.0.0
+- Pytorch: 2.6.0+cu124
+- Datasets: 4.4.1
 - Tokenizers: 0.22.1
 
 ## Citations
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9a7dc93c33cfa251f04a3c4bca4afd51fd106ff3c4ffa887e3327526f9a6811b
+oid sha256:82cd6448ce9dbbbadd4d103371661e2ec50bbe67a7b1c8b769a8f87116d8b12e
 size 538092792
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:65be1512ce79df43577a3a9589a5a81730208466434882c55383dfeff3430cac
-size 6033
+oid sha256:bfc059f525bb527ee7bb6631603dfa0e20728494798f38d6eb1dbba71ad91b9f
+size 5624
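
The two binary files in this commit are stored as Git LFS pointers, so the diff shows only their `version`, `oid`, and `size` fields rather than the file contents. As a minimal sketch of how those fields could be read and compared, the snippet below parses the old and new `training_args.bin` pointers from this diff. The helper name `parse_lfs_pointer` is hypothetical (it is not part of Git LFS or any library), and the parser assumes the simple `key value` line layout seen above.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its fields (hypothetical helper)."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each pointer line is "key value", separated by the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the training_args.bin diff in this commit.
old = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:65be1512ce79df43577a3a9589a5a81730208466434882c55383dfeff3430cac\n"
    "size 6033\n"
)
new = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:bfc059f525bb527ee7bb6631603dfa0e20728494798f38d6eb1dbba71ad91b9f\n"
    "size 5624\n"
)

content_changed = old["digest"] != new["digest"]
size_delta = new["size"] - old["size"]  # in bytes
print(content_changed, size_delta)
```

For `model.safetensors` the same comparison would report a changed digest with a size delta of zero, since only the `oid` line differs in that hunk.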