AngelRaychev commited on
Commit
ae0e976
·
verified ·
1 Parent(s): fd26ca4

End of training

Browse files
Files changed (4) hide show
  1. README.md +2 -2
  2. config.json +1 -1
  3. pytorch_model.bin +1 -1
  4. training_args.bin +1 -1
README.md CHANGED
@@ -1,5 +1,5 @@
1
  ---
2
- base_model: AngelRaychev/1.5B-policy-iteration_0
3
  library_name: transformers
4
  model_name: 1.5B-policy-iteration_1
5
  tags:
@@ -11,7 +11,7 @@ licence: license
11
 
12
  # Model Card for 1.5B-policy-iteration_1
13
 
14
- This model is a fine-tuned version of [AngelRaychev/1.5B-policy-iteration_0](https://huggingface.co/AngelRaychev/1.5B-policy-iteration_0).
15
  It has been trained using [TRL](https://github.com/huggingface/trl).
16
 
17
  ## Quick start
 
1
  ---
2
+ base_model: AngelRaychev/1.5B-policy-iteration_1
3
  library_name: transformers
4
  model_name: 1.5B-policy-iteration_1
5
  tags:
 
11
 
12
  # Model Card for 1.5B-policy-iteration_1
13
 
14
+ This model is a fine-tuned version of [AngelRaychev/1.5B-policy-iteration_1](https://huggingface.co/AngelRaychev/1.5B-policy-iteration_1).
15
  It has been trained using [TRL](https://github.com/huggingface/trl).
16
 
17
  ## Quick start
config.json CHANGED
@@ -11,7 +11,7 @@
11
  "intermediate_size": 8960,
12
  "max_position_embeddings": 131072,
13
  "max_window_layers": 28,
14
- "model_card": "\nFinal Loss: 0.1511\nBatch Size: 128\nLearning Rate: 5e-05\nDataset Size: 12000\n",
15
  "model_type": "qwen2",
16
  "num_attention_heads": 12,
17
  "num_hidden_layers": 28,
 
11
  "intermediate_size": 8960,
12
  "max_position_embeddings": 131072,
13
  "max_window_layers": 28,
14
+ "model_card": "\nFinal Loss: 0.1191\nBatch Size: 128\nLearning Rate: 2e-05\nDataset Size: 12000\n",
15
  "model_type": "qwen2",
16
  "num_attention_heads": 12,
17
  "num_hidden_layers": 28,
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:367a94f556dc5a20a5d38b0f5fecd8d8735c49a5ccd4c83340cef93c3eb0796c
3
  size 3087542418
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c5ed8b1c575f7004b2a8e20b5ba77d2cca1c10408af1ecc44436242d5c506063
3
  size 3087542418
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f7e6f8038e2971c4021b1029ebc2b09d36658ca526e9b43678814b4088b5117b
3
  size 5624
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8d6b3a3c47144ece8459524a9751875282489b14b993cf1c9e48cab88ecd976a
3
  size 5624