End of training

Browse files

Files changed (6) hide show

README.md +70 -0
all_results.json +10 -0
model.safetensors +1 -1
runs/Nov18_20-34-35_azazia/events.out.tfevents.1763498075.azazia.69531.0 +2 -2
runs/Nov18_20-34-35_azazia/events.out.tfevents.1763498395.azazia.69531.1 +3 -0
test_results.json +10 -0

README.md ADDED Viewed

	@@ -0,0 +1,70 @@

+---
+library_name: transformers
+tags:
+- generated_from_trainer
+model-index:
+- name: reward-model-1
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# reward-model-1
+This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.0609
+- Mse: 0.0609
+- R2: 0.5447
+- Pearson: 0.7406
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 0.0001
+- train_batch_size: 8
+- eval_batch_size: 8
+- seed: 42
+- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: cosine
+- num_epochs: 10
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Mse    | R2     | Pearson |
+|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:-------:|
+| 0.1237        | 0.8   | 100  | 0.1100          | 0.1100 | 0.1781 | 0.5916  |
+| 0.0675        | 1.6   | 200  | 0.0723          | 0.0723 | 0.4597 | 0.6906  |
+| 0.0562        | 2.4   | 300  | 0.0684          | 0.0684 | 0.4890 | 0.7094  |
+| 0.0625        | 3.2   | 400  | 0.0650          | 0.0650 | 0.5145 | 0.7175  |
+| 0.0563        | 4.0   | 500  | 0.0662          | 0.0662 | 0.5055 | 0.7120  |
+| 0.0478        | 4.8   | 600  | 0.0616          | 0.0616 | 0.5396 | 0.7398  |
+| 0.0454        | 5.6   | 700  | 0.0634          | 0.0634 | 0.5266 | 0.7264  |
+| 0.0429        | 6.4   | 800  | 0.0607          | 0.0607 | 0.5467 | 0.7404  |
+| 0.0422        | 7.2   | 900  | 0.0615          | 0.0615 | 0.5405 | 0.7429  |
+| 0.0421        | 8.0   | 1000 | 0.0622          | 0.0622 | 0.5353 | 0.7338  |
+| 0.0423        | 8.8   | 1100 | 0.0610          | 0.0610 | 0.5446 | 0.7424  |
+| 0.0485        | 9.6   | 1200 | 0.0610          | 0.0610 | 0.5445 | 0.7416  |
+### Framework versions
+- Transformers 4.53.3
+- Pytorch 2.9.0+cu128
+- Datasets 3.3.2
+- Tokenizers 0.21.4

all_results.json ADDED Viewed

	@@ -0,0 +1,10 @@

+{
+    "epoch": 10.0,
+    "eval_loss": 0.060941558331251144,
+    "eval_mse": 0.060941558331251144,
+    "eval_pearson": 0.7406131625175476,
+    "eval_r2": 0.5447108745574951,
+    "eval_runtime": 3.9474,
+    "eval_samples_per_second": 50.667,
+    "eval_steps_per_second": 6.333
+}

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f61a8a1394e9321be93513550c0b8a8bb6245230616671b4217b22d28fb31a9e
 size 3593828

 version https://git-lfs.github.com/spec/v1
+oid sha256:f45d244b4f6c68bfc1bc872993d1a78c00c08eb1fc5d15aad6b8d42b0e672063
 size 3593828

runs/Nov18_20-34-35_azazia/events.out.tfevents.1763498075.azazia.69531.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6b80481b596862de24dc40fee0c71c57a4b518b5f0700f6aabc365b046e90c99
-size 11766

 version https://git-lfs.github.com/spec/v1
+oid sha256:a6d097e25ce3005be6fb8d9352fbf7354a8f2bf32d9c3db2aabcd159c7ba5139
+size 15679

runs/Nov18_20-34-35_azazia/events.out.tfevents.1763498395.azazia.69531.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:ce962ee5d26073ef785631016e1d59a20732dd0ac3c3c35c53a332f5180e5a9b
+size 918

test_results.json ADDED Viewed

	@@ -0,0 +1,10 @@

+{
+    "epoch": 10.0,
+    "eval_loss": 0.060941558331251144,
+    "eval_mse": 0.060941558331251144,
+    "eval_pearson": 0.7406131625175476,
+    "eval_r2": 0.5447108745574951,
+    "eval_runtime": 3.9474,
+    "eval_samples_per_second": 50.667,
+    "eval_steps_per_second": 6.333
+}