Arotte committed on
Commit 870fb52 · 1 Parent(s): 4f09244

Model update

New model files after re-training.

README.md CHANGED
@@ -1,6 +1,52 @@
  ---
  language:
  - en
+ metrics:
+ - accuracy
+ model-index:
+ - name: opt-small-mt
+   results: []
  ---
  
- OPT small fine-tuned on the MapTask corpus. More details soon.
+ # opt-small-mt
+
+ This model is a fine-tuned version of [facebook/opt-125m](https://huggingface.co/facebook/opt-125m) on the MapTask dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 2.2725
+ - Accuracy: 0.5641
+
+ ## Model description
+
+ More details soon.
+
+ ## Intended uses & limitations
+
+ More details soon.
+
+ ## Training and evaluation data
+
+ More details soon.
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 5e-05
+ - train_batch_size: 5
+ - eval_batch_size: 5
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - num_epochs: 20.0
+
+ ### Training results
+
+
+
+ ### Framework versions
+
+ - Transformers 4.26.0
+ - Pytorch 1.13.1
+ - Datasets 2.9.0
+ - Tokenizers 0.13.0.dev0
all_results.json ADDED
@@ -0,0 +1,15 @@
+ {
+     "epoch": 20.0,
+     "eval_accuracy": 0.5641330480040158,
+     "eval_loss": 2.272531270980835,
+     "eval_runtime": 2.2768,
+     "eval_samples": 37,
+     "eval_samples_per_second": 16.251,
+     "eval_steps_per_second": 3.514,
+     "perplexity": 9.703933036217537,
+     "train_loss": 1.2078121283115484,
+     "train_runtime": 699.037,
+     "train_samples": 193,
+     "train_samples_per_second": 5.522,
+     "train_steps_per_second": 1.116
+ }
eval_results.json ADDED
@@ -0,0 +1,10 @@
+ {
+     "epoch": 20.0,
+     "eval_accuracy": 0.5641330480040158,
+     "eval_loss": 2.272531270980835,
+     "eval_runtime": 2.2768,
+     "eval_samples": 37,
+     "eval_samples_per_second": 16.251,
+     "eval_steps_per_second": 3.514,
+     "perplexity": 9.703933036217537
+ }
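
The `perplexity` value in eval_results.json looks consistent with the usual definition, the exponential of the mean evaluation cross-entropy loss. The following is a minimal sketch that checks this, assuming `eval_loss` is a mean per-token loss in nats; the variable names are illustrative only.

```python
import math

# Value copied from eval_results.json in this commit.
eval_loss = 2.272531270980835

# Assumption: eval_loss is the mean per-token cross-entropy in nats,
# so perplexity is simply exp(loss).
perplexity = math.exp(eval_loss)

print(f"perplexity = {perplexity:.4f}")
```

The result should agree with the reported `perplexity` field to within floating-point precision.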
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:d61b14964b5ec78eda011ff866378e18896e4b1189ae401c01a199877f6ba4ee
+ oid sha256:c72ccff764b0e477bcc87cfeea0d1fcac7eb3d875e72d3e1a829d18f1841fcc8
  size 501023389
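
The pytorch_model.bin entry is a Git LFS pointer rather than the weights themselves: three `key value` lines giving the spec version, the content hash, and the byte size. As a rough sketch, the pointer can be parsed and the two revisions compared as follows; `parse_lfs_pointer` is a hypothetical helper written for this illustration, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid line has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the pytorch_model.bin diff in this commit.
old = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:d61b14964b5ec78eda011ff866378e18896e4b1189ae401c01a199877f6ba4ee
size 501023389
""")
new = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:c72ccff764b0e477bcc87cfeea0d1fcac7eb3d875e72d3e1a829d18f1841fcc8
size 501023389
""")

weights_changed = old["digest"] != new["digest"]
same_size = old["size"] == new["size"]
print(f"weights changed: {weights_changed}, same size: {same_size}")
```

A changed digest with an identical size is what one would expect when the same architecture is re-trained and saved in the same format, although the pointer alone cannot confirm that.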
train_results.json ADDED
@@ -0,0 +1,8 @@
+ {
+     "epoch": 20.0,
+     "train_loss": 1.2078121283115484,
+     "train_runtime": 699.037,
+     "train_samples": 193,
+     "train_samples_per_second": 5.522,
+     "train_steps_per_second": 1.116
+ }
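
The throughput figures in train_results.json can be reproduced from the other fields. The sketch below assumes that samples per second counts every sample once per epoch and that the step count is the `global_step` of 780 recorded in trainer_state.json in this commit; the variable names are illustrative.

```python
# Values copied from train_results.json and trainer_state.json in this commit.
train_samples = 193
num_epochs = 20
train_runtime = 699.037  # seconds
global_steps = 780

# Assumption: each sample is seen once per epoch.
samples_per_second = train_samples * num_epochs / train_runtime
steps_per_second = global_steps / train_runtime

print(f"train_samples_per_second ~ {samples_per_second:.3f}")
print(f"train_steps_per_second   ~ {steps_per_second:.3f}")
```

Rounded to three decimals, both values should match the reported 5.522 and 1.116.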
trainer_state.json ADDED
@@ -0,0 +1,31 @@
+ {
+   "best_metric": null,
+   "best_model_checkpoint": null,
+   "epoch": 20.0,
+   "global_step": 780,
+   "is_hyper_param_search": false,
+   "is_local_process_zero": true,
+   "is_world_process_zero": true,
+   "log_history": [
+     {
+       "epoch": 12.82,
+       "learning_rate": 1.794871794871795e-05,
+       "loss": 1.4127,
+       "step": 500
+     },
+     {
+       "epoch": 20.0,
+       "step": 780,
+       "total_flos": 2017174487040000.0,
+       "train_loss": 1.2078121283115484,
+       "train_runtime": 699.037,
+       "train_samples_per_second": 5.522,
+       "train_steps_per_second": 1.116
+     }
+   ],
+   "max_steps": 780,
+   "num_train_epochs": 20,
+   "total_flos": 2017174487040000.0,
+   "trial_name": null,
+   "trial_params": null
+ }
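
The step count and the logged learning rate in trainer_state.json follow from the README hyperparameters. The sketch below assumes a single device, no gradient accumulation, and a linear schedule with zero warmup steps; none of these is stated in the commit, and `linear_lr` is a hypothetical helper that re-implements the schedule by hand rather than calling any library.

```python
import math

# Values copied from the README hyperparameters and train_results.json.
train_samples = 193
batch_size = 5
num_epochs = 20
base_lr = 5e-05

# Assumption: one device and no gradient accumulation, so the last
# partial batch of each epoch still counts as one optimizer step.
steps_per_epoch = math.ceil(train_samples / batch_size)
max_steps = steps_per_epoch * num_epochs


def linear_lr(step, warmup_steps=0):
    """Linear decay from base_lr to zero (hypothetical helper, zero warmup assumed)."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    return base_lr * max(0, max_steps - step) / max(1, max_steps - warmup_steps)


lr_500 = linear_lr(500)
epoch_500 = 500 / steps_per_epoch

print(f"steps_per_epoch = {steps_per_epoch}, max_steps = {max_steps}")
print(f"lr at step 500 = {lr_500:.6e}, epoch at step 500 = {epoch_500:.2f}")
```

Under these assumptions the numbers line up with the log entry at step 500 (epoch 12.82, learning rate about 1.7949e-05) and with `max_steps` of 780.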
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:19040fa29db60f3074a55dea25f3f119e5f12ec624834f0c8e2ed0db92b4c94f
- size 3579
+ oid sha256:f7761ea77e189065576a114d2dae4b01cf2ccd9c7baff2953bd4e482e0d25921
+ size 3515