chaley22 committed on
Commit 1c2b88a · verified · 1 Parent(s): fa35132

End of training

README.md CHANGED
@@ -9,12 +9,12 @@ model-index:
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->

- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/colemanhaley/pali-captioning-lm-sweep/runs/m2h9av6c)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/colemanhaley/pali-captioning-lm-sweep/runs/93x0gbhc)
  # pali-captioning-lm-sweep

  This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: nan
+ - Loss: 2.9665

  ## Model description

@@ -33,7 +33,7 @@ More information needed
  ### Training hyperparameters

  The following hyperparameters were used during training:
- - learning_rate: 0.1
+ - learning_rate: 0.0001
  - train_batch_size: 8
  - eval_batch_size: 4
  - seed: 42
@@ -47,13 +47,13 @@ The following hyperparameters were used during training:
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:------:|:-----:|:---------------:|
  | No log | 0 | 0 | 10.3383 |
- | 0.0 | 0.0040 | 10000 | nan |
- | 0.0 | 0.0081 | 20000 | nan |
- | 0.0 | 0.0121 | 30000 | nan |
- | 0.0 | 0.0162 | 40000 | nan |
- | 0.0 | 0.0202 | 50000 | nan |
- | 0.0 | 0.0242 | 60000 | nan |
- | 0.0 | 0.0283 | 70000 | nan |
+ | 1.9493 | 0.0040 | 10000 | 3.4608 |
+ | 1.8267 | 0.0081 | 20000 | 3.3065 |
+ | 1.6539 | 0.0121 | 30000 | 3.2140 |
+ | 1.6238 | 0.0162 | 40000 | 3.1555 |
+ | 1.5725 | 0.0202 | 50000 | 3.0853 |
+ | 1.4285 | 0.0242 | 60000 | 3.0110 |
+ | 1.4442 | 0.0283 | 70000 | 2.9665 |


  ### Framework versions
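
The updated results table can be sanity-checked with a short script. The following is a minimal sketch, not part of the commit: the validation losses are copied from the table in the README.md diff, while `val_loss`, `loss_deltas`, and `monotonic` are hypothetical names introduced only for illustration.

```python
# Validation losses copied from the updated training table (steps 0 to 70000).
val_loss = {
    0: 10.3383,
    10000: 3.4608,
    20000: 3.3065,
    30000: 3.2140,
    40000: 3.1555,
    50000: 3.0853,
    60000: 3.0110,
    70000: 2.9665,
}


def loss_deltas(history):
    """Return (step, loss, change since the previous evaluation) tuples."""
    steps = sorted(history)
    return [(s, history[s], history[s] - history[p]) for p, s in zip(steps, steps[1:])]


deltas = loss_deltas(val_loss)
for step, loss, delta in deltas:
    print(f"step {step:>6}: val_loss={loss:.4f} (change {delta:+.4f})")

# True when validation loss fell at every logged evaluation.
monotonic = all(delta < 0 for _, _, delta in deltas)
print("monotonically decreasing:", monotonic)
```

The same check run on the earlier run's table would fail immediately, since every logged validation loss after step 0 was `nan`.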
model-00001-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:d7209cb2663689c67aa5841f345791f047f941a929191e048da25244702392e2
+ oid sha256:896cfd6810f6407fa67d4d73970d9bed5a2815c6fc7065362893dfa31cc2ec88
  size 4921596664
model-00002-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:fc4ad51a0432fd6ae27571e0999b0d99e8bbcf3bdfd1101cb93e446ca927c881
+ oid sha256:3c2f0f450259d85988d2a0a219d1302b2b2a978d9756e35a3f4ab74b9e12a491
  size 4978830584
model-00003-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:280a4b9ac96c4ca2143cb0fae3c4441361d26c3e02fcb306a366a3d2a48aeab0
+ oid sha256:4f1eafc957930cbaae7388b79956cec227873538880eef5cb40b55067387eb25
  size 134242760
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:d24310eeccaf7b8951e9b7e4c015ab191da00a70fa36c5d21ab1f1e893c82ad6
+ oid sha256:e23a56e9733e2bf06927df3e7ac62cfbd891f9085c26302d310fdc043f50bc2b
  size 5176
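
The binary files in this commit are stored as Git LFS pointers, each recording a SHA-256 `oid` and a byte `size`. The sketch below shows one way a downloaded file could be checked against such a pointer. It assumes the three-line pointer layout shown in the diffs above; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of Git LFS or any Hugging Face library.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Check a local file's size and SHA-256 digest against a parsed pointer."""
    hasher = hashlib.sha256()
    size = 0
    with open(path, "rb") as f:
        # Read in chunks so multi-gigabyte shards do not need to fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# Pointer text for the updated training_args.bin, copied from the diff above.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:e23a56e9733e2bf06927df3e7ac62cfbd891f9085c26302d310fdc043f50bc2b\n"
    "size 5176\n"
)
```

Calling `matches_pointer("training_args.bin", pointer)` on a local copy of the file would then return `True` only if both the size and the digest agree with the pointer.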