JuIm commited on
Commit
7dade35
·
verified ·
1 Parent(s): e77bd93

End of training

Browse files
README.md CHANGED
@@ -12,12 +12,37 @@ should probably proofread and complete it, then remove this comment. -->
12
 
13
  # ProGemma2
14
 
15
- Identical to JuIm/ProGemma, save for 2 details:
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
16
 
17
- 1. The model is slightly larger at 336M parameters vs 225M
18
- 2. The training rate is 1e-3
19
 
20
- Current training loss is 2.04, with only 20% of the training data being used thus far (1st epoch), which is a marked improvement versus the original ProGemma's training loss at this point in the dataset.
21
 
22
  ### Framework versions
23
 
 
12
 
13
  # ProGemma2
14
 
15
+ This model is a fine-tuned version of [JuIm/ProGemma2](https://huggingface.co/JuIm/ProGemma2) on an unknown dataset.
16
+
17
+ ## Model description
18
+
19
+ More information needed
20
+
21
+ ## Intended uses & limitations
22
+
23
+ More information needed
24
+
25
+ ## Training and evaluation data
26
+
27
+ More information needed
28
+
29
+ ## Training procedure
30
+
31
+ ### Training hyperparameters
32
+
33
+ The following hyperparameters were used during training:
34
+ - learning_rate: 0.001
35
+ - train_batch_size: 2
36
+ - eval_batch_size: 8
37
+ - seed: 42
38
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
39
+ - lr_scheduler_type: linear
40
+ - lr_scheduler_warmup_ratio: 0.4
41
+ - training_steps: 3500
42
+
43
+ ### Training results
44
 
 
 
45
 
 
46
 
47
  ### Framework versions
48
 
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:524819aa1c4139d2aef89e1a15459a7fd5a19fba72abd7151aed2ef75ea93b49
3
  size 1342562152
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:468303eca8c6b8cc95b5b506de75a9934a4971f83002cbb3ef238aad519560d3
3
  size 1342562152
runs/Aug30_19-15-33_6ed2723a8973/events.out.tfevents.1725045337.6ed2723a8973.330.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:637a8d0cb88097ef2f28663485c23f7237998ae60f22f88c0ee998414d917edc
3
+ size 743330
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f82693a5bddefae6d93466903dd3b93b7a2eae558a42057b5c4581f02fb4525a
3
  size 5112
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8a9392bea1909c65e815de7e13807a98aef0e6419fc77ed4d9cf80f5952c4f30
3
  size 5112