JuIm commited on
Commit
48862dc
·
verified ·
1 Parent(s): f1f9b30

End of training

Browse files
README.md CHANGED
@@ -1,8 +1,51 @@
1
  ---
2
- library_name: transformers
3
- tags: []
 
 
 
 
4
  ---
5
 
6
- # Model Card for Model ID
 
7
 
8
- This is nothing
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ base_model: JuIm/ProGemma2
3
+ tags:
4
+ - generated_from_trainer
5
+ model-index:
6
+ - name: ProGemma2
7
+ results: []
8
  ---
9
 
10
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
11
+ should probably proofread and complete it, then remove this comment. -->
12
 
13
+ # ProGemma2
14
+
15
+ This model is a fine-tuned version of [JuIm/ProGemma2](https://huggingface.co/JuIm/ProGemma2) on an unknown dataset.
16
+
17
+ ## Model description
18
+
19
+ More information needed
20
+
21
+ ## Intended uses & limitations
22
+
23
+ More information needed
24
+
25
+ ## Training and evaluation data
26
+
27
+ More information needed
28
+
29
+ ## Training procedure
30
+
31
+ ### Training hyperparameters
32
+
33
+ The following hyperparameters were used during training:
34
+ - learning_rate: 0.001
35
+ - train_batch_size: 2
36
+ - eval_batch_size: 8
37
+ - seed: 42
38
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
39
+ - lr_scheduler_type: linear
40
+ - lr_scheduler_warmup_ratio: 0.4
41
+ - training_steps: 3500
42
+
43
+ ### Training results
44
+
45
+
46
+
47
+ ### Framework versions
48
+
49
+ - Transformers 4.42.4
50
+ - Pytorch 2.3.1+cu121
51
+ - Tokenizers 0.19.1
config.json CHANGED
@@ -1,4 +1,5 @@
1
  {
 
2
  "architectures": [
3
  "Gemma2ForCausalLM"
4
  ],
 
1
  {
2
+ "_name_or_path": "JuIm/ProGemma2",
3
  "architectures": [
4
  "Gemma2ForCausalLM"
5
  ],
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3974a8ac95ebc1bf758b8eb9c9c4918061fcea899abe57084eb4362ac5539a41
3
  size 1342562152
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7b2ef1306ff93312b0e88fc03f3b1f783ced72bbfaa8ccd31999138a0269a268
3
  size 1342562152
runs/Aug24_23-34-34_90b50575a410/events.out.tfevents.1724542481.90b50575a410.351.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1fe9d9eeba615399b99ba26bf01f3f4a6e5b8217c66fab7eeba0bf9f734651de
3
+ size 743330
training_args.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3ea53c7e5cdb50f04ed12b33e3210d185e73e4922fe53355fcbdda3c4a56a8b8
3
+ size 5112