HarshalH commited on
Commit
e098f90
·
verified ·
1 Parent(s): 5acd241

HarshalH/OnlySFT

Browse files
README.md CHANGED
@@ -3,9 +3,11 @@ library_name: transformers
3
  license: mit
4
  base_model: gpt2
5
  tags:
 
 
6
  - generated_from_trainer
7
  datasets:
8
- - scitldr
9
  model-index:
10
  - name: output_dir
11
  results: []
@@ -16,9 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  # output_dir
18
 
19
- This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the scitldr dataset.
20
- It achieves the following results on the evaluation set:
21
- - Loss: 3.3648
22
 
23
  ## Model description
24
 
@@ -37,22 +37,13 @@ More information needed
37
  ### Training hyperparameters
38
 
39
  The following hyperparameters were used during training:
40
- - learning_rate: 2e-05
41
- - train_batch_size: 16
42
  - eval_batch_size: 16
43
  - seed: 42
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
46
- - num_epochs: 3.0
47
-
48
- ### Training results
49
-
50
- | Training Loss | Epoch | Step | Validation Loss |
51
- |:-------------:|:-----:|:----:|:---------------:|
52
- | No log | 1.0 | 125 | 3.4195 |
53
- | No log | 2.0 | 250 | 3.3752 |
54
- | No log | 3.0 | 375 | 3.3648 |
55
-
56
 
57
  ### Framework versions
58
 
 
3
  license: mit
4
  base_model: gpt2
5
  tags:
6
+ - trl
7
+ - sft
8
  - generated_from_trainer
9
  datasets:
10
+ - generator
11
  model-index:
12
  - name: output_dir
13
  results: []
 
18
 
19
  # output_dir
20
 
21
+ This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the generator dataset.
 
 
22
 
23
  ## Model description
24
 
 
37
  ### Training hyperparameters
38
 
39
  The following hyperparameters were used during training:
40
+ - learning_rate: 0.0002
41
+ - train_batch_size: 8
42
  - eval_batch_size: 16
43
  - seed: 42
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
46
+ - num_epochs: 1
 
 
 
 
 
 
 
 
 
47
 
48
  ### Framework versions
49
 
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4976df38c10136e8a181e491a9238313ff312b3b7bc70932fa975447e1a3f752
3
  size 497774208
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b0e18df8c7682f02febea78168fda466314838159a4cb81ad55f36a26fb89d5d
3
  size 497774208
runs/Oct02_00-41-28_1bd55d887019/events.out.tfevents.1727829691.1bd55d887019.30.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4ca3e5fc09f2f9b69cbb5be0f89e5bcdb2472cf49f2313ffb5b479d8132303ed
3
+ size 4184
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c598273ea12d192c47f16251e35c9978e0c43ecc582ecfeab217a4e4bb52e0e2
3
- size 5240
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c1eb340bde088335e0b005c4962b0d7d6a01d7ff441befe68f2a4dcb65d35516
3
+ size 5496