Shunchang commited on
Commit
c6ea39b
·
verified ·
1 Parent(s): 8fa85b5

End of training

Browse files
README.md CHANGED
@@ -34,11 +34,11 @@ More information needed
34
 
35
  The following hyperparameters were used during training:
36
  - learning_rate: 0.0001
37
- - train_batch_size: 8
38
  - eval_batch_size: 8
39
  - seed: 42
40
  - gradient_accumulation_steps: 4
41
- - total_train_batch_size: 32
42
  - optimizer: Use OptimizerNames.PAGED_ADAMW_8BIT with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
43
  - lr_scheduler_type: linear
44
  - lr_scheduler_warmup_steps: 10
 
34
 
35
  The following hyperparameters were used during training:
36
  - learning_rate: 0.0001
37
+ - train_batch_size: 4
38
  - eval_batch_size: 8
39
  - seed: 42
40
  - gradient_accumulation_steps: 4
41
+ - total_train_batch_size: 16
42
  - optimizer: Use OptimizerNames.PAGED_ADAMW_8BIT with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
43
  - lr_scheduler_type: linear
44
  - lr_scheduler_warmup_steps: 10
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5b151357df734ef23aae8fa069e9fd7cf9093cbdceaa01d708743a90afc5326f
3
  size 42220792
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1ba5aae9f7a384123b57583ebea15eeb4ccffdb5024cbec3ca9001388991b3d3
3
  size 42220792
runs/May26_03-50-47_e521ce3f4686/events.out.tfevents.1748231464.e521ce3f4686.1375.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ec25291c7a378cbae621d035cdf687f1627f6267df298b6bd100103c713ea1fd
3
+ size 13302
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ea22801e2421c99fd8a0673bad7cd46cd402e06f16c568688a4b7886887a4461
3
  size 5368
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fcfc5e145d0ce28eb487cc2819efdb986635d2a53fd56ed62c09e04d0402781f
3
  size 5368