DeepDream2045 committed on
Commit 3971683 · verified · 1 Parent(s): b6ad951

End of training

Files changed (2)
  1. README.md +3 -3
  2. adapter_model.bin +1 -1
README.md CHANGED
@@ -105,7 +105,7 @@ xformers_attention: true

  This model is a fine-tuned version of [JackFram/llama-68m](https://huggingface.co/JackFram/llama-68m) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.7643
+ - Loss: 0.7738

  ## Model description

@@ -143,8 +143,8 @@ The following hyperparameters were used during training:
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:------:|:----:|:---------------:|
  | 1.7304 | 0.0181 | 1 | 3.1888 |
- | 1.4634 | 0.4535 | 25 | 0.9610 |
- | 1.0176 | 0.9070 | 50 | 0.7643 |
+ | 1.4272 | 0.4535 | 25 | 0.9794 |
+ | 1.0303 | 0.9070 | 50 | 0.7738 |


  ### Framework versions
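
To make the size of the change in the README hunks above easier to read, here is a minimal sketch that computes the absolute and relative shift in validation loss between the two commits. The numbers are copied from the diff; the names `val_loss` and `loss_deltas` are hypothetical helpers introduced only for this illustration.

```python
# Validation-loss values copied from the README diff: step -> (old, new).
val_loss = {
    1: (3.1888, 3.1888),
    25: (0.9610, 0.9794),
    50: (0.7643, 0.7738),
}


def loss_deltas(rows):
    """Return {step: (absolute change, relative change in percent)}."""
    out = {}
    for step, (old, new) in rows.items():
        delta = new - old
        out[step] = (round(delta, 4), round(100 * delta / old, 2))
    return out


deltas = loss_deltas(val_loss)
for step, (abs_d, rel_d) in deltas.items():
    print(f"step {step:>2}: {abs_d:+.4f} ({rel_d:+.2f}%)")
```

A positive value means the validation loss reported in this commit is higher than in the parent commit at the same step.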
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:bcd40e360ccfb92a28b2b9be4932c3da4e983b4d63e822ed3f59aa7d2f196712
+ oid sha256:962a911ea7dc10ae200c7d27810348816e301432464c0b1e7b700804d7bd1517
  size 4532162
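
The `adapter_model.bin` hunk changes only a Git LFS pointer: the `oid` line carries the SHA-256 digest of the real file and `size` its length in bytes. As a rough sketch, a downloaded copy could be checked against the new pointer as follows. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local file path in the usage note is an assumption.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS v1 pointer file."""
    fields = dict(
        line.strip().split(" ", 1) for line in text.splitlines() if line.strip()
    )
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer):
    """Compare a local file's size and SHA-256 digest with a parsed pointer."""
    data = Path(path).read_bytes()  # reads the whole file; fine for a few MB
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# Pointer contents copied from the "+" side of the diff.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:962a911ea7dc10ae200c7d27810348816e301432464c0b1e7b700804d7bd1517
size 4532162
"""

pointer = parse_lfs_pointer(NEW_POINTER)
```

With the adapter downloaded to the working directory (an assumed location), `matches_pointer("adapter_model.bin", pointer)` should return `True` only if the file is the one this commit points to. The unchanged `size` shows that the retrained adapter has the same byte length as before, so only the digest distinguishes the two versions.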