AswanthCManoj committed on
Commit cea848b · verified · 1 Parent(s): 0586792

azma-OpenHermes-2.5-Mistral-7B-agent-v1

Files changed (2)
  1. README.md +6 -6
  2. adapter_model.safetensors +1 -1
README.md CHANGED
@@ -1,11 +1,11 @@
  ---
- license: other
+ license: apache-2.0
  library_name: peft
  tags:
  - trl
  - sft
  - generated_from_trainer
- base_model: deepseek-ai/deepseek-coder-1.3b-instruct
+ base_model: teknium/OpenHermes-2.5-Mistral-7B
  model-index:
  - name: results
  results: []
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  # results
 
- This model is a fine-tuned version of [deepseek-ai/deepseek-coder-1.3b-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-1.3b-instruct) on the None dataset.
+ This model is a fine-tuned version of [teknium/OpenHermes-2.5-Mistral-7B](https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B) on the None dataset.
 
  ## Model description
 
@@ -36,10 +36,10 @@ More information needed
 
  The following hyperparameters were used during training:
  - learning_rate: 0.0002
- - train_batch_size: 4
- - eval_batch_size: 4
+ - train_batch_size: 2
+ - eval_batch_size: 2
  - seed: 42
- - gradient_accumulation_steps: 2
+ - gradient_accumulation_steps: 4
  - total_train_batch_size: 8
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: cosine
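
Note on the hyperparameter change: `train_batch_size` drops from 4 to 2 while `gradient_accumulation_steps` rises from 2 to 4, and `total_train_batch_size` stays at 8. The sketch below checks that arithmetic. It assumes a single training device, which the README does not state; `effective_batch_size` and `num_devices` are hypothetical names used only for illustration.

```python
def effective_batch_size(per_device_batch_size, gradient_accumulation_steps, num_devices=1):
    """Number of samples contributing to one optimizer step.

    num_devices=1 is an assumption; the README does not list a device count.
    """
    return per_device_batch_size * gradient_accumulation_steps * num_devices


# Values before this commit: train_batch_size=4, gradient_accumulation_steps=2
old_total = effective_batch_size(4, 2)

# Values after this commit: train_batch_size=2, gradient_accumulation_steps=4
new_total = effective_batch_size(2, 4)

print(old_total, new_total)
```

Under that assumption both configurations give the same effective batch per optimizer step; the new one trades a smaller per-step memory footprint for more forward/backward passes per update.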
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:868e618f88134d6eeb82f017200eebe19d9eed73faacf59d9f24a643787abb67
+ oid sha256:a27886c0f7303783f3839d529b8c52466fb87acf68a9d1fe9ff96bff644ef538
  size 637592768
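
Note on the pointer change: `adapter_model.safetensors` is stored as a Git LFS pointer, so the diff shows only the `oid` and `size` fields rather than the weights themselves. A minimal sketch of reading the two pointers and comparing them follows; `parse_lfs_pointer` is a hypothetical helper written for this note, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    hash_algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the old and new sides of the diff above.
OLD_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:868e618f88134d6eeb82f017200eebe19d9eed73faacf59d9f24a643787abb67
size 637592768
"""
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:a27886c0f7303783f3839d529b8c52466fb87acf68a9d1fe9ff96bff644ef538
size 637592768
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)

weights_changed = old["digest"] != new["digest"]  # different hash: file contents differ
same_size = old["size"] == new["size"]            # byte length is unchanged

print(weights_changed, same_size)
```

A different SHA-256 digest with an identical size indicates the file contents were replaced while the byte length stayed the same.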