Ba2han commited on
Commit
1643056
·
verified ·
1 Parent(s): d681ede

Training in progress, step 2652

Browse files
Files changed (4) hide show
  1. README.md +5 -5
  2. config.json +1 -1
  3. model.safetensors +1 -1
  4. training_args.bin +1 -1
README.md CHANGED
@@ -1,18 +1,18 @@
1
  ---
2
- base_model: Ba2han/test-model-muon
3
  library_name: transformers
4
  model_name: test-model-muon-2
5
  tags:
6
  - generated_from_trainer
7
- - sft
8
  - unsloth
9
  - trl
 
10
  licence: license
11
  ---
12
 
13
  # Model Card for test-model-muon-2
14
 
15
- This model is a fine-tuned version of [Ba2han/test-model-muon](https://huggingface.co/Ba2han/test-model-muon).
16
  It has been trained using [TRL](https://github.com/huggingface/trl).
17
 
18
  ## Quick start
@@ -28,7 +28,7 @@ print(output["generated_text"])
28
 
29
  ## Training procedure
30
 
31
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/batuhan409/huggingface/runs/zqxfs73a)
32
 
33
 
34
  This model was trained with SFT.
@@ -38,7 +38,7 @@ This model was trained with SFT.
38
  - TRL: 0.23.0
39
  - Transformers: 4.57.1
40
  - Pytorch: 2.8.0
41
- - Datasets: 4.4.1
42
  - Tokenizers: 0.22.1
43
 
44
  ## Citations
 
1
  ---
2
+ base_model: Ba2han/test-model-muon-2
3
  library_name: transformers
4
  model_name: test-model-muon-2
5
  tags:
6
  - generated_from_trainer
 
7
  - unsloth
8
  - trl
9
+ - sft
10
  licence: license
11
  ---
12
 
13
  # Model Card for test-model-muon-2
14
 
15
+ This model is a fine-tuned version of [Ba2han/test-model-muon-2](https://huggingface.co/Ba2han/test-model-muon-2).
16
  It has been trained using [TRL](https://github.com/huggingface/trl).
17
 
18
  ## Quick start
 
28
 
29
  ## Training procedure
30
 
31
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/batuhan409/huggingface/runs/zoqtlr0q)
32
 
33
 
34
  This model was trained with SFT.
 
38
  - TRL: 0.23.0
39
  - Transformers: 4.57.1
40
  - Pytorch: 2.8.0
41
+ - Datasets: 4.3.0
42
  - Tokenizers: 0.22.1
43
 
44
  ## Citations
config.json CHANGED
@@ -42,7 +42,7 @@
42
  "sliding_window": null,
43
  "tie_word_embeddings": true,
44
  "transformers_version": "4.57.1",
45
- "unsloth_version": "2025.11.1",
46
  "use_cache": true,
47
  "use_sliding_window": false,
48
  "vocab_size": 151669
 
42
  "sliding_window": null,
43
  "tie_word_embeddings": true,
44
  "transformers_version": "4.57.1",
45
+ "unsloth_version": "2025.11.2",
46
  "use_cache": true,
47
  "use_sliding_window": false,
48
  "vocab_size": 151669
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5a9737706d3c133c1325aa2128a36f2977b8930e13efb6c1c9e929bf31adabba
3
  size 1111902928
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fa9a638feb7c1c56973203a1b5416a55d97adae17d7193acd1fc7f0c75109791
3
  size 1111902928
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e1d94381a6d273bb0f34c5ac3061a6bfdd6260785ddb9242bcb2b203b9514562
3
  size 6289
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:798942e6cf5bd42b5726886f1a0166dd1235b9c8beec7da61c99560d041c573b
3
  size 6289