Ba2han
/

model-sft-4096

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

Ba2han commited on Dec 9, 2025

Commit

3251b0c

·

verified ·

1 Parent(s): 915a5bc

Training in progress, step 4155

Files changed (4) hide show

README.md +9 -8
config.json +2 -2
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -1,17 +1,18 @@
 ---
 library_name: transformers
 model_name: model-sft-4096
 tags:
 - generated_from_trainer
-- trl
 - unsloth
 - sft
 licence: license
 ---
 # Model Card for model-sft-4096
-This model is a fine-tuned version of [None](https://huggingface.co/None).
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start
@@ -27,18 +28,18 @@ print(output["generated_text"])
 ## Training procedure
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/batuhan409/huggingface/runs/smjyp32k)
 This model was trained with SFT.
 ### Framework versions
-- TRL: 0.24.0
-- Transformers: 4.57.2
-- Pytorch: 2.9.1
-- Datasets: 4.3.0
-- Tokenizers: 0.22.1
 ## Citations

 ---
+base_model: Ba2han/model-phase2-4096
 library_name: transformers
 model_name: model-sft-4096
 tags:
 - generated_from_trainer
 - unsloth
+- trl
 - sft
 licence: license
 ---
 # Model Card for model-sft-4096
+This model is a fine-tuned version of [Ba2han/model-phase2-4096](https://huggingface.co/Ba2han/model-phase2-4096).
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start
 ## Training procedure
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/batuhan409/huggingface/runs/xv3hwg0z)
 This model was trained with SFT.
 ### Framework versions
+- TRL: 0.23.0
+- Transformers: 4.56.1
+- Pytorch: 2.8.0
+- Datasets: 4.2.0
+- Tokenizers: 0.22.0
 ## Citations

config.json CHANGED Viewed

@@ -24,8 +24,8 @@
   "rope_scaling": null,
   "rope_theta": 100000.0,
   "tie_word_embeddings": true,
-  "transformers_version": "4.57.2",
-  "unsloth_version": "2025.11.6",
   "use_cache": true,
   "vocab_size": 65536
 }

   "rope_scaling": null,
   "rope_theta": 100000.0,
   "tie_word_embeddings": true,
+  "transformers_version": "4.56.1",
+  "unsloth_version": "2025.10.10",
   "use_cache": true,
   "vocab_size": 65536
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d8e836a3ea01d6392da85a4b90ea922479a4a583a39afa2bc4f368ddb209647d
 size 1000555808

 version https://git-lfs.github.com/spec/v1
+oid sha256:e35c5fcf3cf67fc84de6ee51193bd64a5075e0209f07e3ed81e4f6580f03d40d
 size 1000555808

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:076a24e17a0bfdb2a34171d30bfaa988c52bb3a902f9f94012008c58f037ded1
 size 6289

 version https://git-lfs.github.com/spec/v1
+oid sha256:6838e9fbcc948cfbcc3712684628a08224c0c1e4ae3195c744ea4725f918f8e0
 size 6289