GODELEV
/

Generator3B-V0.1

Text Generation

text-generation-inference

Model card Files Files and versions

GODELEV commited on Jan 2

Commit

0f6d1b9

·

verified ·

1 Parent(s): 00aa281

Update README.md

Files changed (1) hide show

README.md +6 -3

README.md CHANGED Viewed

@@ -1,6 +1,7 @@
 ---
 license: apache-2.0
-base_model: HuggingFaceTB/SmolLM3-3B-Instruct
 tags:
 - trl
 - unsloth
@@ -9,11 +10,13 @@ tags:
 - transformers
 library_name: transformers
 model_name: Generator3B-V0.1
 ---
 # Generator3B-V0.1
-This model is a fine-tuned version of [HuggingFaceTB/SmolLM3-3B-Instruct](https://huggingface.co/HuggingFaceTB/SmolLM3-3B-Instruct).
 It was trained using **Unsloth** and the **SFTTrainer** on a specialized 'Golden Dataset' for high-quality instruction following.
 ## Model Details
@@ -28,4 +31,4 @@ The model was trained with maximum speed optimizations including:
 - **Sequence Packing**: Enabled
 - **4-bit Quantization**: Bitsandbytes
 - **LoRA Rank**: 64
-- **Optimizers**: AdamW 8-bit

 ---
 license: apache-2.0
+base_model:
+- HuggingFaceTB/SmolLM3-3B
 tags:
 - trl
 - unsloth
 - transformers
 library_name: transformers
 model_name: Generator3B-V0.1
+datasets:
+- GODELEV/Golden-Dataset-Beta3
 ---
 # Generator3B-V0.1
+This model is a fine-tuned version of [HuggingFaceTB/SmolLM3-3B](https://huggingface.co/HuggingFaceTB/SmolLM3-3B).
 It was trained using **Unsloth** and the **SFTTrainer** on a specialized 'Golden Dataset' for high-quality instruction following.
 ## Model Details
 - **Sequence Packing**: Enabled
 - **4-bit Quantization**: Bitsandbytes
 - **LoRA Rank**: 64
+- **Optimizers**: AdamW 8-bit