Muhammad-Shaheer/FinetunedLAMAtoR1-001-3B

Text Generation

chain-of-thought


Muhammad-Shaheer committed 27 days ago

Commit 155ebc2 · verified · 1 Parent(s): b86a247

updated modelcard

Files changed (1)

README.md +20 -1

README.md CHANGED

@@ -19,7 +19,26 @@ language:
 # Model Card for FinetunedLAMAtoR1-001-3B
 ## Model Details
+## Technical Specifications
+### Model Architecture and Objective
+- **Base Model:** Llama-3.2-3B-Instruct
+- **Architecture:** Causal Decoder-Only Transformer
+- **Hidden Size:** 3072
+- **Layers:** 28
+- **Attention Heads:** 24
+- **Parameters:** ~3.21B (loaded with 4-bit quantization)
+- **Precision:** Float16 compute dtype (LoRA training and inference)
+### Compute Infrastructure
+- **Hardware:** Tesla T4 GPU (Google Colab)
+- **VRAM Usage:** ~2.24 GB (Model) + Training Overhead
+- **Quantization:** 4-bit (QLoRA) via `bitsandbytes`
+### Model Weights
+- **Type:** LoRA Adapter (PEFT)
+- **Adapter File Size:** ~92 MB
+- **Total Saved Size:** ~108 MB
 ### Model Description
 This model is a fine-tuned version of **[unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct)** designed to mimic reflective, human-like stream-of-consciousness reasoning. It was trained using **[Unsloth](https://github.com/unslothai/unsloth)** on the **[ServiceNow-AI/R1-Distill-SFT](https://huggingface.co/datasets/ServiceNow-AI/R1-Distill-SFT)** dataset.
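
The figures added under Technical Specifications can be sanity-checked with a little arithmetic. The sketch below derives the per-head dimension from the listed hidden size and head count, and gives a rough estimate of the 4-bit weight footprint for comparison with the reported ~2.24 GB. It rests on assumptions that are not in the model card: a vocabulary size of 128,256 (the Llama 3 tokenizer), an embedding matrix that stays in 16-bit and is shared with the output head, and no allowance for quantization constants or normalization weights. The helper names `head_dim` and `estimate_weight_gb` are hypothetical.

```python
# Back-of-the-envelope check of the figures listed in the model card.
HIDDEN_SIZE = 3072     # from the card
NUM_HEADS = 24         # from the card
NUM_PARAMS = 3.21e9    # from the card (approximate)
VOCAB_SIZE = 128_256   # assumption: Llama 3 tokenizer vocabulary, not stated in the card


def head_dim(hidden_size: int, num_heads: int) -> int:
    """Return the per-head dimension, requiring an even split across heads."""
    if hidden_size % num_heads != 0:
        raise ValueError("hidden_size must be divisible by num_heads")
    return hidden_size // num_heads


def estimate_weight_gb(
    num_params: float,
    vocab_size: int,
    hidden_size: int,
    quant_bits: int = 4,
    embed_bits: int = 16,
) -> float:
    """Rough weight footprint in GB (1e9 bytes).

    Assumes the embedding matrix is kept at embed_bits and every other
    parameter is stored at quant_bits. Quantization constants and
    activation memory are ignored.
    """
    embed_params = vocab_size * hidden_size
    other_params = num_params - embed_params
    total_bytes = embed_params * embed_bits / 8 + other_params * quant_bits / 8
    return total_bytes / 1e9


if __name__ == "__main__":
    print("head dimension:", head_dim(HIDDEN_SIZE, NUM_HEADS))
    print("estimated weights (GB):",
          round(estimate_weight_gb(NUM_PARAMS, VOCAB_SIZE, HIDDEN_SIZE), 2))
```

Under these assumptions the estimate comes out at roughly 2.2 GB, in the same range as the reported model footprint. The remaining gap is plausibly quantization metadata and unit differences, though the card does not say how the figure was measured.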