basmala12 committed 9adc488 (verified) · 1 parent: 787ae53

Update README.md

Files changed (1):
  1. README.md (+52, -39)
README.md CHANGED
@@ -1,58 +1,71 @@
- smollm_finetuning5 – Fine-Tuned SmolLM2 Model

- smollm_finetuning5 is a fine-tuned variant of the SmolLM2-1.7B model.
- The aim of this work was to adapt the base model using a lightweight instruction-tuning approach to improve coherence, reasoning, and general instruction-following on short prompts.

- The model is provided as a merged .safetensors checkpoint, meaning the LoRA adapters were fused into the base weights after training for easier deployment.

- Dataset Used

- The model was trained on the argilla/synthetic-concise-reasoning-sft-filtered dataset.
- This dataset includes:

- instruction–response pairs
- short reasoning sequences
- synthetic tasks designed to promote clear step-by-step thinking
- concise explanations and simplified reasoning samples

- The dataset is filtered to remove excessively long or noisy examples, making it suitable for fine-tuning compact models that benefit from clean and simplified instructions.

- Training Method

- Fine-tuning was conducted using LoRA (Low-Rank Adaptation) to reduce hardware requirements and allow efficient experimentation.
- Key training characteristics:

- Base model: SmolAI / SmolLM2-1.7B
- Method: LoRA fine-tuning
- Adapters were merged after training
- Precision: FP32 safetensors
- Training epochs: 3
- Learning rate: 2e-4
- Chat template included in the repository (chat_template.jinja)

- The fine-tuning process focused on improving instruction clarity and response structure rather than domain specialization.

- Model Files

- The repository contains all necessary files to load the model with standard tooling:

- model.safetensors (merged model weights)
- config.json
- generation_config.json
- tokenizer.json, vocab.json, merges.txt
- special_tokens_map.json
- chat_template.jinja
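
Since the adapters are already fused into the checkpoint, the files listed above load with stock `transformers`; no `peft` step is needed at inference time. A minimal loading sketch, assuming the Hub repo id is `basmala12/smollm_finetuning5` (inferred from the commit author and model name, not stated in the diff) and that the tokenizer picks up the bundled `chat_template.jinja`:

```python
# Minimal inference sketch for the merged checkpoint.
# Assumption: the Hub repo id is "basmala12/smollm_finetuning5".
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "basmala12/smollm_finetuning5"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id)

# The repository's chat_template.jinja drives apply_chat_template().
messages = [{"role": "user", "content": "Explain LoRA in two sentences."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)

output_ids = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```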
 
+ ---
+ library_name: transformers
+ pipeline_tag: text-generation
+ base_model: SmolAI/SmolLM2-1.7B
+ license: apache-2.0
+ language:
+ - en
+ tags:
+ - smollm2
+ - finetuned
+ - reasoning
+ - concise
+ model_type: causal-lm
+ ---
 
+ # smollm_finetuning5: Fine-Tuned SmolLM2-1.7B for Concise Instruction Reasoning

+ *smollm_finetuning5* is a fine-tuned version of *SmolAI/SmolLM2-1.7B*, trained on synthetic instruction–response samples and concise reasoning data. The model is optimized to produce short, structured, and clear answers while improving general instruction-following behavior.

+ The goal of this fine-tuning was to enhance reasoning clarity and response consistency in a compact 1.7B-parameter model.
+ ---

+ ## Features

+ - Fine-tuned for concise and structured responses
+ - Improved instruction-following capabilities
+ - Handles short reasoning and explanation tasks
+ - Lightweight and efficient (1.7B parameters)
+ - Suitable for general-purpose educational and reasoning use cases

+ ---
+ ## Intended Use

+ ### Recommended
+ - General question–answer interactions
+ - Explanation of simple topics
+ - Short reasoning steps
+ - Instruction–response tasks

+ ### Not Recommended
+ - High-stakes or decision-critical applications
+ - Domain-specific or specialized factual tasks
+ - Situations requiring verified accuracy

+ ---
 
+ ## Training Data

+ The model was fine-tuned on *argilla/synthetic-concise-reasoning-sft-filtered*, which provides:

+ - Instruction–answer pairs
+ - Synthetic reasoning prompts
+ - Concise explanation samples

+ The dataset consists of simplified synthetic examples designed to enhance clarity, reasoning, and instruction handling.

+ ---
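
The dataset is public on the Hugging Face Hub, so its schema can be checked before training. A short inspection sketch with the `datasets` library (the `train` split name is an assumption):

```python
# Inspect the SFT dataset; assumes it exposes a "train" split.
from datasets import load_dataset

ds = load_dataset("argilla/synthetic-concise-reasoning-sft-filtered", split="train")
print(ds)     # column names and row count
print(ds[0])  # one instruction-response example
```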
+ ## Training Details

+ - Base Model: SmolAI/SmolLM2-1.7B
+ - Fine-Tuning Method: LoRA adapters (merged into the final weights)
+ - Epochs: 3
+ - Learning Rate: 2e-4
+ - Loss: Causal language modeling
+ - Output Format: FP32 safetensors

+ ---
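
For readers who want to reproduce a comparable run, the details above map onto a standard PEFT workflow, sketched here with TRL's `SFTTrainer`. Only the dataset, the 3 epochs, the 2e-4 learning rate, and the merge step come from the card; the trainer choice, LoRA rank/alpha, and target modules are assumptions. Note that the SmolLM2-1.7B base is published on the Hub as `HuggingFaceTB/SmolLM2-1.7B`, although the card writes `SmolAI/SmolLM2-1.7B`.

```python
# Illustrative LoRA SFT + merge; not the exact script behind this checkpoint.
from datasets import load_dataset
from peft import LoraConfig
from trl import SFTConfig, SFTTrainer

dataset = load_dataset("argilla/synthetic-concise-reasoning-sft-filtered", split="train")

peft_config = LoraConfig(  # rank/alpha/target modules assumed, not from the card
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)

args = SFTConfig(
    output_dir="smollm_finetuning5",
    num_train_epochs=3,    # from the card
    learning_rate=2e-4,    # from the card
)

trainer = SFTTrainer(
    model="HuggingFaceTB/SmolLM2-1.7B",  # Hub id of the SmolLM2 base
    train_dataset=dataset,
    args=args,
    peft_config=peft_config,
)
trainer.train()

# Fuse the LoRA adapters into the base weights and save a plain
# .safetensors checkpoint, matching the merged format described above.
merged = trainer.model.merge_and_unload()
merged.save_pretrained("smollm_finetuning5-merged", safe_serialization=True)
```

Merging trades per-task adapter flexibility for a single self-contained checkpoint, which is why the repository ships one model.safetensors rather than separate adapter files.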