Update README.md
README.md
CHANGED
@@ -9,8 +9,32 @@ base_model:
 - LiquidAI/LFM2.5-1.2B-Instruct
 ---
 
-## Training
+## 📉 Training Results & Metrics
+
+This model was fine-tuned from **LiquidAI/LFM2.5-1.2B-Instruct** using **Unsloth** on a T4 GPU. The following metrics were recorded during the final training run.
+
+| Metric | Value | Description |
+| :--- | :--- | :--- |
+| **Final Loss** | `0.7431` | Training loss at the final step. |
+| **Average Train Loss** | `0.8274` | Mean training loss over the whole run. |
+| **Epochs** | `0.96` | Roughly one full pass over the dataset. |
+| **Global Steps** | `60` | Total number of optimizer updates. |
+| **Runtime** | `594 s` (~10 min) | Total wall-clock training time. |
+| **Samples/Second** | `0.808` | Throughput on the T4 GPU. |
+| **Gradient Norm** | `0.345` | Indicates stable training (no exploding gradients). |
+| **Learning Rate** | `3.64e-6` | Final learning rate after decay. |
+| **Total FLOs** | `2.07e15` | Total floating-point operations computed. |
+
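A quick arithmetic cross-check of the table above (a sketch: the samples-per-step figure is inferred from the logged numbers, not itself a logged value):

```python
# Cross-check the logged throughput metrics from the table above.
runtime_s = 594         # Runtime
samples_per_s = 0.808   # Samples/Second
global_steps = 60       # Global Steps

total_samples = samples_per_s * runtime_s          # samples processed overall
samples_per_step = total_samples / global_steps    # inferred effective batch size

print(round(total_samples), round(samples_per_step))  # -> 480 8
```

So the run saw roughly 480 samples, about 8 per optimizer update.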
+### 🛠️ Hardware & Framework
+* **Hardware:** NVIDIA Tesla T4 (Google Colab Free Tier)
+* **Framework:** Unsloth (PyTorch)
+* **Quantization:** 4-bit (QLoRA)
+* **Optimizer:** AdamW 8-bit
+
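The bullets above name the recipe but not the code. A minimal sketch of a matching Unsloth run follows — only the base model, 4-bit loading, AdamW 8-bit, and the 60-step budget come from this card; the LoRA rank/alpha, target modules, sequence length, batch size, and learning rate are assumptions:

```python
# Illustrative QLoRA fine-tune with Unsloth on a T4. Only the base model,
# 4-bit loading, adamw_8bit, and the 60-step budget come from this card;
# every other hyperparameter below is a placeholder assumption.

SETTINGS = {
    "base_model": "LiquidAI/LFM2.5-1.2B-Instruct",  # from the card metadata
    "load_in_4bit": True,                           # QLoRA, from the card
    "optim": "adamw_8bit",                          # from the card
    "max_steps": 60,                                # matches the logged global steps
    "lora_r": 16,                                   # assumed LoRA rank
}

def train(dataset):
    # Imported lazily so the sketch can be read and sanity-checked without
    # a GPU environment; running it requires unsloth and trl installed.
    from unsloth import FastLanguageModel
    from trl import SFTConfig, SFTTrainer

    model, tokenizer = FastLanguageModel.from_pretrained(
        model_name=SETTINGS["base_model"],
        max_seq_length=2048,                   # assumed context length
        load_in_4bit=SETTINGS["load_in_4bit"],
    )
    # Attach LoRA adapters; rank, alpha, and target modules are assumptions.
    model = FastLanguageModel.get_peft_model(
        model,
        r=SETTINGS["lora_r"],
        lora_alpha=16,
        target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    )
    trainer = SFTTrainer(
        model=model,
        tokenizer=tokenizer,
        train_dataset=dataset,
        args=SFTConfig(
            per_device_train_batch_size=2,     # assumed; with accumulation,
            gradient_accumulation_steps=4,     # an effective batch of 8
            max_steps=SETTINGS["max_steps"],
            learning_rate=2e-4,                # assumed peak; decays toward ~3.6e-6
            optim=SETTINGS["optim"],
            output_dir="outputs",
        ),
    )
    trainer.train()
```

`train()` is deliberately not invoked here: it needs a CUDA machine with `unsloth` and `trl` installed, plus a tokenized dataset.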
+<details>
+<summary><strong>View Raw Training Log (JSON)</strong></summary>
+
+```json
 {
   "_runtime": 348,
   "_step": 60,