--- library_name: transformers license: apache-2.0 base_model: LiquidAI/LFM2.5-8B-A1B base_model_relation: finetune tags: - lfm - liquid-ai - moe - agentic - terminal - fable-5 - distillation - sft - ablation language: - en pipeline_tag: text-generation --- # Fabliq-8B-Agent-FromBase ๐ŸŒŠ๐Ÿ”ฌ > **Ablation variant** of [Fabliq-8B-Agent](https://huggingface.co/LLM-OS-Models/Fabliq-8B-Agent) โ€” fine-tuned directly from raw [`LiquidAI/LFM2.5-8B-A1B`](https://huggingface.co/LiquidAI/LFM2.5-8B-A1B), skipping the ToolBench foundation. Used to isolate the effect of the ToolBench intermediate stage. ## ๐Ÿ”ฌ Why this variant? [Fabliq-8B-Agent](https://huggingface.co/LLM-OS-Models/Fabliq-8B-Agent) is trained as: `LiquidAI/LFM2.5-8B-A1B` โ†’ `ToolBench-Full-SFT-1Epoch` โ†’ `Fable-5`. **This model** skips the middle step: `LiquidAI/LFM2.5-8B-A1B` โ†’ `Fable-5` (direct). Comparing the two answers the question: **does ToolBench foundation actually help, or does Fable-5 alone give you the same agent?** ## ๐Ÿงช Model details | | | | --- | --- | | **Architecture** | Lfm2MoeForCausalLM (24 layers, 32 experts, 4 experts/token) | | **Parameters** | ~8B total / ~1B active (MoE) | | **Context** | 8,192 trained ยท 128K native | | **Precision** | bfloat16 | | **Fine-tune type** | Full-parameter SFT (direct from base, no ToolBench) | | **License** | Apache 2.0 | ## ๐Ÿ“š Training data Same as Phase-1: [Fable-5-traces](https://huggingface.co/datasets/Glint-Research/Fable-5-traces), 4,047 rows ร— 3 epoch. ## ๐Ÿ”ง Training procedure | Hyperparameter | Value | | --- | --- | | Schedule | 3 epochs, constant LR | | Max sequence length | 8,192 | | Per-device batch size | 2 | | Gradient accumulation | 4 | | GPUs | 8ร— H200 (effective batch 64) | | Learning rate | **1e-6** (higher than Phase-1's 5e-7, since starting from base) | | Precision | bf16 | | Final train_loss | logged in run_config.json | | Train runtime | ~14 min | ## ๐ŸŒณ Model tree ``` LiquidAI/LFM2.5-8B-A1B โ”œโ”€ LLM-OS-Models/LFM2.5-8B-A1B-Terminal-ToolBench-Full-SFT-1Epoch โ”‚ โ””โ”€ LLM-OS-Models/Fabliq-8B-Agent โ† Phase-1 (ToolBench โ†’ Fable-5) โ””โ”€ LLM-OS-Models/Fabliq-8B-Agent-FromBase โ† this model (base โ†’ Fable-5, direct) โ””โ”€ LLM-OS-Models/Fabliq-8B-Agent-FromBase-Reasoning โ† + reasoning expansion ``` ## ๐Ÿš€ Usage See [Fabliq-8B-Agent](https://huggingface.co/LLM-OS-Models/Fabliq-8B-Agent) โ€” same system prompt, same tool-call format, same inference code. ## ๐Ÿ“œ License Apache 2.0.