gyung's picture
Upload README.md with huggingface_hub
dd84a4d verified
|
Raw
History Blame Contribute Delete
2.34 kB
---
library_name: transformers
license: apache-2.0
base_model: LLM-OS-Models/Fabliq-8B-Agent-FromBase
base_model_relation: finetune
tags:
- lfm
- liquid-ai
- moe
- agentic
- terminal
- reasoning
- fable-5
- withinus
- helio
- distillation
- sft
- ablation
language:
- en
pipeline_tag: text-generation
---
# Fabliq-8B-Agent-FromBase-Reasoning πŸŒŠπŸ”¬πŸ§ 
> **Reasoning-expanded sibling** of [Fabliq-8B-Agent-FromBase](https://huggingface.co/LLM-OS-Models/Fabliq-8B-Agent-FromBase). Adds general + deep reasoning (WithinUs + Helio) on top of the FromBase agentic foundation. Mirrors [Fabliq-8B-Agent-Reasoning](https://huggingface.co/LLM-OS-Models/Fabliq-8B-Agent-Reasoning) but for the FromBase ablation lineage.
## πŸ”¬ Why this variant?
Two parallel lineages, both end with reasoning expansion:
| Lineage | Phase-1 | Phase-2 |
| --- | --- | --- |
| **ToolBench line** | `Fabliq-8B-Agent` (ToolBench β†’ Fable-5) | `Fabliq-8B-Agent-Reasoning` |
| **FromBase line (this)** | `Fabliq-8B-Agent-FromBase` (base β†’ Fable-5) | `Fabliq-8B-Agent-FromBase-Reasoning` (this) |
Comparing the two Phase-2 endpoints answers: **does ToolBench foundation matter for the final reasoning model, or is Fable-5 + reasoning enough?**
## πŸ§ͺ Model details
| | |
| --- | --- |
| **Architecture** | Lfm2MoeForCausalLM (24 layers, 32 experts, 4 experts/token) |
| **Parameters** | ~8B total / ~1B active (MoE) |
| **Context** | 8,192 trained Β· 128K native |
| **Precision** | bfloat16 |
| **Fine-tune type** | Full-parameter SFT continuation |
| **License** | Apache 2.0 |
## πŸ“š Training data (Phase-2B only)
Same as Phase-2: WithinUs 135 + Helio 146 = 281 rows Γ— 4 epoch.
## πŸ”§ Training procedure
| Hyperparameter | Value |
| --- | --- |
| Base | `LLM-OS-Models/Fabliq-8B-Agent-FromBase` |
| Schedule | 4 epochs, LR 3e-7 |
| Max sequence length | 8,192 |
| GPUs | 8Γ— H200 (effective batch 64) |
| Train runtime | ~6 min |
## 🌳 Model tree
```
LiquidAI/LFM2.5-8B-A1B
β”œβ”€ ToolBench line:
β”‚ └─ LFM2.5-8B-A1B-Terminal-ToolBench-Full-SFT-1Epoch
β”‚ └─ Fabliq-8B-Agent
β”‚ └─ Fabliq-8B-Agent-Reasoning
└─ FromBase line (this):
└─ Fabliq-8B-Agent-FromBase
└─ Fabliq-8B-Agent-FromBase-Reasoning ← this model
```
## πŸ“œ License
Apache 2.0.