gyung's picture
Upload README.md with huggingface_hub
dd84a4d verified
|
Raw
History Blame Contribute Delete
2.34 kB
metadata
library_name: transformers
license: apache-2.0
base_model: LLM-OS-Models/Fabliq-8B-Agent-FromBase
base_model_relation: finetune
tags:
  - lfm
  - liquid-ai
  - moe
  - agentic
  - terminal
  - reasoning
  - fable-5
  - withinus
  - helio
  - distillation
  - sft
  - ablation
language:
  - en
pipeline_tag: text-generation

Fabliq-8B-Agent-FromBase-Reasoning πŸŒŠπŸ”¬πŸ§ 

Reasoning-expanded sibling of Fabliq-8B-Agent-FromBase. Adds general + deep reasoning (WithinUs + Helio) on top of the FromBase agentic foundation. Mirrors Fabliq-8B-Agent-Reasoning but for the FromBase ablation lineage.

πŸ”¬ Why this variant?

Two parallel lineages, both end with reasoning expansion:

Lineage Phase-1 Phase-2
ToolBench line Fabliq-8B-Agent (ToolBench β†’ Fable-5) Fabliq-8B-Agent-Reasoning
FromBase line (this) Fabliq-8B-Agent-FromBase (base β†’ Fable-5) Fabliq-8B-Agent-FromBase-Reasoning (this)

Comparing the two Phase-2 endpoints answers: does ToolBench foundation matter for the final reasoning model, or is Fable-5 + reasoning enough?

πŸ§ͺ Model details

Architecture Lfm2MoeForCausalLM (24 layers, 32 experts, 4 experts/token)
Parameters ~8B total / ~1B active (MoE)
Context 8,192 trained Β· 128K native
Precision bfloat16
Fine-tune type Full-parameter SFT continuation
License Apache 2.0

πŸ“š Training data (Phase-2B only)

Same as Phase-2: WithinUs 135 + Helio 146 = 281 rows Γ— 4 epoch.

πŸ”§ Training procedure

Hyperparameter Value
Base LLM-OS-Models/Fabliq-8B-Agent-FromBase
Schedule 4 epochs, LR 3e-7
Max sequence length 8,192
GPUs 8Γ— H200 (effective batch 64)
Train runtime ~6 min

🌳 Model tree

LiquidAI/LFM2.5-8B-A1B
  β”œβ”€ ToolBench line:
  β”‚   └─ LFM2.5-8B-A1B-Terminal-ToolBench-Full-SFT-1Epoch
  β”‚       └─ Fabliq-8B-Agent
  β”‚           └─ Fabliq-8B-Agent-Reasoning
  └─ FromBase line (this):
      └─ Fabliq-8B-Agent-FromBase
          └─ Fabliq-8B-Agent-FromBase-Reasoning  ← this model

πŸ“œ License

Apache 2.0.