gyung's picture
Upload README.md with huggingface_hub
f1e2b4d verified
|
Raw
History Blame Contribute Delete
2.54 kB
metadata
library_name: transformers
license: apache-2.0
base_model: LiquidAI/LFM2.5-8B-A1B
base_model_relation: finetune
tags:
  - lfm
  - liquid-ai
  - moe
  - agentic
  - terminal
  - fable-5
  - distillation
  - sft
  - ablation
language:
  - en
pipeline_tag: text-generation

Fabliq-8B-Agent-FromBase πŸŒŠπŸ”¬

Ablation variant of Fabliq-8B-Agent β€” fine-tuned directly from raw LiquidAI/LFM2.5-8B-A1B, skipping the ToolBench foundation. Used to isolate the effect of the ToolBench intermediate stage.

πŸ”¬ Why this variant?

Fabliq-8B-Agent is trained as: LiquidAI/LFM2.5-8B-A1B β†’ ToolBench-Full-SFT-1Epoch β†’ Fable-5.

This model skips the middle step: LiquidAI/LFM2.5-8B-A1B β†’ Fable-5 (direct).

Comparing the two answers the question: does ToolBench foundation actually help, or does Fable-5 alone give you the same agent?

πŸ§ͺ Model details

Architecture Lfm2MoeForCausalLM (24 layers, 32 experts, 4 experts/token)
Parameters ~8B total / ~1B active (MoE)
Context 8,192 trained Β· 128K native
Precision bfloat16
Fine-tune type Full-parameter SFT (direct from base, no ToolBench)
License Apache 2.0

πŸ“š Training data

Same as Phase-1: Fable-5-traces, 4,047 rows Γ— 3 epoch.

πŸ”§ Training procedure

Hyperparameter Value
Schedule 3 epochs, constant LR
Max sequence length 8,192
Per-device batch size 2
Gradient accumulation 4
GPUs 8Γ— H200 (effective batch 64)
Learning rate 1e-6 (higher than Phase-1's 5e-7, since starting from base)
Precision bf16
Final train_loss logged in run_config.json
Train runtime ~14 min

🌳 Model tree

LiquidAI/LFM2.5-8B-A1B
  β”œβ”€ LLM-OS-Models/LFM2.5-8B-A1B-Terminal-ToolBench-Full-SFT-1Epoch
  β”‚   └─ LLM-OS-Models/Fabliq-8B-Agent          ← Phase-1 (ToolBench β†’ Fable-5)
  └─ LLM-OS-Models/Fabliq-8B-Agent-FromBase     ← this model (base β†’ Fable-5, direct)
      └─ LLM-OS-Models/Fabliq-8B-Agent-FromBase-Reasoning  ← + reasoning expansion

πŸš€ Usage

See Fabliq-8B-Agent β€” same system prompt, same tool-call format, same inference code.

πŸ“œ License

Apache 2.0.