v3 release — what's new, what's next, feedback welcome

#1
by clemsail - opened

Hi everyone — micro-kiki-v3 just dropped (17/04). Opening this thread to share context and collect feedback.

What this is

A cognitive LLM stack specialized in embedded engineering, not a flat fine-tune:

  • Router → top-4 selection among 35 domain-specific LoRA stacks
  • Base — Qwen3.5-35B-A3B (MoE 256 experts, 3B active/token), rank-16 LoRAs on q/k/v/o, top-2 routing per stack
  • Null-space projection between stacks to reduce catastrophic forgetting
  • Negotiator (CAMP + Catfish) when stacks disagree
  • Anti-bias layer (KnowBias + RBD) before output
  • Aeon memory (Atlas graph + Trace log) for cross-session persistence

Context 262K tokens, GGUF, Apache 2.0, French + English interleaved.

Companion release — training toolkit is also public

We released the training toolkit one day earlier: L-electron-Rare/KIKI-Mac_tunner — an MLX fine-tuning toolkit (Mac Studio) for distilling Claude Opus reasoning into Mistral Large 123B. The full pipeline is open, not just the artifact.

Dataset

clemsail/micro-kiki-v3-dataset — 489K instruction-following examples across 35 domains:

  • 50,116 real Claude CLI sessions captured on our 5-node P2P mesh during actual embedded consulting work (GrosMac M5, Tower, CILS, KXKM-AI RTX 4090, VM bootstrap)
  • 364K from 19 filtered open-source HF datasets (CodeFeedback, French-Alpaca, Electronics StackExchange, stm32-hal-dataset, etc.)
  • Opus teacher distillation for chat-fr and reasoning
  • 32 original curated seed sets per domain

What I'd love from you

  • Benchmarks against base Qwen3.5 / GPT-4 / Claude on embedded-specific tasks — I only have internal eval. A reproducible public benchmark is on the v4 roadmap but community runs would help.
  • Edge cases where the router picks the wrong stack — especially useful for improving the classifier in v4.
  • Memory/inference regressions on your hardware (Q4_K_M works cleanly on Apple Silicon 32GB+ and RTX 4090; other configs untested).
  • Domains we missed — the 35 categories are pragmatic not exhaustive, happy to add in v4.

Ecosystem overview: github.com/L-electron-Rare — 13 public repos spanning the FineFab platform (hardware, firmware, CAD, ML).

Issues, forks, negative findings all welcome. Apache 2.0 means fork it, break it, improve it.

— Clément / L'Électron Rare

Sign up or log in to comment