fast-code-moe / README.md
1kz's picture
Upload MoE router and config
8ac693e verified

fast-code-moe

Mixture of Experts (MoE) with:

  • Experts: ['mistralai/Mistral-7B-Instruct-v0.2', 'Qwen/Qwen2.5-7B-Instruct']
  • Router: distilbert-base-uncased + MLP
  • Top‑k: 1
  • Quantization: 4‑bit (bitsandbytes)

Trained on a subset of FLAN/IMDb to route instructions to the most suitable expert.