---
license: apache-2.0
tags:
- chimera
- moe
- mixture-of-experts
- gguf
- klyrone
language:
- en
pipeline_tag: text-generation
---

# Chimera 8x7B

**Chimera** is a Mixture-of-Experts language model developed by [Klyrone Tech](https://huggingface.co/kk497055), built using our proprietary **Amalgamation of Experts (AoE)** technique. Chimera features 8 specialized expert networks with top-2 routing, delivering strong instruction-following and reasoning with an efficient 12.9B active-parameter footprint.

## Model Details

| Property | Value |
|---|---|
| **Architecture** | Mixture of Experts (MoE): 8 experts, top-2 routing |
| **Total Parameters** | 46.7B |
| **Active Parameters** | 12.9B per token |
| **Context Length** | 32,768 tokens |
| **Quantization** | Q5_K_M (GGUF) |
| **Developed by** | Klyrone Tech |
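
The active-parameter count follows from the expert layout: with top-2 routing, each token runs through the shared (non-expert) weights plus 2 of the 8 expert FFNs. A back-of-the-envelope check of the split, assuming all 8 experts are the same size and router parameters are negligible (the exact breakdown is not published here, so treat these numbers as illustrative):

```python
TOTAL = 46.7e9   # total parameters stored
ACTIVE = 12.9e9  # parameters active per token

# Model the parameters as shared weights s plus 8 equal expert blocks of size e:
#   TOTAL  = s + 8 * e   (all experts are stored)
#   ACTIVE = s + 2 * e   (top-2 routing: only 2 experts run per token)
# Subtracting the two equations gives TOTAL - ACTIVE = 6 * e.
e = (TOTAL - ACTIVE) / 6  # per-expert parameters (summed over all layers)
s = TOTAL - 8 * e         # shared attention/embedding parameters

print(f"per-expert: {e/1e9:.2f}B, shared: {s/1e9:.2f}B")
# → per-expert: 5.63B, shared: 1.63B
```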

## Key Features

- **Efficient MoE Architecture** - Only 12.9B parameters active per forward pass despite 46.7B total, enabling fast inference
- **Specialized Expert Networks** - 8 expert FFN modules with learned routing for task-adaptive computation
- **Instruction-Tuned Experts** - Expert networks optimized for instruction following, code generation, and reasoning
- **Long Context** - Supports up to 32K token context windows with RoPE positional encoding
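
Top-2 routing means a small learned gate scores all 8 experts for each token, runs only the 2 highest-scoring expert FFNs, and mixes their outputs with the renormalized gate weights. A minimal NumPy sketch of that routing step (names, shapes, and the linear toy experts are assumptions for illustration, not Chimera's actual implementation):

```python
import numpy as np

def top2_moe_layer(x, gate_w, experts):
    """One MoE FFN layer: route each token to its top-2 experts.

    x:       (tokens, d_model) activations
    gate_w:  (d_model, n_experts) router weights
    experts: list of callables, each mapping (d_model,) -> (d_model,)
    """
    logits = x @ gate_w                          # (tokens, n_experts) router scores
    top2 = np.argsort(logits, axis=-1)[:, -2:]   # indices of the 2 best experts
    out = np.zeros_like(x)
    for t in range(x.shape[0]):
        scores = logits[t, top2[t]]
        weights = np.exp(scores - scores.max())
        weights /= weights.sum()                 # softmax over the 2 selected scores
        for w, idx in zip(weights, top2[t]):
            out[t] += w * experts[idx](x[t])     # only 2 of the 8 experts execute
    return out

# Toy usage: 8 linear "experts", 4 tokens, d_model = 16
rng = np.random.default_rng(0)
d, n_exp = 16, 8
experts = [(lambda W: (lambda v: v @ W))(rng.standard_normal((d, d)) / d**0.5)
           for _ in range(n_exp)]
y = top2_moe_layer(rng.standard_normal((4, d)), rng.standard_normal((d, n_exp)), experts)
```

Per-token compute therefore scales with 2 experts rather than 8, which is why only 12.9B of the 46.7B stored parameters are active for any given token.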

## Amalgamation of Experts (AoE)

Chimera is built using our **AoE** technique, a novel approach to constructing high-quality MoE models by strategically assembling expert networks. AoE enables the creation of models that combine specialized capabilities from multiple training paradigms into a unified, coherent architecture.

## Usage

### With llama.cpp
```bash
./llama-cli -m Chimera-8x7B-Q5_K_M.gguf -p "Your prompt here" -n 500 -ngl 99
```

## Intended Use

Chimera is designed for general-purpose text generation, including conversational AI, code generation, reasoning, and instruction following.

## License

Apache 2.0