---
base_model:
- NousResearch/Hermes-3-Llama-3.1-8B
license: mit
pipeline_tag: text-generation
base_model_relation: finetune
library_name: transformers
tags:
- nouscoder
- sft
---

> [!TIP]
> **[Support this work →](https://donate.sybilsolutions.ai)** · [X](https://x.com/0xsero) · [GitHub](https://github.com/0xsero) · [REAP paper](https://arxiv.org/abs/2510.13999) · [Cerebras REAP](https://huggingface.co/collections/cerebras/cerebras-reap)

# NousCoder-14B-SFT-Tools

SFT fine-tune of [NousResearch/Hermes-3-Llama-3.1-8B](https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B).

## At a glance

| Property | Value |
|---|---|
| Base model | [NousResearch/Hermes-3-Llama-3.1-8B](https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B) |
| Format | SFT |
| Total params | **14B** |
| Active / token | — |
| Experts / layer | — |
| Layers | — |
| Hidden size | — |
| Context | — |
| On-disk size | 1 GB |

## Which variant should I pick?

| Variant | Format | Link |
|---|---|---|
| `NousCoder-14B-SFT` | SFT | [link](https://huggingface.co/0xSero/NousCoder-14B-SFT) |
| `NousCoder-14B-SFT-Tools` **(this)** | SFT | [link](https://huggingface.co/0xSero/NousCoder-14B-SFT-Tools) |
| `NousCoder-14B-Tools` | Tools | [link](https://huggingface.co/0xSero/NousCoder-14B-Tools) |

## License & citation

License inherited from the base model.

```bibtex
@misc{lasby2025reap,
  title         = {REAP the Experts: Why Pruning Prevails for One-Shot MoE Compression},
  author        = {Mike Lasby and Ivan Lazarevich and Nish Sinnadurai and Sean Lie and Yani Ioannou and Vithursan Thangarasa},
  year          = {2025},
  eprint        = {2510.13999},
  archivePrefix = {arXiv}
}
```

## Sponsors

Made possible by **NVIDIA · TNG Technology · Lambda · Prime Intellect · Hot Aisle**.