0xSero
/

NousCoder-14B-SFT-Tools

Text Generation

Model card Files Files and versions

NousCoder-14B-SFT-Tools / README.md

0xSero's picture

Standardize model card (template rollout)

31405cf verified 5 days ago

|

history blame contribute delete

1.75 kB

	---
	base_model:
	- NousResearch/Hermes-3-Llama-3.1-8B
	license: mit
	pipeline_tag: text-generation
	base_model_relation: finetune
	library_name: transformers
	tags:
	- nouscoder
	- sft
	---

	> [!TIP]
	> [Support this work →](https://donate.sybilsolutions.ai) · [X](https://x.com/0xsero) · [GitHub](https://github.com/0xsero) · [REAP paper](https://arxiv.org/abs/2510.13999) · [Cerebras REAP](https://huggingface.co/collections/cerebras/cerebras-reap)

	# NousCoder-14B-SFT-Tools

	SFT fine-tune of [NousResearch/Hermes-3-Llama-3.1-8B](https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B).

	## At a glance

	\| \| \|
	\|---\|---\|
	\| Base model \| [NousResearch/Hermes-3-Llama-3.1-8B](https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B) \|
	\| Format \| SFT \|
	\| Total params \| 14B \|
	\| Active / token \| — \|
	\| Experts / layer \| — \|
	\| Layers \| — \|
	\| Hidden size \| — \|
	\| Context \| — \|
	\| On-disk size \| 1 GB \|

	## Which variant should I pick?

	\| Variant \| Format \| Link \|
	\|---\|---\|---\|
	\| `NousCoder-14B-SFT` \| SFT \| [link](https://huggingface.co/0xSero/NousCoder-14B-SFT) \|
	\| `NousCoder-14B-SFT-Tools` (this) \| SFT \| [link](https://huggingface.co/0xSero/NousCoder-14B-SFT-Tools) \|
	\| `NousCoder-14B-Tools` \| Tools \| [link](https://huggingface.co/0xSero/NousCoder-14B-Tools) \|

	## License & citation
	License inherited from the base model.

	```bibtex
	@misc{lasby2025reap,
	title = {REAP the Experts: Why Pruning Prevails for One-Shot MoE Compression},
	author = {Mike Lasby and Ivan Lazarevich and Nish Sinnadurai and Sean Lie and Yani Ioannou and Vithursan Thangarasa},
	year = {2025}, eprint = {2510.13999}, archivePrefix = {arXiv}
	}
	```

	## Sponsors
	Made possible by NVIDIA · TNG Technology · Lambda · Prime Intellect · Hot Aisle.