BoltMonkey
/

boltmonkey_shortreasoning-8b

Text Generation

Model card Files Files and versions

boltmonkey_shortreasoning-8b / README.md

BoltMonkey's picture

Add model card

1bafddb verified 10 months ago

|

history blame contribute delete

664 Bytes

	---
	license: apache-2.0
	base_model: BoltMonkey/SuperNeuralDreadDevil-8b
	datasets:
	- Nitral-Archive/Cosmopedia-Instruct-60k-Distilled-R1-70B-ShareGPT-Reason-Tags
	pipeline_tag: text-generation
	tags:
	- qlora
	- 4bit
	- merged
	---

	# boltmonkey_shortreasoning‑8b

	Merged QLoRA adapter plus base weights.
	Fine‑tuned for short‑form chain‑of‑thought reasoning.

	* Base model: SuperNeuralDreadDevil-8b
	* Dataset: Cosmopedia‑Instruct 60k (ShareGPT style)
	* Context length: 1096 tokens
	* Training: 4 epochs, LoRA r = 32, α = 16, dropout 0.05, fp16, 4‑bit quant

	See `train_args.json` for the full Axolotl config.