ConicCat
/

Apriel-R1PV.2-Thinking

Model card Files Files and versions

Apriel-R1PV.2-Thinking / README.md

ConicCat's picture

Update README.md

b8207fd verified 9 months ago

|

history blame contribute delete

1.58 kB

	---
	base_model:
	- ServiceNow-AI/Apriel-Nemotron-15b-Thinker
	datasets:
	- Undi95/R1-RP-ShareGPT3
	- PJMixers-Dev/Gryphe-Aesir-RPG-Charcards-Opus-Mixed-split-v3-0324
	---

	### Apriel R1P V.2

	Day 2 RP finetune of Apriel 15B, with several iterative improvements from the first version. In particular, coherence at good temperatures (~.7) should be much higher.

	I also fully converted the model to use the Phi 3 format; this comes at the slight tradeoff of the `<\|end\|>` tag not always tokenizing exactly the same way in a few niche scenarios.

	Further attempts were made to fix formatting issues with asterisks on the base model.

	NOTE: THIS IS THE THINKING VERSION

	Upon further testing, I discovered that while merging back onto the instruct improved thinking mode, this came at the cost of degraded non-thinking outputs.

	Use the [non-thinking version](https://huggingface.co/ConicCat/Apriel-R1PV.2-NoThink) instead if you want a standard model.

	## Thinking Mode

	- To enable thinking place `/think` in the system prompt and prefill `<\|think\|>\n` for thinking mode.

	- Phi esque thinking tags, `<\|think\|>` and `<\|/think\|>` have been added to the model.

	- Remember to reconfigure Sillytavern to parse the new think tags.

	## Settings

	The chat template has been converted to a Phi 3 template as the model seemed to respond best to this format.

	This model does prefer having character cards placed in user messages, not the system prompt.

	## Special Thanks:

	Undi95 for portions of their dataset and inspiration.

	PJMixers-Dev for their dataset curation and creation efforts.