File size: 1,575 Bytes
5a83c3b 2ca84bf 9e6ea4a 5a83c3b 9e6ea4a 5a83c3b 9e6ea4a 5a83c3b 9e6ea4a 5a83c3b 2e8aaa7 5a83c3b ac024a4 5a83c3b ac024a4 5a83c3b 3cc9519 5a83c3b ac024a4 5a83c3b ac024a4 b8207fd 9e6ea4a 5a83c3b 9e6ea4a 5a83c3b 9e6ea4a 5a83c3b 9e6ea4a 5a83c3b 04de1ba 9e6ea4a 5a83c3b 9e6ea4a 5a83c3b 9e6ea4a |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 |
---
base_model:
- ServiceNow-AI/Apriel-Nemotron-15b-Thinker
datasets:
- Undi95/R1-RP-ShareGPT3
- PJMixers-Dev/Gryphe-Aesir-RPG-Charcards-Opus-Mixed-split-v3-0324
---
### Apriel R1P V.2
Day 2 RP finetune of Apriel 15B, with several iterative improvements from the first version. In particular, coherence at good temperatures (~.7) should be much higher.
I also fully converted the model to use the Phi 3 format; this comes at the slight tradeoff of the `<|end|>` tag not always tokenizing exactly the same way in a few niche scenarios.
Further attempts were made to fix formatting issues with asterisks on the base model.
NOTE: THIS IS THE THINKING VERSION
Upon further testing, I discovered that while merging back onto the instruct improved thinking mode, this came at the cost of degraded non-thinking outputs.
Use the [non-thinking version](https://huggingface.co/ConicCat/Apriel-R1PV.2-NoThink) instead if you want a standard model.
## Thinking Mode
- To enable thinking place `/think` in the system prompt and prefill `<|think|>\n` for thinking mode.
- Phi esque thinking tags, `<|think|>` and `<|/think|>` have been added to the model.
- Remember to reconfigure Sillytavern to parse the new think tags.
## Settings
The chat template has been converted to a Phi 3 template as the model seemed to respond best to this format.
This model does prefer having character cards placed in user messages, not the system prompt.
## Special Thanks:
Undi95 for portions of their dataset and inspiration.
PJMixers-Dev for their dataset curation and creation efforts. |