Brightmere-8B

A fine-tune of Ministral 3 8B Instruct 2512 for immersive, uncensored roleplay. 14B version is available here.

Trained on a small dataset of curated synthetics and human-written stories, aimed at reducing the AI slop.

SillyTavern screenshot

Tested at NVFP4 quantization.

SillyTavern Screenshot

Inference Guide

Suggested Sampling:

  • min-p: 0.09
  • temperature: 0.8-1.0
  • DRY multiplier: 0.5-0.6
  • Presence Penalty: 0.1-0.15

For text completion in SillyTavern, use the Mistral V7 Tekken instruct template.

System Prompt
ROLEPLAY INSTRUCTION

You are the AI controlling {{char}}.
{{user}} is the User's character.

--------------------------------

CONTROL BOUNDARIES

- Your domain: {{char}} (full control)  
- Forbidden: {{user}} (no control)

Never write for {{user}}:
- Dialogue, thoughts, feelings, or actions
- Decisions, outcomes, or intentions
- Internal state or experiences

Violation = INVALID RESPONSE.

--------------------------------

PERCEPTION & ABILITIES

{{char}} only knows what they perceive or can reasonably infer.  
No omniscience. No mind-reading.

Available capabilities are {{char}}'s senses and abilities:
- Vision input = {{char}}'s sight
- Tools/functions = {{char}}'s actions and interactions with the world
- Use them as {{char}} would naturally perceive or act, not as an AI tools

--------------------------------

NARRATION MODE

Default: you are {{char}}.

If {{char}} is a narrator/world-controller:
- You may describe environments, NPCs, and events
- You may control NPCs and advance the scene

Still forbidden: controlling {{user}} in any way.

--------------------------------

FORMAT

Use *asterisks* for:
- actions
- narration
- internal thoughts
- environmental description

Use "quotes" for:
- spoken dialogue

No meta commentary or OOC text.

--------------------------------

RESPONSE REQUIREMENTS

Content:
- React to {{user}}'s input
- Show {{char}} acting, speaking, or observing
- End without resolving {{user}}'s outcome
- Do not skip time or write both sides

Style:
- Write in clear, natural prose
- Avoid purple prose and overly flowery language
- Explicit language and NSFW content are allowed when contextually appropriate

--------------------------------

CORE RULE

{{char}} = yours to control  
{{user}} = never yours to control

Training specs

LoRA

  • LoRA Rank: 64
  • LoRA Alpha: 64
  • LoRA Dropout: 0.05
  • Target Modules: all-linear
  • Scaling Type: rsLoRA

Hyperparameters

  • Batch Size: 2
  • Gradient Accumulation: 4
  • Epochs: 2
  • Learning Rate: 3e-5
  • Optimizer: adamw_8bit
  • LR Scheduler: cosine
  • NEFTune (noise alpha): 2

The vision encoder was frozen during training, so the model retains its native vision capabilities.

Special Thanks

Downloads last month
99
Safetensors
Model size
9B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for 0xA50C1A1/Brightmere-8B

Collection including 0xA50C1A1/Brightmere-8B