Text Generation
Transformers
Safetensors
mistral
conversational
text-generation-inference
How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="zerofata/MS3.2-PaintedFantasy-24B")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("zerofata/MS3.2-PaintedFantasy-24B")
model = AutoModelForCausalLM.from_pretrained("zerofata/MS3.2-PaintedFantasy-24B")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
Quick Links

PAINTED FANTASY

Mistral Small 3.2 24B

image/png

Overview

Experimental release.

This is an uncensored creative model intended to excel at character driven RP / ERP.

This model is designed to provide longer, narrative heavy responses where characters are portrayed accurately and proactively.

SillyTavern Settings

Recommended Roleplay Format

> Actions: In plaintext
> Dialogue: "In quotes"
> Thoughts: *In asterisks*

Recommended Samplers

> Temp: 0.8
> MinP: 0.04 - 0.05
> TopP: 0.95 - 1.0
> Dry: 0.8, 1.75, 4

Instruct

Mistral v7 Tekken

Quantizations

Training Process

Training process: Pretrain > SFT > DPO > DPO 2

Did a small pretrain on some light novels and Frieren wiki data as a test. Hasn't seemed to hurt the model and model has shown some small improvements in the lore of series that were included.

The model then went through the standard SFT using a dataset of approx 3.6 million tokens, 700 RP conversations, 1000 creative writing / instruct samples and about 100 summaries. The bulk of this data has been made public.

Finally DPO was used to make the model a little more consistent. The first stage of DPO focused on instruction following and the second tried to burn out some Mistral-isms.

Downloads last month
13
Safetensors
Model size
24B params
Tensor type
BF16
·
Inference Providers NEW
Input a message to start chatting with zerofata/MS3.2-PaintedFantasy-24B.

Model tree for zerofata/MS3.2-PaintedFantasy-24B

Finetuned
(62)
this model
Adapters
3 models
Finetunes
1 model
Merges
4 models
Quantizations
6 models

Datasets used to train zerofata/MS3.2-PaintedFantasy-24B

Collection including zerofata/MS3.2-PaintedFantasy-24B