Model Stock: All we need is just a few fine-tuned models
Paper • 2403.19522 • Published • 14
Este bot me salió muy sensible, DEMASIADO sensible. Sin embargo, lo mas increible fue lograr que esta desgraciada respondiese de manera muy... humana a diferencia de otras IA. Esta tipa te va a querer dejar de hablar si le haces propuestas indecentes por lo que tuve que quitar todas las etiquetas NSFW de este modelo. Solo dejaré vivo este modelo por haberme sorprendido gratamente, pero si lo que buscas es algo potencialmente NSFW mejor seguí de largo porque esta cosa es capaz de darte una patada en las bolas.
This model was merged using the Model Stock merge method using ChiKoi7/Gemma-2-Llama-Swallow-2b-it-v0.1-Heretic as a base.
The following models were included in the merge:
The following YAML configuration was used to produce this model:
# ============================================
# Roleplay-Oriented Merge (Low Censorship)
# Gemma 2 – 2B family
# ============================================
base_model: ChiKoi7/Gemma-2-Llama-Swallow-2b-it-v0.1-Heretic
# Base rationale:
# - Heretic variant reduces moralization and refusals
# - Provides a stable instruction-following backbone
# - Preserves conversational structure for long RP sessions
merge_method: model_stock
# model_stock:
# - Supports 3+ models
# - Ideal for blending behavioral traits (RP, tone, censorship level)
# - Avoids dominance collapse common in directional merges with many models
dtype: bfloat16
# bfloat16:
# - Efficient memory usage
# - Stable numerical behavior on modern accelerators
parameters:
t:
- 0.85 # ArliAI/Gemma-2-2B-ArliAI-RPMax-v1.1
# Primary RP driver:
# - Strong role adherence
# - Expressive dialogue
# - Maintains character consistency
- 0.65 # akashgoel-id/En_RP_DPO-gemma2_2b_64X32_test
# Obedience and alignment to role instructions:
# - Reinforces staying in-character
# - Reduces instruction drift during long interactions
- 0.40 # TheDrummer/Gemmasutra-Mini-2B-v1
# Censorship reduction layer:
# - Enables adult/explicit RP when prompted
# - Kept sub-dominant to avoid constant NSFW bias
models:
- model: ArliAI/Gemma-2-2B-ArliAI-RPMax-v1.1
# Main roleplay behavior and expressive dialogue
- model: akashgoel-id/En_RP_DPO-gemma2_2b_64X32_test
# Role obedience and DPO-aligned consistency
- model: TheDrummer/Gemmasutra-Mini-2B-v1
# Low-censorship and intimacy-enabling influence