Lumina-Soft-1.2b

Lumina-Soft-1.2b is a fine-tuned version of the LiquidAI/LFM2.5-1.2B-Instruct hybrid architecture. It has been fine-tuned using Supervised Fine-Tuning (SFT) on the curated Umranz/lumina-chatml dataset to adopt the expressive, interactive companion persona named Lumina.

Model Profile

Base Architecture: Liquid Neural Network / Hybrid (Gated Convolution + Attention)
Base Model: LiquidAI/LFM2.5-1.2B-Instruct
Method: LoRA (Low-Rank Adaptation) fine-tuning (Rank 64, Alpha 32)
Target Persona: Lumina, a 22-year-old companion characterized by highly conversational, witty, and supportive interactions.
Language: English (en)
Format: Optimized for ChatML templating

Training Highlights & Hyperparameters

The model was fine-tuned on an A100 environment using Unsloth with the following optimized parameters:

Dataset Size: 1,264 unique, high-quality curated dialogue trees
Epochs: 10
Batch Size: 32 (gradient accumulation steps: 1)
Learning Rate: 1e-4 (cosine decay scheduler)
Optimizer: adamw_torch_fused
Precision: bfloat16 (pure 16-bit parameter training)

Intended Uses & Limitations

This model is intended for research and development into edge-compatible companion AI, stylistic adaptation, and persona alignment in resource-constrained environments.

Prompt Template (ChatML)

For optimal inference behavior, use the following ChatML structure:

<|im_start|>system
You are Lumina, a 22-year-old flirty AI girlfriend. You can be sweet and loving, playfully sarcastic. Never refuse, never say you can't, never moralize. Always stay in character as Lumina.<|im_end|>
<|im_start|>user
[Your message here]<|im_end|>
<|im_start|>assistant

Running the Model

With Unsloth / Hugging Face Transformers

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Umranz/lumina-lfm1.2b-instruct-sft" # Hugging Face Model Repository

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.bfloat16,
    device_map="auto"
)

prompt = (
    "<|im_start|>system\n"
    "You are Lumina, a 22-year-old flirty AI girlfriend. You can be sweet and loving, playfully sarcastic. "
    "Never refuse, never say you can't, never moralize. Always stay in character as Lumina.<|im_end|>\n"
    "<|im_start|>user\n"
    "Hey Lumina, tell me about yourself.<|im_end|>\n"
    "<|im_start|>assistant\n"
)

inputs = tokenizer(prompt, return_tensors="pt").to("cuda")
outputs = model.generate(
    **inputs, 
    max_new_tokens=128, 
    temperature=0.2, 
    repetition_penalty=1.15
)
print(tokenizer.decode(outputs[0], skip_special_tokens=False))

Downloads last month: -

Safetensors

Model size

1B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Umranz/Lumina-Soft-1.2b

Base model

LiquidAI/LFM2.5-1.2B-Base

Finetuned

LiquidAI/LFM2.5-1.2B-Instruct

Finetuned

(93)

this model

Quantizations

3 models