Gemma 270M Character Generator
A fine-tuned version of Google's Gemma 3 270M instruction-tuned model, specialized in generating creative character descriptions for storytelling and creative writing projects.
Model Details
Model Description
This model generates unique character names and descriptions based on story genre and setting. It has been fine-tuned using LoRA (Low-Rank Adaptation) on the CharacterCodex dataset, making it capable of creating diverse characters across various genres including Fantasy, Sci-Fi, Horror, Manga, Cyberpunk, and more.
The model takes a genre and setting as input and produces a character name followed by a detailed description including physical appearance, personality traits, and unique characteristics.
- Developed by: [Your Name/Organization]
- Model type: Causal Language Model (Text Generation)
- Language(s): English
- License: Apache 2.0
- Finetuned from model: google/gemma-3-270m-it
Model Sources
- Base Model: google/gemma-3-270m-it
- Dataset: NousResearch/CharacterCodex
Uses
Direct Use
This model is designed for:
- Creative writers generating characters for stories, novels, or screenplays
- Game developers creating NPCs and character concepts
- Dungeon Masters generating characters for tabletop RPGs
- Content creators needing character ideas for various media
- Writing prompts and creative inspiration
Example Usage
```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import PeftModel

# Load base model
base_model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-3-270m-it",
    torch_dtype=torch.bfloat16,
    device_map="auto"
)

# Load fine-tuned LoRA adapters
model = PeftModel.from_pretrained(base_model, "your-username/gemma-character-generator")
tokenizer = AutoTokenizer.from_pretrained("your-username/gemma-character-generator")

# Generate a character
messages = [{
    "role": "user",
    "content": "Create a character for a Fantasy story. Setting: A mystical forest inhabited by ancient spirits"
}]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

outputs = model.generate(
    **inputs,
    max_new_tokens=256,
    temperature=0.7,
    top_p=0.9,
    repetition_penalty=1.2,
    do_sample=True
)

# Decode only the newly generated tokens, skipping the prompt
character = tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True)
print(character)
```
Example Output
Input:
- Genre: Fantasy
- Setting: A mystical forest inhabited by ancient spirits
Output:
Elara Moonshadow
A half-elf druid with silver hair that flows like moonlight and emerald eyes that glow faintly in the darkness. She wears robes woven from living vines and moss, adorned with crystals that pulse with ancient magic. Elara can communicate with the forest spirits and carries a staff carved from the heartwood of a thousand-year-old oak. Her presence brings calm to troubled souls, though she harbors a deep sorrow from a past betrayal.
Out-of-Scope Use
This model is not suitable for:
- Generating real person descriptions or impersonating real individuals
- Creating harmful, offensive, or discriminatory character stereotypes
- Medical, legal, or financial advice through character personas
- Generating characters for misleading or malicious purposes
Bias, Risks, and Limitations
- The model may reflect biases present in the training data (CharacterCodex dataset)
- Generated characters may sometimes include stereotypical traits based on genre conventions
- The model works best with genres well-represented in the training data (Fantasy, Sci-Fi, Horror)
- May generate repetitive descriptions if temperature is set too low
- Limited to character descriptions; does not generate character stats, abilities, or game mechanics
Recommendations
- Review and edit generated content to ensure it aligns with your creative vision
- Adjust generation parameters (temperature, top_p, repetition_penalty) for varied outputs
- Use the model as a creative starting point rather than final output
- Be mindful of cultural sensitivity when using generated characters
- Test with different prompts if initial results don't meet expectations
Training Details
Training Data
The model was fine-tuned on a filtered subset of the CharacterCodex dataset, containing approximately 3,000 character entries from various media sources including:
- Fantasy novels and games
- Sci-Fi literature
- Manga and anime
- Horror fiction
- Video games
- Tabletop RPGs
Only entries from media sources with more than 10 samples were included to ensure quality and diversity.
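The frequency filter described above can be sketched in plain Python. The records and the `media_source` field name below are illustrative stand-ins for the dataset's schema; in practice the data would come from `datasets.load_dataset("NousResearch/CharacterCodex")`.

```python
from collections import Counter

def filter_by_source_count(records, min_samples=10):
    """Keep only entries whose media source appears more than min_samples times."""
    counts = Counter(r["media_source"] for r in records)
    return [r for r in records if counts[r["media_source"]] > min_samples]

# Toy records: 12 entries from one source, 5 from another
records = (
    [{"media_source": "Series A", "character_name": f"A{i}"} for i in range(12)]
    + [{"media_source": "Series B", "character_name": f"B{i}"} for i in range(5)]
)
filtered = filter_by_source_count(records)
# Only "Series A" (12 > 10 samples) survives the filter.
```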
Training Procedure
Fine-tuning Method
LoRA (Low-Rank Adaptation) was used to efficiently fine-tune the model:
- Only ~1.5% of model parameters were trained
- Adapter layers applied to attention and MLP modules
- Preserves base model knowledge while specializing for character generation
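As a back-of-envelope check on adapter size: a LoRA adapter on a weight of shape `d_out × d_in` adds two factors, `A` of shape `r × d_in` and `B` of shape `d_out × r`. The dimensions in the example below are illustrative, not taken from the actual Gemma config.

```python
def lora_param_count(d_in, d_out, r=8):
    """Parameters added by one rank-r LoRA adapter: A is (r x d_in), B is (d_out x r)."""
    return r * (d_in + d_out)

# Example: a rank-8 adapter on a 640 -> 2048 projection (assumed dimensions)
extra = lora_param_count(640, 2048, r=8)  # 8 * (640 + 2048) = 21504
```

Summing this over every targeted projection in every layer gives the small trainable-parameter fraction quoted above.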
Data Formatting
Training examples were formatted as conversational turns:
```
User: Create a character for a [GENRE] story. Setting: [SETTING]
Assistant: [CHARACTER_NAME]
[CHARACTER_DESCRIPTION]
```
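A minimal sketch of this formatting step; the helper name and field arguments are illustrative, not the card's actual preprocessing code.

```python
def format_example(genre, setting, name, description):
    """Build one chat-formatted training example in the pattern above."""
    return [
        {"role": "user",
         "content": f"Create a character for a {genre} story. Setting: {setting}"},
        {"role": "assistant",
         "content": f"{name}\n{description}"},
    ]

example = format_example("Fantasy", "A mystical forest", "Elara Moonshadow",
                         "A half-elf druid with silver hair.")
```

Each formatted pair would then be serialized with the tokenizer's chat template before training.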
Training Hyperparameters
- Base Model: google/gemma-3-270m-it (270M parameters)
- Training Method: LoRA (Low-Rank Adaptation)
- LoRA Rank (r): 8
- LoRA Alpha: 32
- LoRA Dropout: 0.05
- Target Modules: q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
- Learning Rate: 5e-5
- Optimizer: AdamW (8-bit)
- Learning Rate Scheduler: Cosine
- Warmup Ratio: 0.1
- Training Epochs: 3
- Batch Size per Device: 4
- Gradient Accumulation Steps: 4
- Effective Batch Size: 16
- Max Sequence Length: 512 tokens
- Training Precision: bfloat16
- Training Framework: TRL (Transformer Reinforcement Learning)
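The hyperparameters above map onto PEFT and TRL configuration objects roughly as follows. This is a sketch, not the actual training script; argument names such as `max_seq_length` have been renamed in recent TRL versions, so check against your installed version.

```python
from peft import LoraConfig
from trl import SFTConfig

peft_config = LoraConfig(
    r=8,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
    task_type="CAUSAL_LM",
)

training_args = SFTConfig(
    learning_rate=5e-5,
    lr_scheduler_type="cosine",
    warmup_ratio=0.1,
    num_train_epochs=3,
    per_device_train_batch_size=4,
    gradient_accumulation_steps=4,
    optim="adamw_bnb_8bit",
    bf16=True,
    max_seq_length=512,
    output_dir="gemma-character-generator",
)
```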
Training Hardware
- GPU: NVIDIA T4 / A100 (Google Colab)
- Training Time: ~1-2 hours (3 epochs on 3,000 samples)
- GPU Memory Usage: ~10-12 GB
Evaluation
Generation Quality
The model was evaluated through:
- Manual inspection of generated characters across different genres
- Coherence testing - ensuring character descriptions are logically consistent
- Diversity testing - verifying varied outputs with different temperature settings
- Format adherence - checking output follows expected structure (name + description)
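The format-adherence check can be approximated with a simple heuristic: a short name line followed by a multi-sentence description. The function and its thresholds below are illustrative choices, not the card's actual evaluation code.

```python
def follows_expected_format(text):
    """Heuristic check that an output is a short name line followed by a description."""
    lines = [line.strip() for line in text.strip().splitlines() if line.strip()]
    if len(lines) < 2:
        return False
    name, description = lines[0], " ".join(lines[1:])
    # Name: a few words, no sentence-ending period; description: reasonably long.
    return (len(name.split()) <= 6
            and not name.endswith(".")
            and len(description.split()) >= 15)

sample = ("Elara Moonshadow\n"
          "A half-elf druid with silver hair that flows like moonlight "
          "and emerald eyes that glow faintly in the darkness.")
```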
Sample Generations
Genre: Sci-Fi
Setting: A space station orbiting a dying star
Commander Aria Vex
A cybernetically enhanced human with chrome-plated neural implants visible along her temples. Her eyes have been replaced with advanced optical sensors that glow ice-blue in low light. She wears a patched Alliance military jacket over her station-issued jumpsuit, decorated with medals from the Outer Rim conflicts. Despite her harsh exterior, she carries deep guilt over the crew members lost under her command.
Genre: Horror
Setting: An abandoned asylum with whispers in the walls
Dr. Elias Blackwood
A gaunt psychiatrist with hollow cheeks and eyes that seem to have witnessed unspeakable horrors. His white coat is stained with substances best left unidentified, and he carries a leather journal filled with illegible notes written in trembling handwriting. He speaks in hushed tones and frequently glances over his shoulder, as if something is following him through the empty corridors.
Environmental Impact
- Hardware Type: NVIDIA T4 GPU (Google Colab)
- Hours used: ~1.5 hours
- Cloud Provider: Google Cloud Platform
- Compute Region: US (variable)
- Carbon Emitted: Minimal (~0.05 kg CO2eq estimated for training)
Fine-tuning with LoRA significantly reduces computational requirements compared to full model training, resulting in lower environmental impact.
Technical Specifications
Model Architecture
- Architecture: Gemma (Decoder-only Transformer)
- Base Parameters: 270M
- Trainable Parameters: ~4M (LoRA adapters, 1.5% of total)
- Attention Heads: 4
- Hidden Size: 640
- Layers: 18
- Context Length: 32,768 tokens (base model capability)
- Vocabulary Size: 262,144 tokens
Compute Infrastructure
Hardware
- Training: Google Colab with NVIDIA T4/A100 GPU
- Inference: Can run on consumer GPUs with 6GB+ VRAM
Software
- Framework: Transformers 4.46+
- Training Library: TRL (Transformer Reinforcement Learning)
- PEFT Library: PEFT 0.13+
- Python Version: 3.10+
- PyTorch: 2.0+
Citation
If you use this model in your work, please cite:
BibTeX:
```bibtex
@misc{gemma-character-generator-2026,
  author       = {Your Name},
  title        = {Gemma 270M Character Generator: Fine-tuned Model for Creative Character Generation},
  year         = {2026},
  publisher    = {HuggingFace},
  howpublished = {\url{https://huggingface.co/your-username/gemma-character-generator}}
}
```
Base Model Citation:

```bibtex
@article{gemma3_2025,
  title   = {Gemma 3 Technical Report},
  author  = {Gemma Team},
  year    = {2025},
  journal = {Google DeepMind}
}
```
Glossary
- LoRA (Low-Rank Adaptation): An efficient fine-tuning method that adds trainable low-rank matrices to model layers
- PEFT (Parameter-Efficient Fine-Tuning): Techniques for fine-tuning large models with minimal parameter updates
- Temperature: Controls randomness in generation; higher values (0.8-1.0) produce more creative/diverse outputs
- Top-p (Nucleus Sampling): Samples from the smallest set of tokens whose cumulative probability exceeds p
- Repetition Penalty: Discourages the model from repeating the same tokens/phrases
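The top-p definition above can be made concrete with a toy implementation over an explicit probability table. This is a didactic sketch of nucleus sampling, not how `generate()` implements it internally.

```python
import random

def top_p_sample(probs, p=0.9, rng=None):
    """Sample from the smallest set of highest-probability tokens whose
    cumulative probability reaches p (nucleus sampling)."""
    rng = rng or random.Random()
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
    nucleus, cumulative = [], 0.0
    for token, prob in ranked:
        nucleus.append((token, prob))
        cumulative += prob
        if cumulative >= p:
            break
    tokens, weights = zip(*nucleus)
    return rng.choices(tokens, weights=weights)[0]

probs = {"dragon": 0.5, "wizard": 0.3, "forest": 0.15, "teapot": 0.05}
# With p=0.9 the nucleus is {dragon, wizard, forest}; "teapot" is never drawn.
```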
More Information
Generation Tips
- For more creative characters: Increase temperature to 0.8-0.9
- For more focused characters: Decrease temperature to 0.5-0.6
- To prevent repetition: Set repetition_penalty to 1.2-1.3
- For longer descriptions: Increase max_new_tokens to 384-512
- For varied outputs: Try different random seeds with torch.manual_seed()
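The tips above can be bundled into keyword presets for `model.generate()`. The helper and the exact numbers are suggestions following this card's guidance, not tuned values.

```python
def generation_kwargs(style="balanced"):
    """Map a named style to generate() keyword arguments (values are suggestions)."""
    presets = {
        "creative": {"temperature": 0.9, "max_new_tokens": 256},
        "balanced": {"temperature": 0.7, "max_new_tokens": 256},
        "focused":  {"temperature": 0.5, "max_new_tokens": 256},
    }
    return {"do_sample": True, "top_p": 0.9, "repetition_penalty": 1.2,
            **presets[style]}
```

Usage: `model.generate(**inputs, **generation_kwargs("creative"))`.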
Supported Genres
Works well with: Fantasy, Sci-Fi, Horror, Cyberpunk, Steampunk, Manga, Anime, Mystery, Thriller, Post-Apocalyptic, Urban Fantasy, Space Opera