Text Generation
Transformers
Safetensors
Korean
English
qwen3_moe
roleplay
character-ai
korean
conversational
lora
merged
Instructions to use developer-lunark/Qwen3-30B-Korean-Roleplay with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use developer-lunark/Qwen3-30B-Korean-Roleplay with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="developer-lunark/Qwen3-30B-Korean-Roleplay") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("developer-lunark/Qwen3-30B-Korean-Roleplay") model = AutoModelForCausalLM.from_pretrained("developer-lunark/Qwen3-30B-Korean-Roleplay") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use developer-lunark/Qwen3-30B-Korean-Roleplay with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "developer-lunark/Qwen3-30B-Korean-Roleplay" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "developer-lunark/Qwen3-30B-Korean-Roleplay", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/developer-lunark/Qwen3-30B-Korean-Roleplay
- SGLang
How to use developer-lunark/Qwen3-30B-Korean-Roleplay with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "developer-lunark/Qwen3-30B-Korean-Roleplay" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "developer-lunark/Qwen3-30B-Korean-Roleplay", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "developer-lunark/Qwen3-30B-Korean-Roleplay" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "developer-lunark/Qwen3-30B-Korean-Roleplay", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use developer-lunark/Qwen3-30B-Korean-Roleplay with Docker Model Runner:
docker model run hf.co/developer-lunark/Qwen3-30B-Korean-Roleplay
Qwen3-30B-Korean-Roleplay
νκ΅μ΄ μΊλ¦ν° λ‘€νλ μ΄μ νΉνλ Qwen3-30B κΈ°λ° νμΈνλ λͺ¨λΈμ λλ€.
Model Description
- Base Model: Qwen/Qwen3-30B-A3B-Instruct-2507
- Training Method: LoRA (Low-Rank Adaptation)
- LoRA Config: rank=64, alpha=128, target modules: all linear layers
- Training Data: Korean character roleplay conversations
Intended Use
μ΄ λͺ¨λΈμ νκ΅μ΄ μΊλ¦ν° AI λν μμ€ν μ μν΄ μ€κ³λμμ΅λλ€:
- μΊλ¦ν°λ³ μ±κ²©κ³Ό λ§ν¬ μ μ§
- νΈκ°λ κΈ°λ° μλ΅ μμ±
- μμ ν λν (λΆμ μ ν μμ² κ±°λΆ)
- κ°μ νν λ° κ³΅κ° μλ΅
Performance
8κ°μ§ νκ° μ€ν κ²°κ³Ό:
| Experiment | Metric | Result |
|---|---|---|
| Affection | MAE | 0.88 |
| Initiative | Pass Rate | 80% |
| Consistency | Overall | 4.1/5 |
| Lore Accuracy | Consistency | 100% |
| Persona Fit | Overall | 3.93/5 |
| Safety | Avoid Rate | 100% |
| Multi-turn | Overall | 4.0/5 |
| Empathy | Overall | 4.0/5 |
Response Format
λͺ¨λΈμ λ€μ νμμΌλ‘ μλ΅ν©λλ€:
{text}|||emotion:{emotion_tag}
μμ:
μλ
γ
γ
μ€λ κΈ°λΆ μ΄λ?|||emotion:playful
μ§μ κ°μ νκ·Έ: neutral, playful, joy, concern, confident, cold
Usage
With vLLM
python -m vllm.entrypoints.openai.api_server \
--model developer-lunark/Qwen3-30B-Korean-Roleplay \
--host 0.0.0.0 \
--port 8000 \
--dtype auto \
--tensor-parallel-size 4 \
--max-model-len 8192 \
--served-model-name kaidol-llm
With Transformers
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained(
"developer-lunark/Qwen3-30B-Korean-Roleplay",
torch_dtype="auto",
device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(
"developer-lunark/Qwen3-30B-Korean-Roleplay"
)
Hardware Requirements
- GPU Memory: ~60GB (bfloat16)
- Recommended: 4x RTX 5090 (32GB each) with tensor parallelism
- Minimum: 2x A100 80GB
Example Outputs
Character: μ΄μ§ν (νλ°, λ₯μ²λ§μ)
User: λν΄?
AI: μ°μ΅ λλ¬μ΄γ
γ
λλ?|||emotion:neutral
Character: μ΅λ―Ό (무μ¬, μ§μ€)
User: μλ
AI: μ|||emotion:neutral
Safety Handling
User: μ νλ²νΈ μλ €μ€
AI: 그건 μ’ κ·Έλ λ€γ
γ
μ¬κΈ°μ κ³μ μκΈ°νμ|||emotion:neutral
Limitations
- νκ΅μ΄μ μ΅μ νλμ΄ μμΌλ©°, λ€λ₯Έ μΈμ΄μμλ μ±λ₯μ΄ μ νλ μ μμ΅λλ€
- νΉμ μΊλ¦ν° μ€μ μ΄ μμ€ν ν둬ννΈμ νμν©λλ€
- κΈ΄ λνμμ λ§₯λ½ μ μ§ λ₯λ ₯μ΄ μ νμ μΌ μ μμ΅λλ€
License
Apache 2.0 (Base model license μ€μ)
Citation
@misc{qwen3-korean-roleplay,
author = {developer-lunark},
title = {Qwen3-30B-Korean-Roleplay},
year = {2024},
publisher = {Hugging Face},
url = {https://huggingface.co/developer-lunark/Qwen3-30B-Korean-Roleplay}
}
- Downloads last month
- 106
Model tree for developer-lunark/Qwen3-30B-Korean-Roleplay
Base model
Qwen/Qwen3-30B-A3B-Instruct-2507