Nocturne
Collection
Balance size model with good quality. • 6 items • Updated • 1
How to use DoppelReflEx/MN-12B-Mimicore-Nocturne with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-generation", model="DoppelReflEx/MN-12B-Mimicore-Nocturne")
messages = [
{"role": "user", "content": "Who are you?"},
]
pipe(messages) # Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("DoppelReflEx/MN-12B-Mimicore-Nocturne")
model = AutoModelForCausalLM.from_pretrained("DoppelReflEx/MN-12B-Mimicore-Nocturne")
messages = [
{"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
messages,
add_generation_prompt=True,
tokenize=True,
return_dict=True,
return_tensors="pt",
).to(model.device)
outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))How to use DoppelReflEx/MN-12B-Mimicore-Nocturne with vLLM:
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "DoppelReflEx/MN-12B-Mimicore-Nocturne"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "DoppelReflEx/MN-12B-Mimicore-Nocturne",
"messages": [
{
"role": "user",
"content": "What is the capital of France?"
}
]
}'docker model run hf.co/DoppelReflEx/MN-12B-Mimicore-Nocturne
How to use DoppelReflEx/MN-12B-Mimicore-Nocturne with SGLang:
# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
--model-path "DoppelReflEx/MN-12B-Mimicore-Nocturne" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "DoppelReflEx/MN-12B-Mimicore-Nocturne",
"messages": [
{
"role": "user",
"content": "What is the capital of France?"
}
]
}'docker run --gpus all \
--shm-size 32g \
-p 30000:30000 \
-v ~/.cache/huggingface:/root/.cache/huggingface \
--env "HF_TOKEN=<secret>" \
--ipc=host \
lmsysorg/sglang:latest \
python3 -m sglang.launch_server \
--model-path "DoppelReflEx/MN-12B-Mimicore-Nocturne" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "DoppelReflEx/MN-12B-Mimicore-Nocturne",
"messages": [
{
"role": "user",
"content": "What is the capital of France?"
}
]
}'How to use DoppelReflEx/MN-12B-Mimicore-Nocturne with Docker Model Runner:
docker model run hf.co/DoppelReflEx/MN-12B-Mimicore-Nocturne
A nice, simple Slerp merge of my model, DoppelReflEx/MN-12B-Mimicore-WhiteSnake & LatitudeGames/Wayfarer-12B.
Its eval score is a bit lower compare to origin model, Mimicore-WhiteSnake. But in return have a strong advantage in Roleplay with text adventure format (3rd perspective as 'You'). It's also good at normal 3rd perspective and 1st perpective.
Overall, nice to try model, if you want to try.
{
models:
- model: DoppelReflEx/MN-12B-Mimicore-WhiteSnake
- model: LatitudeGames/Wayfarer-12B
merge_method: slerp
base_model: DoppelReflEx/MN-12B-Mimicore-WhiteSnake
parameters:
t: [0.1, 0.3, 0.6, 0.3, 0.1]
dtype: bfloat16
}