How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="enochlev/MiniCPM-duplex", trust_remote_code=True)
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)
# Load model directly
from transformers import AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained("enochlev/MiniCPM-duplex", trust_remote_code=True, dtype="auto")
Quick Links

MiniCPM-duplex (safetensors)

Modern safetensors conversion of xinrongzhang2022/MiniCPM-duplex.

Weights are identical — only the serialization format has changed from pytorch_model.bin to model.safetensors, enabling memory-mapped loading and compatibility with current versions of Transformers.

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

tokenizer = AutoTokenizer.from_pretrained(
    "enochlev/MiniCPM-duplex", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    "enochlev/MiniCPM-duplex",
    trust_remote_code=True,
    dtype=torch.float16,
    device_map="auto",
)

prompt = "<用户>Hello, what can you do?<AI>"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(out[0], skip_special_tokens=True))

Original model

See xinrongzhang2022/MiniCPM-duplex for the original weights, paper, and full documentation.

Downloads last month
-
Safetensors
Model size
3B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for enochlev/MiniCPM-duplex

Finetuned
(1)
this model