How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="ping98k/gemma-han-2b")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("ping98k/gemma-han-2b")
model = AutoModelForCausalLM.from_pretrained("ping98k/gemma-han-2b")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
Quick Links

for test unsloth finetune process and Inference API

this model overfit with train data so it cannot answer anything not in han dataset

prompt

Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.

### Instruction:
จงแต่งบทกวีเกี่ยวกับสายฝนที่ผ่านมา

### Response:
Downloads last month
27
Safetensors
Model size
3B params
Tensor type
BF16
·
Inference Providers NEW
Input a message to start chatting with ping98k/gemma-han-2b.

Model tree for ping98k/gemma-han-2b

Base model

unsloth/gemma-2b
Quantized
(4)
this model
Quantizations
1 model

Dataset used to train ping98k/gemma-han-2b