How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="K-Compression/HyperCLOVAX-SEED-Think-14B-GPTQ", trust_remote_code=True)
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("K-Compression/HyperCLOVAX-SEED-Think-14B-GPTQ", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("K-Compression/HyperCLOVAX-SEED-Think-14B-GPTQ", trust_remote_code=True)
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
Quick Links

HyperCLOVAX-SEED-Think-14B-GPTQ

Instruction

This repo contains GPTQ model files for HyperCLOVAX-SEED-Think-14B.

HyperCLOVAX-SEED-Think-14B-GPTQ was quantized using gptqmodel v4.0.0, following the guide.

Model Configuration

Quickstart

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "K-Compression/HyperCLOVAX-SEED-Think-14B-GPTQ"
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="bfloat16",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

Performance(Non-Think)

Model MMLU (0-shot) HAERAE (0-shot)
HyperCLOVA X SEED 14B Think 0.7144 0.8130
HyperCLOVA X SEED 14B Think-GPTQ 0.7018 0.8139

License

The model is licensed under HyperCLOVA X SEED Model License Agreement

Downloads last month
137
Safetensors
Model size
15B params
Tensor type
I32
·
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for K-Compression/HyperCLOVAX-SEED-Think-14B-GPTQ

Quantized
(3)
this model