GGUF quantization error

by Doctor-Chad-PhD - opened Sep 1, 2025

Discussion

Doctor-Chad-PhD

Sep 1, 2025

•

edited Sep 1, 2025

I'm getting this error when trying to quantize this model to gguf with llama.cpp:

AssertionError: HunYuan dynamic RoPE scaling assumptions changed, please update the logic or context length manually

Is there any way to fix this?

Thank you

KnutJaegersberg

Sep 1, 2025

That's odd; the chimera model can be converted to GGUF.

hhoh

Tencent org Sep 2, 2025

We have updated "max_position_embeddings" in config.json. Could you please try again?

octopusmegalopod

Sep 2, 2025

We have updated "max_position_embeddings" in config.json. Could you please try again?

Yes, it can be converted to GGUF and quantized with the change. Thank you.
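
For anyone hitting the same assertion, here is a minimal sketch for checking the fields the error message refers to before re-running the conversion. It assumes the model repository has been downloaded to a local directory containing config.json. The function name `summarize_rope_config` is hypothetical, and the `rope_theta` and `rope_scaling` keys are an assumption based on common Hugging Face configs; only "max_position_embeddings" is named in this thread. The values the converter accepts are not stated here, so this only prints what your local copy contains.

```python
import json
from pathlib import Path


def summarize_rope_config(model_dir):
    """Return context-length and RoPE-related fields from a local config.json.

    Keys that are absent from the config come back as None.
    """
    config_path = Path(model_dir) / "config.json"
    config = json.loads(config_path.read_text(encoding="utf-8"))
    return {
        # Named in the thread as the field that was updated upstream.
        "max_position_embeddings": config.get("max_position_embeddings"),
        # Assumed keys: typical RoPE settings in Hugging Face configs.
        "rope_theta": config.get("rope_theta"),
        "rope_scaling": config.get("rope_scaling"),
    }


# Example usage (the path is a placeholder for your local model directory):
# print(summarize_rope_config("path/to/model"))
```

If the printed "max_position_embeddings" still shows the old value, the local copy likely predates the config.json update and needs to be downloaded again.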

Doctor-Chad-PhD

Sep 2, 2025

@hhoh Thank you, it works for me too now.

holahola2023

Sep 5, 2025

We have updated "max_position_embeddings" in config.json. Could you please try again?

Could you upload official quantized files?

behnamebrahimi

Sep 5, 2025

I've tried many times. I converted the model to GGUF and used it in Ollama, but the output is gibberish and sometimes repeats one word many times. I'm not sure where I'm going wrong.

Doctor-Chad-PhD changed discussion status to closed Sep 9, 2025
