🪽 SIQ-1 Tiny

#2581

by redaihf - opened 16 days ago

Discussion

redaihf

16 days ago

https://huggingface.co/AlexWortega/SIQ-1-35B

RichardErkhov

16 days ago

It's queued, but I'm not sure it will quant, as it failed recently.

You can check for progress at http://hf.tst.eu/status.html or regularly check the model
summary page at https://hf.tst.eu/model#SIQ-1-35B-GGUF for quants to appear.

redaihf

14 days ago

I assume quanting has failed again. Do you have any idea why, @AlexWortega?

RichardErkhov

13 days ago

llama_model_load: error loading model: missing tensor 'blk.40.attn_norm.weight'
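One way to narrow down an error like this is to list the tensor names stored in the source safetensors file and see which layer indices are actually present. The sketch below is only an illustration: it reads the safetensors header with the standard library (an 8-byte little-endian length followed by a JSON header), and it assumes the checkpoint uses the common `.layers.<index>.` naming. The helper names `tensor_names`, `layer_indices`, and `missing_layers` are hypothetical, and the expected layer count has to be supplied by whoever runs it.

```python
import json
import re
import struct


def tensor_names(path):
    """Return the tensor names recorded in a single safetensors file."""
    with open(path, "rb") as f:
        # The file starts with an unsigned 64-bit little-endian header length.
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len))
    # "__metadata__" is an optional free-form entry, not a tensor.
    return sorted(name for name in header if name != "__metadata__")


def layer_indices(names, pattern=r"\.layers\.(\d+)\."):
    """Collect layer indices, ASSUMING names look like '...layers.<index>...'."""
    found = set()
    for name in names:
        match = re.search(pattern, name)
        if match:
            found.add(int(match.group(1)))
    return sorted(found)


def missing_layers(names, expected_count):
    """Indices in range(expected_count) that no tensor name refers to."""
    present = set(layer_indices(names))
    return [i for i in range(expected_count) if i not in present]
```

For a sharded checkpoint, the same functions would need to be run over every shard and the names merged before checking for gaps.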

redaihf

10 days ago

This may have been fixed.

MuXodious

7 days ago

•

edited 7 days ago

There's a project for grafting MTP layers from the base model into GGUF-encapsulated models. They seem to suggest a similar approach in their fix. It requires additional work for each GGUF quant. Instead, they can try re-exporting the model while ensuring that the MTP weights are loaded. A complete safetensors model should help the mradermacher team's efforts to produce GGUF quants.

That being said, I have conjured up a small script for grafting MTP layers directly into the safetensors instead, which I used for the MTP-restored models, including the experimental https://huggingface.co/MuXodious/MiniCPM-V-4.6-Thinking-heretic-MTP. ⛱️
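For readers wondering what grafting into safetensors can look like, here is a minimal sketch. It is not the script mentioned above: it is a standard-library illustration that copies every tensor whose name starts with a given prefix from a base file into a fine-tuned file when the fine-tuned file lacks it. The prefix `"mtp."` and the function names are assumptions made for the example, and it handles only single-file checkpoints that fit in memory, not sharded ones.

```python
import json
import struct


def read_safetensors(path):
    """Load a safetensors file as {name: (dtype, shape, raw_bytes)} plus metadata."""
    with open(path, "rb") as f:
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len))
        data = f.read()  # tensor bytes; offsets are relative to this buffer
    metadata = header.pop("__metadata__", None)
    tensors = {}
    for name, info in header.items():
        begin, end = info["data_offsets"]
        tensors[name] = (info["dtype"], info["shape"], data[begin:end])
    return tensors, metadata


def write_safetensors(path, tensors, metadata=None):
    """Write tensors back out with freshly computed, contiguous offsets."""
    header = {}
    if metadata:
        header["__metadata__"] = metadata
    offset = 0
    chunks = []
    for name, (dtype, shape, raw) in tensors.items():
        header[name] = {
            "dtype": dtype,
            "shape": shape,
            "data_offsets": [offset, offset + len(raw)],
        }
        offset += len(raw)
        chunks.append(raw)
    blob = json.dumps(header, separators=(",", ":")).encode("utf-8")
    blob += b" " * (-len(blob) % 8)  # pad the header with spaces to 8 bytes
    with open(path, "wb") as f:
        f.write(struct.pack("<Q", len(blob)))
        f.write(blob)
        for chunk in chunks:
            f.write(chunk)


def graft_missing(base_path, tuned_path, out_path, prefix="mtp."):
    """Copy base tensors under `prefix` (an ASSUMED name) that the tuned file lacks."""
    base, _ = read_safetensors(base_path)
    tuned, metadata = read_safetensors(tuned_path)
    grafted = [n for n in base if n.startswith(prefix) and n not in tuned]
    for name in grafted:
        tuned[name] = base[name]
    write_safetensors(out_path, tuned, metadata)
    return grafted
```

The function returns the list of grafted names, so an empty list means the fine-tuned file already had everything under that prefix. The actual tensor names for MTP weights depend on the model architecture and should be checked against the base checkpoint first.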

AlexWortega

7 days ago

Maybe because of MTP.
Maybe because I disabled the VL module.
