Eval request: GLM-4.7-Flash

#523
by Pentium95 - opened

Funny how one of the only A3B models isn't on the Leaderboard yet. +1 on this!!!

Probably because vLLM support is still not perfect, and llama.cpp still has lots of prompt-processing slowdowns and output repetition. I guess the project owner is waiting to make sure they address and fix the "day-1" bugs, to avoid having to benchmark them again 😁

I had been getting "Value error, The checkpoint you are trying to load has model type glm4_moe_lite but Transformers does not recognize this architecture.", but was able to fix that by installing transformers from source.
But now I'm getting "ValueError: There is no module or parameter named 'model.layers.46.mlp.gate.e_score_correction_bias' in TransformersMoEForCausalLM"
Similar to the one reported in https://huggingface.co/zai-org/GLM-4.7-Flash/discussions/34
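For anyone hitting the same glm4_moe_lite error, here's a minimal sketch of installing transformers from source, assuming pip and that the architecture support has landed on the main branch:

```bash
# Install transformers straight from the main branch on GitHub,
# which carries architectures not yet in a tagged release.
pip install git+https://github.com/huggingface/transformers.git
```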

https://discuss.vllm.ai/t/glm-4-7-flash-with-nvidia/2256/2

They suggest installing the nightly wheels too.
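For reference, a sketch of what that looks like, assuming the standard vLLM nightly wheel index:

```bash
# Upgrade to the latest vLLM pre-release build from the nightly index.
pip install -U vllm --pre --extra-index-url https://wheels.vllm.ai/nightly
```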

May I have this thrown in with the Extra? https://huggingface.co/MuXodious/GLM-4.7-Flash-REAP-23B-A3B-absolute-heresy 👀

I didn't notice that LordNeel/GLM-4.7-Flash-Unblinded-Mastery is just a LoRA, not merged back into the model. My bad.
