Eval request: HauhauCS/Qwen3.5-9B-Uncensored-HauhauCS-Aggressive

#590
by mildsarcasm - opened

Bumping this to suggest the entire series

I currently get vLLM/transformers errors like "ValueError: GGUF model with architecture qwen35 is not supported yet." when trying to run this. Since HauhauCS only uploads GGUFs, I'll have to wait to test it.
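For context, vLLM's GGUF loading is experimental and only works for architectures vLLM already implements, which is what that ValueError is saying about `qwen35`. For a supported architecture, a local GGUF load looks roughly like the sketch below (the file path and tokenizer repo are placeholders, not this model's actual names):

```shell
# Sketch only: works when the GGUF's architecture is one vLLM supports.
# Pointing --tokenizer at the original HF repo is recommended because
# converting the tokenizer out of the GGUF itself can be slow or lossy.
vllm serve ./model.gguf --tokenizer Qwen/Qwen2.5-7B-Instruct
```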

Would your life be easier using llama.cpp in circumstances like that, or do you deal with models that don't fit in your GPU memory? (I suppose the potential for workflow changes could be non-zero, depending on how you're calling vLLM.)

Yeah, I'd have to change some code to get GGUFs to run with llama.cpp. It'd probably also take longer without vLLM's batching, but I could try it sometime.
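One possible way to keep the code change small: llama.cpp's `llama-server` exposes an OpenAI-compatible HTTP endpoint, so if the eval harness already speaks an OpenAI-style API to vLLM, switching might mostly be a base-URL change. A hypothetical invocation (the model path is a placeholder):

```shell
# Sketch, assuming a llama.cpp build with llama-server available.
# -ngl 99 offloads all layers to the GPU; --parallel gives a few
# concurrent request slots, which is simpler than vLLM's continuous
# batching and likely slower for large eval runs.
llama-server -m ./model.gguf -ngl 99 --parallel 4 --port 8000
```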
