plz's picture

plz

qenme

·

AI & ML interests

None yet

Recent Activity

new activity about 22 hours ago

cyankiwi/Qwen3.6-27B-AWQ-BF16-INT8:Compatible with vLLM on Ampere?

new activity 13 days ago

RedHatAI/gemma-4-31B-it-FP8-Dynamic:Can you add more details to the model card?

new activity 21 days ago

google/gemma-4-31B-it-qat-w4a16-ct:26b with w416a-ct

View all activity

Organizations

None yet

New activity in cyankiwi/Qwen3.6-27B-AWQ-BF16-INT8 about 22 hours ago

Compatible with vLLM on Ampere?

#3 opened about 2 months ago by

New activity in RedHatAI/gemma-4-31B-it-FP8-Dynamic 13 days ago

Can you add more details to the model card?

#1 opened 3 months ago by

New activity in google/gemma-4-31B-it-qat-w4a16-ct 21 days ago

26b with w416a-ct

#3 opened 22 days ago by

New activity in google/gemma-4-31B-it-qat-w4a16-ct 23 days ago

Gemma team in the back making shrimp fried rice!!!!

#1 opened 23 days ago by

New activity in Minachist/Qwen3.6-27B-INT8-AutoRound 23 days ago

KL divergence benchmark

#3 opened 24 days ago by

New activity in google/gemma-4-31B-it 25 days ago

Tokenizer problems, or just quants?

#105 opened about 2 months ago by

New activity in Minachist/Qwen3.6-27B-INT8-AutoRound about 1 month ago

Me again

#2 opened about 1 month ago by

New activity in AesSedai/Qwen3.6-35B-A3B-GGUF about 1 month ago

MTP support?

#5 opened about 1 month ago by

Q6_K?

#1 opened 2 months ago by

New activity in google/gemma-4-26B-A4B-it about 1 month ago

Very bad results with model quant and KV cache quant, only BF16 works well

#34 opened 2 months ago by

New activity in cyankiwi/Qwen3.6-27B-AWQ-BF16-INT4 about 1 month ago

F16 or BF16?

#6 opened about 1 month ago by

New activity in unsloth/Qwen3.6-27B-MTP-GGUF about 1 month ago

FYI : --spec-type mtp syntax has changed to --spec-type draft-mtp

#14 opened about 2 months ago by

New activity in unsloth/Qwen3.6-27B-MTP-GGUF about 2 months ago

presence-penalty

#8 opened about 2 months ago by

New activity in Minachist/Qwen3.6-27B-INT8-AutoRound about 2 months ago

Good quant!

#1 opened about 2 months ago by

New activity in AesSedai/MiMo-V2.5-GGUF about 2 months ago

Working good on 96GB VRAM + DDR5 Setup

#2 opened about 2 months ago by

New activity in google/gemma-4-31B-it 2 months ago

GOOLE WHERE IS MTP ?

#82 opened 2 months ago by

New activity in Qwen/Qwen3.6-27B 2 months ago

10/10

#4 opened 2 months ago by

New activity in unsloth/Qwen3.6-27B-GGUF 2 months ago

thanks!

#1 opened 2 months ago by

New activity in google/gemma-4-31B-it 2 months ago

Will there be a small model for speculative decoding?

#71 opened 2 months ago by

New activity in AesSedai/Qwen3.5-35B-A3B-GGUF 4 months ago

Thanks

#2 opened 4 months ago by