ospatch
/

QwQ-32B-INT8-W8A8

Text Generation

text-generation-inference

8-bit precision

compressed-tensors

Model card Files Files and versions

Resources

View closed (0)

Woks 2x slower than GGUF q8

#2 opened 11 months ago by

Context length

#1 opened 12 months ago by

matthew-at-qamcom