RedHatAI/Qwen3-32B-speculator.eagle3

Text Generation

Using this speculator with Red Hat AI's quantized model

#2 opened about 2 months ago by

Slower throughput with speculative decoding

#1 opened 3 months ago by