Inference Optimized Checkpoints (with Model Optimizer) Collection: A collection of generative models quantized and optimized for inference with Model Optimizer. • 65 items • Updated 7 days ago
DFlash Collection: Block Diffusion for Flash Speculative Decoding • 21 items • Updated 6 days ago