Does it not support FP8 KV cache?

#2
by stoneopsx - opened

Does it not support FP8 KV cache?

You need to modify the "vllm/_custom_ops.py" file to enable support for kv_cache_dtype.
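
For reference, once the backend accepts it, an FP8 KV cache is normally requested through vLLM's `--kv-cache-dtype` flag (the `kv_cache_dtype` engine argument). Below is a minimal sketch that only builds the launch command. The helper name `build_serve_command`, the list of accepted values, and the `path/to/model` placeholder are assumptions for illustration, and older vLLM versions may use a different entrypoint than `vllm serve`. It does not show the change needed in "vllm/_custom_ops.py".

```python
import shlex

# Assumed set of values for --kv-cache-dtype; check `vllm serve --help`
# on your installed version, since the accepted values vary by release.
KV_CACHE_DTYPES = ("auto", "fp8", "fp8_e4m3", "fp8_e5m2")


def build_serve_command(model: str, kv_cache_dtype: str = "fp8") -> list[str]:
    """Build a vLLM launch command that requests the given KV cache dtype.

    Hypothetical helper: it only assembles the argument list and does not
    import or start vLLM.
    """
    if kv_cache_dtype not in KV_CACHE_DTYPES:
        raise ValueError(f"unsupported kv_cache_dtype: {kv_cache_dtype!r}")
    return ["vllm", "serve", model, "--kv-cache-dtype", kv_cache_dtype]


if __name__ == "__main__":
    # "path/to/model" is a placeholder, not the model from this thread.
    print(shlex.join(build_serve_command("path/to/model")))
```

If the server rejects the option at startup for this model, that would point back to the missing support mentioned above.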
