Spaces:

NyxKrage
/

LLM-Model-VRAM-Calculator

Running

New cache sizes

by Anthonyg5005 - opened Jun 9, 2024

exllamav2 now has new cache sizes, Q6 and Q8. FP8 may be removed soon

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment