B1 validation: multi-step eviction test + transformers compatibility note 1ba26d6
GENOMA LABS / research commited on
How to use GenomaLabs-com/kv-cache-eviction-mla with Transformers:
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("GenomaLabs-com/kv-cache-eviction-mla", dtype="auto")