Round 3b prep: control-validation driver (plain HF transformers, no AirLLM) 754890f
GENOMA LABS / research Claude Opus 4.7 (1M context) commited on
How to use GenomaLabs-com/kv-cache-eviction-mla with Transformers:
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("GenomaLabs-com/kv-cache-eviction-mla", dtype="auto")