zip2zip: Inference-Time Adaptive Vocabularies for Language Models via Token Compression Paper • 2506.01084 • Published Jun 1, 2025 • 7
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("nathanrchn/zip2zip-test", dtype="auto")