Just Read Twice!
Collection: models and benchmarks for the Just Read Twice work: https://arxiv.org/abs/2407.05483
How to use hazyresearch/based-360M-30B with Transformers:
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("hazyresearch/based-360M-30B", dtype="auto")

Configuration Parsing Warning: In config.json: "architectures" must be an array
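
The configuration warning says that the "architectures" field in config.json should be a JSON array rather than a single string. As a minimal sketch of what that correction could look like on a local copy of the file, the hypothetical helper below wraps a string value in a list. The helper name `normalize_architectures` and the sample values ("ExampleModel", `hidden_size`) are illustrative assumptions, not taken from the actual repository or from the Transformers API.

```python
import json


def normalize_architectures(config: dict) -> dict:
    """Return a copy of `config` whose "architectures" value is a list.

    Hypothetical helper for illustration only; it is not part of Transformers.
    """
    fixed = dict(config)  # shallow copy so the caller's dict is left untouched
    value = fixed.get("architectures")
    if isinstance(value, str):
        # A bare string such as "ExampleModel" becomes ["ExampleModel"].
        fixed["architectures"] = [value]
    return fixed


# Illustrative config contents (assumed, not the real config.json).
raw = json.loads('{"architectures": "ExampleModel", "hidden_size": 1024}')
fixed = normalize_architectures(raw)
print(json.dumps(fixed, indent=2))
```

A config whose "architectures" value is already a list, or which has no such key, passes through unchanged, so the helper is safe to apply more than once.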