What's the context length for this model?
#6 opened by DesertCookie
The table on the original model card says 128k context for the vanilla model and 8k for the quantized models. I therefore assume that if I use anything other than FP16, I'm limited to the 8k context, correct?