Duplicate from shb777/Llama-3.3-8B-Instruct-128K
license: llama3.3
base_model:
  - allura-forge/Llama-3.3-8B-Instruct
pipeline_tag: text-generation

Llama 3.3 8B 128K Instruct (Fixed)

Based on the original allura-forge/Llama-3.3-8B-Instruct. Thanks!

Evals and GGUFs

Additional Fixes:

  • Added rope_scaling
  • Added chat template (Unsloth) in tokenizer config
  • Updated generation config
  • Enabled full context length
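
The fixes above live in the repository's JSON config files, so they can be sanity-checked without loading any weights. The following is a minimal sketch, not the author's tooling: the function names are hypothetical, and the example values (a `llama3`-type `rope_scaling` block and a 131072-token limit) are assumptions modeled on the settings published with Llama 3.1, not values read from this repository.

```python
import json

# Hypothetical excerpt of a config.json. The numbers are assumptions that
# mirror the "llama3" rope_scaling block shipped with Llama 3.1; check the
# actual file in the repository before relying on them.
EXAMPLE_CONFIG = json.loads("""
{
  "max_position_embeddings": 131072,
  "rope_scaling": {
    "rope_type": "llama3",
    "factor": 8.0,
    "low_freq_factor": 1.0,
    "high_freq_factor": 4.0,
    "original_max_position_embeddings": 8192
  }
}
""")


def missing_long_context_fields(config, min_context=131072):
    """Return a list of problems that would prevent long-context use."""
    problems = []
    # An absent or empty rope_scaling entry means no RoPE scaling is applied.
    if not config.get("rope_scaling"):
        problems.append("rope_scaling is missing")
    # The advertised context window must be at least min_context tokens.
    if config.get("max_position_embeddings", 0) < min_context:
        problems.append("max_position_embeddings is below %d" % min_context)
    return problems


def has_chat_template(tokenizer_config):
    """True if a tokenizer_config.json dict carries a non-empty chat template."""
    return bool(tokenizer_config.get("chat_template"))


if __name__ == "__main__":
    print(missing_long_context_fields(EXAMPLE_CONFIG))
    print(has_chat_template({"chat_template": "{{ messages }}"}))
```

To check a real download, load `config.json` and `tokenizer_config.json` with `json.load` and pass the resulting dicts to the two functions.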