Duplicated from shb777/Llama-3.3-8B-Instruct-128K


license: llama3.3
base_model:
  - allura-forge/Llama-3.3-8B-Instruct
pipeline_tag: text-generation

Llama 3.3 8B 128K Instruct (Fixed)

Based on the original allura-forge/Llama-3.3-8B-Instruct. Thanks!

Evals and GGUFs

Additional Fixes:

Added rope_scaling
Added chat template (Unsloth) in tokenizer config
Updated generation config
Enabled full context length
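
The fixes listed above can be checked against a downloaded config.json with a short script. This is a minimal sketch, not the repo's actual configuration: the values in `EXAMPLE_CONFIG` are assumptions patterned on the Llama 3.1-style `llama3` RoPE scaling convention, and `check_long_context` is a hypothetical helper name.

```python
import json

# Hypothetical config.json fragment. The numbers follow the Llama 3.1-style
# "llama3" RoPE scaling convention and are an assumption, not values copied
# from this repository.
EXAMPLE_CONFIG = json.loads("""
{
  "max_position_embeddings": 131072,
  "rope_scaling": {
    "rope_type": "llama3",
    "factor": 8.0,
    "low_freq_factor": 1.0,
    "high_freq_factor": 4.0,
    "original_max_position_embeddings": 8192
  }
}
""")


def check_long_context(config, expected_ctx=131072):
    """Return a list of problems; an empty list means both checks passed."""
    problems = []

    # 128K context corresponds to 131072 positions.
    ctx = config.get("max_position_embeddings")
    if ctx is None or ctx < expected_ctx:
        problems.append(
            f"max_position_embeddings is {ctx}, expected at least {expected_ctx}"
        )

    # Without rope_scaling the model has no long-context RoPE scaling settings.
    if not config.get("rope_scaling"):
        problems.append("rope_scaling is missing")

    return problems


if __name__ == "__main__":
    print(check_long_context(EXAMPLE_CONFIG))
    print(check_long_context({"max_position_embeddings": 8192}))
```

To inspect a real checkpoint, replace `EXAMPLE_CONFIG` with the result of `json.load` on the config.json file from the downloaded model directory.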