# configs/micro_llama_1b.yml
run-title: micro-llama-1b
model: micro-llama-1b
# Hugging Face Hub IDs: weights come from the base model, the tokenizer from the Instruct variant.
base-model: meta-llama/Llama-3.2-1B
tokenizer: meta-llama/Llama-3.2-1B-Instruct
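# Expert routing settings. Judging by the key names (an assumption, not confirmed here):
# 4 experts, 1 expert selected per routing decision, no router jitter noise.
num-experts: 4
top-k-experts: 1
jitter-noise: 0
use-router: True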
mask-input: True
max-length: 8192
trainable:
- model