cicero / config.json
Cicero LLM: curriculum-tuned 111M Latin model (ONNX + PyTorch + tokenizer)
{
  "model_type": "cicero-gpt",
  "vocab_size": 32000,
  "n_layer": 12,
  "n_head": 12,
  "n_embd": 768,
  "block_size": 2048,
  "tokenizer": "sentencepiece-bpe-32k (cicero_sp_32000)",
  "source": "MAX-V5 step 30000 (canonical cloze 0.804)"
}
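
As a sanity check on the fields above, the following is a minimal sketch that parses the config and estimates the parameter count. It assumes a GPT-2-style layout that the config itself does not confirm: learned position embeddings of size `block_size`, a 4x MLP expansion, biases on every linear layer, LayerNorm with weight and bias, and an output head tied to the token embedding. The function names `head_dim` and `estimate_params` are hypothetical helpers, not part of the repository. Under those assumptions the estimate comes to roughly 111M, which matches the size given in the repository description.

```python
import json

# The config as published; in practice you would read it from config.json.
CONFIG_TEXT = """
{
  "model_type": "cicero-gpt",
  "vocab_size": 32000,
  "n_layer": 12,
  "n_head": 12,
  "n_embd": 768,
  "block_size": 2048,
  "tokenizer": "sentencepiece-bpe-32k (cicero_sp_32000)",
  "source": "MAX-V5 step 30000 (canonical cloze 0.804)"
}
"""


def head_dim(cfg):
    """Per-head width; the embedding width must split evenly across heads."""
    if cfg["n_embd"] % cfg["n_head"] != 0:
        raise ValueError("n_embd must be divisible by n_head")
    return cfg["n_embd"] // cfg["n_head"]


def estimate_params(cfg):
    """Parameter count under ASSUMED GPT-2-style conventions (see lead-in)."""
    d = cfg["n_embd"]
    token_emb = cfg["vocab_size"] * d      # also reused as the tied output head
    pos_emb = cfg["block_size"] * d        # assumed learned positions
    attn = (3 * d * d + 3 * d) + (d * d + d)       # fused QKV + output projection
    mlp = (d * 4 * d + 4 * d) + (4 * d * d + d)    # assumed 4x expansion
    norms = 2 * (2 * d)                    # two LayerNorms per block
    per_layer = attn + mlp + norms
    final_norm = 2 * d
    return token_emb + pos_emb + cfg["n_layer"] * per_layer + final_norm


cfg = json.loads(CONFIG_TEXT)
print("head_dim:", head_dim(cfg))                       # head_dim: 64
print("estimated params:", f"{estimate_params(cfg):,}")  # estimated params: 111,204,864
```

If the real model uses rotary or other non-learned positions, drops biases, or unties the output head, the total shifts accordingly, so treat the figure as a consistency check and not as a specification.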