cicero / config.json

Cicero LLM: curriculum-tuned 111M Latin model (ONNX + PyTorch + tokenizer)

94d35d6 verified 8 days ago

242 Bytes

	{
	"model_type": "cicero-gpt",
	"vocab_size": 32000,
	"n_layer": 12,
	"n_head": 12,
	"n_embd": 768,
	"block_size": 2048,
	"tokenizer": "sentencepiece-bpe-32k (cicero_sp_32000)",
	"source": "MAX-V5 step 30000 (canonical cloze 0.804)"
	}