Text Generation
Transformers
Safetensors
Portuguese
nanothink
NanoThink-5M / config.json
Commit c6225cd (verified) by AxionLab-official: Create config.json
{
  "model_type": "nanothink",
  "architectures": ["NanoThink"],
  "vocab_size": 1229,
  "max_position_embeddings": 256,
  "hidden_size": 128,
  "num_hidden_layers": 4,
  "num_attention_heads": 4,
  "intermediate_size": 512,
  "hidden_act": "gelu",
  "layer_norm_eps": 1e-5,
  "initializer_range": 0.02,
  "bos_token_id": 1,
  "eos_token_id": 2,
  "pad_token_id": 0
}
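
A quick sanity check of these values can be sketched in Python. The NanoThink model code is not shown here, so this assumes conventional multi-head attention, where `hidden_size` is split evenly across `num_attention_heads`; `check_config` is a hypothetical helper, not part of any library.

```python
import json

# Values copied from the config.json above.
CONFIG_JSON = """
{
  "model_type": "nanothink",
  "architectures": ["NanoThink"],
  "vocab_size": 1229,
  "max_position_embeddings": 256,
  "hidden_size": 128,
  "num_hidden_layers": 4,
  "num_attention_heads": 4,
  "intermediate_size": 512,
  "hidden_act": "gelu",
  "layer_norm_eps": 1e-5,
  "initializer_range": 0.02,
  "bos_token_id": 1,
  "eos_token_id": 2,
  "pad_token_id": 0
}
"""


def check_config(cfg: dict) -> int:
    """Validate basic consistency and return the per-head dimension.

    Assumption: hidden_size is divided evenly among the attention heads,
    as in standard multi-head attention.
    """
    hidden = cfg["hidden_size"]
    heads = cfg["num_attention_heads"]
    if hidden % heads != 0:
        raise ValueError("hidden_size must be divisible by num_attention_heads")

    # Special token ids must be valid indices into the vocabulary.
    for key in ("bos_token_id", "eos_token_id", "pad_token_id"):
        if not 0 <= cfg[key] < cfg["vocab_size"]:
            raise ValueError(f"{key} is outside the vocabulary range")

    return hidden // heads


cfg = json.loads(CONFIG_JSON)
head_dim = check_config(cfg)
print(f"head_dim = {head_dim}")
```

Under that assumption, each of the 4 heads works on a 32-dimensional slice of the 128-dimensional hidden state, and the feed-forward width of 512 is 4 times the hidden size.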