nanogpt_peer_chess / config.json
adamdad's picture
Push model using huggingface_hub.
0b362cd verified
raw
history blame contribute delete
197 Bytes
{
"bias": false,
"block_size": 1023,
"dropout": 0.0,
"expert_head": 8,
"n_embd": 512,
"n_experts": 16384,
"n_experts_per_head": 16,
"n_head": 8,
"n_layer": 8,
"vocab_size": 32
}