mgpt2-sft / config.json
ace-1's picture
Publish mgpt2 sft checkpoint (step 1262, val_loss 1.240358)
09246b1 verified
{
"architectures": [
"GPT"
],
"model_type": "mgpt2",
"block_size": 1024,
"vocab_size": 50304,
"n_layer": 12,
"n_head": 12,
"n_embd": 768,
"tokenizer_kind": "mgpt2_regex_bpe"
}