microgpt-deva / config.json
ss-76's picture
Initial upload of MicroGPT-Deva model
b2fd5c3 verified
raw
history blame
147 Bytes
{"batch_size": 32, "block_size": 512, "dropout": 0.0, "lr": 0.0003, "n_embd": 512, "n_head": 8, "n_layer": 8, "num_epochs": 1, "vocab_size": 12000}