ss-76/microgpt-deva
Tags: Text Generation, Transformers, PyTorch, custom, Sanskrit, generative, language-model, sanskrit, devanagari, flashattention, micro-llm
License: mit
c84895e
microgpt-deva/config.json
ss-76: Initial upload of MicroGPT-Deva model (b2fd5c3, verified, 9 months ago)
147 Bytes
{
  "batch_size": 32,
  "block_size": 512,
  "dropout": 0.0,
  "lr": 0.0003,
  "n_embd": 512,
  "n_head": 8,
  "n_layer": 8,
  "num_epochs": 1,
  "vocab_size": 12000
}
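
The values above can be sanity-checked with a short script. What follows is a minimal sketch, not the repository's own loading code: it parses the config text inline and derives the per-head dimension, the tokens processed per optimizer step, and a rough parameter count. The helper names (`head_dim`, `tokens_per_step`, `estimate_params`) are hypothetical, and the parameter estimate assumes a GPT-2-style layout (learned positional embeddings, four attention projections per layer, a 4x MLP expansion, biases and LayerNorm ignored, output head tied to the token embedding). None of those architectural details are confirmed by this file.

```python
import json

# Config text copied from config.json above.
CONFIG_TEXT = """
{
  "batch_size": 32,
  "block_size": 512,
  "dropout": 0.0,
  "lr": 0.0003,
  "n_embd": 512,
  "n_head": 8,
  "n_layer": 8,
  "num_epochs": 1,
  "vocab_size": 12000
}
"""


def head_dim(cfg: dict) -> int:
    """Per-head width; the embedding width must divide evenly across heads."""
    n_embd, n_head = cfg["n_embd"], cfg["n_head"]
    if n_embd % n_head != 0:
        raise ValueError("n_embd must be divisible by n_head")
    return n_embd // n_head


def tokens_per_step(cfg: dict) -> int:
    """Tokens seen in one batch: batch_size sequences of block_size tokens."""
    return cfg["batch_size"] * cfg["block_size"]


def estimate_params(cfg: dict, mlp_ratio: int = 4, tied_lm_head: bool = True) -> int:
    """Rough weight count under ASSUMED GPT-2-style blocks (not confirmed by the repo).

    Biases and LayerNorm parameters are ignored.
    """
    d = cfg["n_embd"]
    token_emb = cfg["vocab_size"] * d      # token embedding table
    pos_emb = cfg["block_size"] * d        # assumed learned positional embeddings
    attn = 4 * d * d                       # assumed q, k, v, and output projections
    mlp = 2 * mlp_ratio * d * d            # assumed up- and down-projection
    blocks = cfg["n_layer"] * (attn + mlp)
    lm_head = 0 if tied_lm_head else cfg["vocab_size"] * d
    return token_emb + pos_emb + blocks + lm_head


cfg = json.loads(CONFIG_TEXT)
print("head_dim:", head_dim(cfg))
print("tokens_per_step:", tokens_per_step(cfg))
print("approx_params:", estimate_params(cfg))
```

Under these assumptions the estimate comes to roughly 31.6 million weights with a tied output head, or about 37.7 million if the head is untied (`tied_lm_head=False`). The real count depends on the model's actual block definition.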