Create README.md
Browse filesA Small 33M parameter decoder only model with flash attention trained on Sanskrit texts. Uses custom SentencePiece Devanagari BPE tokenizer
A Small 33M parameter decoder only model with flash attention trained on Sanskrit texts. Uses custom SentencePiece Devanagari BPE tokenizer