Tamil Tiny Stories
A toy model that generates short stories in Tamil, one character at a time.
Model details
- Architecture: custom decoder-only transformer
- Tokenization: character-level
- Training data source: neuralnets/multilingual-tinystories, Tamil split (ta)
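To illustrate what character-level tokenization means in practice, here is a minimal sketch. The `CharTokenizer` class and the sample corpus are hypothetical and are not taken from this model's code. The sketch assumes each Unicode code point is one token; the card does not say whether the actual tokenizer splits on code points or on grapheme clusters.

```python
# Hypothetical character-level tokenizer for illustration only;
# this is NOT the tokenizer shipped with the model.
class CharTokenizer:
    def __init__(self, corpus: str):
        # Vocabulary: every distinct Unicode code point in the corpus, sorted.
        self.itos = sorted(set(corpus))
        self.stoi = {ch: i for i, ch in enumerate(self.itos)}

    def encode(self, text: str) -> list[int]:
        # Raises KeyError for characters that never appeared in the corpus.
        return [self.stoi[ch] for ch in text]

    def decode(self, ids: list[int]) -> str:
        return "".join(self.itos[i] for i in ids)


# Made-up sample text standing in for the training corpus.
corpus = "ஒரு நாள் ஒரு சிறுமி பூங்காவுக்கு சென்றாள்."
tok = CharTokenizer(corpus)

ids = tok.encode("ஒரு நாள்")
print(ids)
print(tok.decode(ids))
```

Under this assumption, Tamil vowel signs and the pulli (்) are separate code points, so a single visible letter can map to more than one token.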
Usage
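from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "senthil090/tamil-tiny-stories"

# trust_remote_code=True lets transformers run Python code from the model repo,
# which a custom architecture typically requires; review that code first.
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

# Encode a Tamil prompt ("one day") and generate up to 100 new tokens.
inputs = tokenizer("ஒரு நாள்", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=100)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))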