legacy-datasets/wikipedia
Updated • 117k • 645
This is a toy language model trained from scratch for educational purposes.
Model Type: Causal Language Model Architecture: Transformer decoder-only Parameters: 30,044,544 Context Length: 256 tokens Vocabulary Size: 50257
import torch
from transformers import AutoModel, AutoTokenizer
model = AutoModel.from_pretrained("Sak2004/I-am-still-learning", trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained("Sak2004/I-am-still-learning")
inputs = tokenizer("The benefits of renewable energy include", return_tensors="pt")
with torch.no_grad():
outputs = model.generate(inputs.input_ids, max_length=100, temperature=0.8)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
This model is released under the MIT License.
If you use this model, please cite:
@misc{toyllm_2026,
title={I am still learning - A Toy Language Model},
author={Sak2004},
year={2026},
url={https://huggingface.co/Sak2004/I-am-still-learning}
}