Spaces:
Sleeping
Sleeping
metadata
title: Tiny Transformer Trainer
emoji: 🧠
colorFrom: purple
colorTo: blue
sdk: docker
app_port: 7860
pinned: false
Tiny Transformer Trainer 🧠
Train small GPT-style transformers from your own text data — directly in the browser.
How to Use
Upload data:
.txt— plain text files.csv— Q&A pairs with named columns.json— Q&A pairs as a list of objects
Set column names (for CSV/JSON): specify which columns hold questions/prompts and answers/responses.
Configure model: pick hidden size, layers, heads, vocab size, and max sequence length.
Train: click 🚀 Start Training and watch live logs.
Download: grab the
.zipwith model weights + tokenizer.
Output Files
| File | Description |
|---|---|
model/ |
Trained GPT2LMHeadModel weights + config |
tokenizer/ |
Trained PreTrainedTokenizerFast vocab + config |
README.md |
Usage code snippet |
Local Usage
from transformers import GPT2LMHeadModel, PreTrainedTokenizerFast
model = GPT2LMHeadModel.from_pretrained("trained_model_output/model")
tokenizer = PreTrainedTokenizerFast.from_pretrained("trained_model_output/tokenizer")