The Idea
Why?
Most AI models compete by getting bigger.
AtomZephyr competes by removing parameters until people start questioning whether it's still a neural network.
Every parameter had to earn its place.
Most didn't.
Specifications
| Property | Value |
|---|---|
| Parameters | 27 |
| Architecture | GPT-2 |
| Layers | 1 |
| Attention Heads | 1 |
| Embedding Size | 1 |
| FFN Size | 1 |
| Context Length | 4 |
| Vocabulary | 5 Tokens |
| Model Size | <5 KB |
| Training Time | ~6 Seconds (CPU) |
Performance
| Test | Result |
|---|---|
| Understand English | β |
| Write Code | β |
| Solve Math | β |
| Generate "abba" | β |
| Break Expectations | β |
Quick Start
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("WhirlwindAI/AtomZephyr")
model = AutoModelForCausalLM.from_pretrained("WhirlwindAI/AtomZephyr")
prompt = "a"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(
**inputs,
do_sample=True,
temperature=1.7,
max_length=4
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
Possible output
abaa
Groundbreaking.
Example Conversation
User
Tell me a joke.
AtomZephyr
abba
Technically...
that's an answer.
Scientific Achievement
Removing parameters is easy.
Keeping a transformer alive afterwards...
isn't.
AtomZephyr exists purely to explore the absolute lower limits of transformer architectures while remaining a real, trainable language model.
Whether it's useful is a completely different discussion.
Awards
π₯ Smallest Model That Still Has Self-Respect
π Best Binary Poetry Generator
π₯ Most Efficient Waste Of Six Seconds
ποΈ Official Representative Of Tiny AI
Limitations
AtomZephyr should not be used for:
- Programming
- Translation
- Question Answering
- Homework
- Anything important
It performs significantly better when asked to do absolutely nothing useful.
Fun Facts
- Fits inside most PNG images.
- Smaller than many neural network tutorials.
- Downloads faster than this README loads.
- Has fewer parameters than some calculator manuals have pages.
License
MIT
Take it apart.
Make it smaller.
Break another record.
- Downloads last month
- 643