File size: 728 Bytes
9b57eb5 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 |
---
title: SmolLM2-135M From Scratch
emoji: π€
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: mit
---
# SmolLM2-135M: Complete From-Scratch Implementation
This Space demonstrates a complete reverse-engineered implementation of SmolLM2-135M.
## Features
- π Reverse-engineered architecture
- ποΈ Trained for 5,000+ steps
- β
Checkpoint validation
- β‘ Optimized with Flash Attention & Mixed Precision
## Links
- **GitHub Repository**: [abi2024/smollm2-135-implementation](https://github.com/abi2024/smollm2-135-implementation)
- **Model Details**: See the Model Info tab
## Usage
Enter a prompt and adjust generation parameters to see the model in action! |