metadata
title: SmolLM2-135M From Scratch
emoji: π€
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: mit
SmolLM2-135M: Complete From-Scratch Implementation
This Space demonstrates a complete reverse-engineered implementation of SmolLM2-135M.
Features
- π Reverse-engineered architecture
- ποΈ Trained for 5,000+ steps
- β Checkpoint validation
- β‘ Optimized with Flash Attention & Mixed Precision
Links
- GitHub Repository: abi2024/smollm2-135-implementation
- Model Details: See the Model Info tab
Usage
Enter a prompt and adjust generation parameters to see the model in action!