---
title: SmolLM2-135M From Scratch
emoji: 🤖
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: mit
---
# SmolLM2-135M: Complete From-Scratch Implementation

This Space demonstrates a complete, reverse-engineered implementation of SmolLM2-135M.

## Features

- Reverse-engineered architecture
- Trained for 5,000+ steps
- Checkpoint validation
- Optimized with Flash Attention and mixed precision

## Links

- **GitHub Repository**: [abi2024/smollm2-135-implementation](https://github.com/abi2024/smollm2-135-implementation)
- **Model Details**: See the Model Info tab

## Usage

Enter a prompt, adjust the generation parameters, and run the model to see its output.
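
The README does not list which generation parameters the Space exposes. As a rough illustration only, assuming the common `temperature` and `top_k` controls, the sketch below shows how such parameters typically shape the choice of the next token. The function `sample_next_token` and the toy logits are hypothetical and are not taken from this Space's `app.py`.

```python
import math
import random


def sample_next_token(logits, temperature=1.0, top_k=0, rng=None):
    """Pick a token index from raw logits using temperature and top-k sampling.

    Hypothetical helper for illustration; not the Space's actual code.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    rng = rng or random.Random()

    # Lower temperature sharpens the distribution; higher temperature flattens it.
    scaled = [x / temperature for x in logits]

    # Keep only the top_k highest-scoring tokens (0 means keep all of them).
    indices = list(range(len(scaled)))
    if 0 < top_k < len(scaled):
        indices = sorted(indices, key=lambda i: scaled[i], reverse=True)[:top_k]

    # Softmax weights over the kept tokens; subtract the max for numerical stability.
    peak = max(scaled[i] for i in indices)
    weights = [math.exp(scaled[i] - peak) for i in indices]

    # Draw one index in proportion to its weight.
    return rng.choices(indices, weights=weights, k=1)[0]


# Toy example: four candidate tokens, sampled from the two most likely ones.
toy_logits = [2.0, 1.0, 0.5, -1.0]
print(sample_next_token(toy_logits, temperature=0.7, top_k=2, rng=random.Random(0)))
```

With `top_k=1` the function always returns the highest-scoring token, which corresponds to greedy decoding; larger `top_k` and higher `temperature` values make the output more varied.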