abi96062's picture
Update README.md
9b57eb5 verified

A newer version of the Gradio SDK is available: 6.1.0

Upgrade
metadata
title: SmolLM2-135M From Scratch
emoji: πŸ€–
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: mit

SmolLM2-135M: Complete From-Scratch Implementation

This Space demonstrates a complete reverse-engineered implementation of SmolLM2-135M.

Features

  • πŸ” Reverse-engineered architecture
  • πŸ‹οΈ Trained for 5,000+ steps
  • βœ… Checkpoint validation
  • ⚑ Optimized with Flash Attention & Mixed Precision

Links

Usage

Enter a prompt and adjust generation parameters to see the model in action!