abi96062's picture
Update README.md
9b57eb5 verified
|
raw
history blame
728 Bytes
metadata
title: SmolLM2-135M From Scratch
emoji: πŸ€–
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: mit

SmolLM2-135M: Complete From-Scratch Implementation

This Space demonstrates a complete reverse-engineered implementation of SmolLM2-135M.

Features

  • πŸ” Reverse-engineered architecture
  • πŸ‹οΈ Trained for 5,000+ steps
  • βœ… Checkpoint validation
  • ⚑ Optimized with Flash Attention & Mixed Precision

Links

Usage

Enter a prompt and adjust generation parameters to see the model in action!