---
title: TinyLlama Math Fine-tuning Demo
emoji: 🧮
colorFrom: blue
colorTo: purple
sdk: gradio
app_file: app.py
pinned: false
---

# TinyLlama Math Fine-tuning Demo

Compare base TinyLlama vs fine-tuned TinyLlama on math word problems from GSM8K.

## Models

- Base: TinyLlama (1.1B parameters)
- Fine-tuned: the same base model with a LoRA adapter trained on GSM8K (details below)

## Training Details

- Dataset: GSM8K (7,473 training examples)
- Method: LoRA (r=8, alpha=16)
- Epochs: 5
- Quantization: 4-bit (NF4)
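
The hyperparameters above can be written down as a `peft`/`transformers` configuration. This is a sketch of what such a setup might look like, not the repository's actual training script; the compute dtype and `target_modules` are assumptions.

```python
# Sketch of the LoRA + 4-bit NF4 configuration described above.
# target_modules and bnb_4bit_compute_dtype are illustrative assumptions.
from transformers import BitsAndBytesConfig
from peft import LoraConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                  # 4-bit quantization
    bnb_4bit_quant_type="nf4",          # NF4, as listed above
    bnb_4bit_compute_dtype="bfloat16",
)

lora_config = LoraConfig(
    r=8,             # LoRA rank (r=8)
    lora_alpha=16,   # alpha=16
    task_type="CAUSAL_LM",
    target_modules=["q_proj", "v_proj"],  # common attention projections; an assumption
)

# These objects would then be passed to
# AutoModelForCausalLM.from_pretrained(..., quantization_config=bnb_config)
# and peft.get_peft_model(model, lora_config).
```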

## Usage

1. Load an example using the slider, or enter your own math question.
2. Click "Compare Models" to see responses from both models.
3. Compare with the reference answer.

## Observations

The fine-tuned model learns:

- Step-by-step reasoning format
- Mathematical notation (using `<<calc>>` markers)
- Structured problem-solving approach

However, as a 1.1B-parameter model, it may still make errors on complex multi-step calculations.
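
To make the `<<calc>>` notation concrete: GSM8K-style outputs embed calculator annotations such as `<<48/2=24>>`. The small helper below (an illustration, not part of this repo's code) extracts those markers and checks whether each stated result is arithmetically correct, which is one way to spot the calculation errors mentioned above.

```python
import re

# Matches GSM8K-style calculator annotations like <<48/2=24>>.
CALC_RE = re.compile(r"<<([^=<>]+)=([^<>]+)>>")

def check_calc_markers(text):
    """Return (expression, claimed_result, is_correct) for each <<expr=result>> marker."""
    results = []
    for expr, claimed in CALC_RE.findall(text):
        try:
            # Restricted eval: plain arithmetic only, no builtins or names available.
            actual = eval(expr, {"__builtins__": {}}, {})
            ok = abs(actual - float(claimed)) < 1e-6
        except Exception:
            ok = False
        results.append((expr, claimed, ok))
    return results

answer = "Half of 48 is <<48/2=24>>24. Adding 10 gives <<24+10=34>>34."
print(check_calc_markers(answer))
# → [('48/2', '24', True), ('24+10', '34', True)]
```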

## Code

This repository includes the full training and evaluation code: