---
license: mit
datasets:
- IsmaelMousa/movies
tags:
- movie
- short_stories
- llm
- slm
---

# Small Language Model (SLM) from Scratch – Explained
---

## Training Metrics

| Epoch | Train Loss | Val Loss | Perplexity |
|-------|------------|----------|------------|
| 500   | 6.0358     | 6.0601   | 430.1      |
| 1000  | 5.0690     | 5.1143   | 166.0      |
| 1500  | 4.3162     | 4.3407   | 76.7       |
| 2000  | 3.5948     | 3.6099   | 36.9       |
| 2500  | 3.0460     | 3.0569   | 21.3       |
| 3000  | 2.7518     | 2.7398   | 15.5       |
| 3500  | 2.5606     | 2.5574   | 12.9       |
| 4000  | 2.4583     | 2.4691   | 11.8       |
| 4500  | 2.3943     | 2.3969   | 11.0       |
| 5000  | 2.3428     | 2.3513   | 10.5       |
| 6000  | 2.2141     | 2.2155   | 9.17       |
| 7000  | 2.1389     | 2.1577   | 8.65       |
| 8000  | 2.0570     | 2.0703   | 7.93       |
| 9000  | 2.0062     | 2.0210   | 7.55       |
| 10000 | 1.9604     | 1.9715   | 7.18       |
| 12000 | 1.8580     | 1.8924   | 6.64       |
| 14000 | 1.7954     | 1.8284   | 6.23       |
| 16000 | 1.7369     | 1.7937   | 5.95       |
| 18000 | 1.6901     | 1.7314   | 5.65       |
| 19500 | 1.6594     | 1.7216   | 5.60       |

Validation loss steadily decreases, and **perplexity drops from ~430 to ~5.6** over training.
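The reported perplexity is (approximately) the exponential of the validation cross-entropy loss, which is a quick way to sanity-check the table; a minimal check, using values taken from the table above:

```python
import math

# Perplexity = exp(cross-entropy loss); spot-check a few rows of the table.
for step, val_loss in [(5000, 2.3513), (6000, 2.2155), (19500, 1.7216)]:
    ppl = math.exp(val_loss)
    print(f"step {step:>5}: val loss {val_loss:.4f} -> perplexity {ppl:.2f}")
```

The recomputed values agree with the table to within rounding.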
## 6. Inference
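The inference code itself is elided in this excerpt; as a rough sketch of the temperature and top-k sampling it describes (function and variable names here are illustrative, not the repository's actual API):

```python
import torch

def sample_next_token(logits: torch.Tensor, temperature: float = 1.0, top_k: int = 50) -> int:
    """Pick the next token id from raw logits via temperature + top-k sampling.

    `logits` is a 1-D tensor of vocabulary scores. Illustrative sketch only.
    """
    logits = logits / max(temperature, 1e-8)       # temperature scaling
    if top_k is not None:
        k = min(top_k, logits.size(-1))
        kth = torch.topk(logits, k).values[-1]     # k-th largest logit
        logits = logits.masked_fill(logits < kth, float("-inf"))
    probs = torch.softmax(logits, dim=-1)          # renormalize survivors
    return int(torch.multinomial(probs, num_samples=1).item())

# Tiny demo over a 5-token "vocabulary"
demo_logits = torch.tensor([2.0, 1.0, 0.5, -1.0, -3.0])
next_id = sample_next_token(demo_logits, temperature=0.8, top_k=3)
print(next_id)
```

Lower temperature sharpens the distribution toward the top logit, while top-k zeroes out everything outside the k most likely tokens before sampling.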
- **Evaluation**: Loss curves (train vs val).
- **Inference**: Autoregressive generation with temperature & top-k control.

This is essentially a **mini GPT-2 clone**, scaled down for small datasets like movie scripts.