Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,27 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# FashionMNIST Model Repository - Assignment 4
|
| 2 |
+
|
| 3 |
+
This repository contains two deep learning models trained on the **FashionMNIST** dataset as part of the AI417 Deep Learning course at UPM (Spring 2026).
|
| 4 |
+
|
| 5 |
+
## Models Included
|
| 6 |
+
|
| 7 |
+
### 1. Dummy Model (`dummy`)
|
| 8 |
+
- **Architecture**: A simple feed-forward network with layers of sizes 784 -> 512 -> 784 -> 10.
|
| 9 |
+
- **Initialization**: Xavier Uniform weights and zero biases.
|
| 10 |
+
- **Optimizer**: Stochastic Gradient Descent (SGD) with momentum 0.9 and weight decay 5e-4.
|
| 11 |
+
|
| 12 |
+
### 2. Vanilla Model (`vanilla`)
|
| 13 |
+
- **Architecture**: A deeper feed-forward network with layers 784 -> 1024 -> 512 -> 10 using ReLU activations.
|
| 14 |
+
- **Initialization**: Kaiming Normal initialization, which is optimized for ReLU layers.
|
| 15 |
+
- **Optimizer**: AdamW optimizer for faster and more stable convergence.
|
| 16 |
+
|
| 17 |
+
## How to use with PyTorch Hub
|
| 18 |
+
|
| 19 |
+
You can load these models directly into your Python environment using `torch.hub`.
|
| 20 |
+
|
| 21 |
+
### Load the Vanilla Model (Pretrained)
|
| 22 |
+
```python
|
| 23 |
+
import torch
|
| 24 |
+
|
| 25 |
+
repo = 'AI417UPM/A4_4310823_Sema'
|
| 26 |
+
model = torch.hub.load(repo, 'vanilla', pretrained=True)
|
| 27 |
+
model.eval()
|