AI417UPM
/

A4_4310823_Sema

Model card Files Files and versions

SemaAli99 commited on Feb 19

Commit

e0b41b3

·

verified ·

1 Parent(s): e8b7e98

Create README.md

Files changed (1) hide show

README.md +27 -0

README.md ADDED Viewed

	@@ -0,0 +1,27 @@

+# FashionMNIST Model Repository - Assignment 4
+This repository contains two deep learning models trained on the **FashionMNIST** dataset as part of the AI417 Deep Learning course at UPM (Spring 2026).
+## Models Included
+### 1. Dummy Model (`dummy`)
+- **Architecture**: A simple feed-forward network with layers of sizes 784 -> 512 -> 784 -> 10.
+- **Initialization**: Xavier Uniform weights and zero biases.
+- **Optimizer**: Stochastic Gradient Descent (SGD) with momentum 0.9 and weight decay 5e-4.
+### 2. Vanilla Model (`vanilla`)
+- **Architecture**: A deeper feed-forward network with layers 784 -> 1024 -> 512 -> 10 using ReLU activations.
+- **Initialization**: Kaiming Normal initialization, which is optimized for ReLU layers.
+- **Optimizer**: AdamW optimizer for faster and more stable convergence.
+## How to use with PyTorch Hub
+You can load these models directly into your Python environment using `torch.hub`.
+### Load the Vanilla Model (Pretrained)
+```python
+import torch
+repo = 'AI417UPM/A4_4310823_Sema'
+model = torch.hub.load(repo, 'vanilla', pretrained=True)
+model.eval()