SemaAli99 committed on
Commit e0b41b3 · verified · 1 Parent(s): e8b7e98

Create README.md

Files changed (1)
  1. README.md +27 -0
README.md ADDED
@@ -0,0 +1,27 @@
+ # FashionMNIST Model Repository - Assignment 4
+
+ This repository contains two deep learning models trained on the **FashionMNIST** dataset as part of the AI417 Deep Learning course at UPM (Spring 2026).
+
+ ## Models Included
+
+ ### 1. Dummy Model (`dummy`)
+ - **Architecture**: A simple feed-forward network with layers of sizes 784 -> 512 -> 784 -> 10.
+ - **Initialization**: Xavier Uniform weights and zero biases.
+ - **Optimizer**: Stochastic Gradient Descent (SGD) with momentum 0.9 and weight decay 5e-4.
+
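As a rough illustration of the dummy model described above, the sketch below builds a 784 -> 512 -> 784 -> 10 network with Xavier Uniform weights, zero biases, and SGD with momentum 0.9 and weight decay 5e-4. The helper name `build_dummy_model`, the ReLU activations, and the learning rate of 0.01 are assumptions for illustration only; the README does not state them, and the actual repository code may differ.

```python
import torch
from torch import nn


def build_dummy_model() -> nn.Sequential:
    """Hypothetical helper: a 784 -> 512 -> 784 -> 10 feed-forward network."""
    model = nn.Sequential(
        nn.Flatten(),            # 1x28x28 image -> 784-dim vector
        nn.Linear(784, 512),
        nn.ReLU(),               # assumption: the README names no activation
        nn.Linear(512, 784),
        nn.ReLU(),
        nn.Linear(784, 10),      # 10 FashionMNIST classes
    )
    # Xavier Uniform weights and zero biases, as listed in the README.
    for module in model.modules():
        if isinstance(module, nn.Linear):
            nn.init.xavier_uniform_(module.weight)
            nn.init.zeros_(module.bias)
    return model


dummy = build_dummy_model()
# Momentum and weight decay come from the README; lr=0.01 is an assumed value.
optimizer = torch.optim.SGD(
    dummy.parameters(), lr=0.01, momentum=0.9, weight_decay=5e-4
)

# Forward pass on a random batch shaped like FashionMNIST images.
logits = dummy(torch.randn(4, 1, 28, 28))
print(logits.shape)
```

The `nn.Flatten()` layer is included so the sketch accepts image-shaped input directly; a model that expects pre-flattened vectors would omit it.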
+ ### 2. Vanilla Model (`vanilla`)
+ - **Architecture**: A deeper feed-forward network with layers 784 -> 1024 -> 512 -> 10 using ReLU activations.
+ - **Initialization**: Kaiming Normal initialization, which is designed for layers followed by ReLU activations.
+ - **Optimizer**: AdamW (Adam with decoupled weight decay), chosen for faster and more stable convergence.
+
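The vanilla model's description can be sketched in the same way: a 784 -> 1024 -> 512 -> 10 ReLU network with Kaiming Normal weights, trained for one illustrative step with AdamW. The name `build_vanilla_model`, the zero bias initialization, the learning rate of 1e-3, and the random stand-in batch are all assumptions, not details taken from the repository.

```python
import torch
from torch import nn


def build_vanilla_model() -> nn.Sequential:
    """Hypothetical helper: a 784 -> 1024 -> 512 -> 10 ReLU network."""
    model = nn.Sequential(
        nn.Flatten(),
        nn.Linear(784, 1024),
        nn.ReLU(),
        nn.Linear(1024, 512),
        nn.ReLU(),
        nn.Linear(512, 10),
    )
    for module in model.modules():
        if isinstance(module, nn.Linear):
            # Kaiming Normal scales the weight variance for ReLU layers.
            nn.init.kaiming_normal_(module.weight, nonlinearity="relu")
            nn.init.zeros_(module.bias)  # assumption: bias init is not stated
    return model


vanilla = build_vanilla_model()
adamw = torch.optim.AdamW(vanilla.parameters(), lr=1e-3)  # assumed lr
loss_fn = nn.CrossEntropyLoss()

# One training step on a random stand-in batch (not real FashionMNIST data).
images = torch.randn(8, 1, 28, 28)
labels = torch.randint(0, 10, (8,))

adamw.zero_grad()
loss = loss_fn(vanilla(images), labels)
loss.backward()
adamw.step()
print(loss.item())
```

In a real training loop the random tensors would be replaced by batches from a `DataLoader` over the FashionMNIST training set.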
+ ## How to use with PyTorch Hub
+
+ You can load these models directly into your Python environment using `torch.hub`.
+
+ ### Load the Vanilla Model (Pretrained)
+ ```python
+ import torch
+
+ repo = 'AI417UPM/A4_4310823_Sema'
+ model = torch.hub.load(repo, 'vanilla', pretrained=True)
+ model.eval()