---
license: mit
tags:
- pytorch
- mnist
- modal
---
# MNIST Model Trained on Modal
This is a PyTorch model trained on the MNIST dataset using an NVIDIA A100 GPU on [Modal](https://modal.com).
## Model Details
- **Architecture**: Simple CNN (Conv2D -> Conv2D -> MaxPool -> Dropout -> FC -> Dropout -> FC)
- **Dataset**: MNIST
- **Final Test Accuracy**: 99.02% after epoch 5 (the best epoch was epoch 4, at 99.05%)
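
The layer order listed above could be sketched in PyTorch roughly as follows. Only the layer sequence comes from this card. The channel counts, kernel sizes, dropout rates, ReLU activations, and the `log_softmax` output are assumptions for illustration, and the class name `Net` is hypothetical. The actual definition is in `train_mnist.py`.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class Net(nn.Module):
    """Conv2D -> Conv2D -> MaxPool -> Dropout -> FC -> Dropout -> FC."""

    def __init__(self):
        super().__init__()
        # Assumed sizes: the card does not state channels, kernels, or dropout rates.
        self.conv1 = nn.Conv2d(1, 32, kernel_size=3, stride=1)
        self.conv2 = nn.Conv2d(32, 64, kernel_size=3, stride=1)
        self.dropout1 = nn.Dropout(0.25)
        self.dropout2 = nn.Dropout(0.5)
        # 28x28 input -> 26x26 -> 24x24 after the two convs -> 12x12 after pooling.
        self.fc1 = nn.Linear(64 * 12 * 12, 128)
        self.fc2 = nn.Linear(128, 10)  # one output per digit class

    def forward(self, x):
        x = F.relu(self.conv1(x))  # activation choice is an assumption
        x = F.relu(self.conv2(x))
        x = F.max_pool2d(x, 2)
        x = self.dropout1(x)
        x = torch.flatten(x, 1)  # keep the batch dimension
        x = F.relu(self.fc1(x))
        x = self.dropout2(x)
        return F.log_softmax(self.fc2(x), dim=1)


# Shape check on a dummy batch of two 28x28 grayscale images.
model = Net().eval()
with torch.no_grad():
    out = model(torch.zeros(2, 1, 28, 28))
print(out.shape)  # torch.Size([2, 10])
```

With these assumed sizes the flattened feature vector has 64 * 12 * 12 = 9216 elements, which is why the first fully connected layer takes that many inputs.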
## Links
- **W&B Run**: [View training on Weights & Biases](https://wandb.ai/ivanleo97-freelance/mnist-modal/runs/tu4yqtvi)
- **Training Script**: The script used to train this model is available in the repository as `train_mnist.py`.
## Training Configuration
- **Batch Size**: 64
- **Optimizer**: Adadelta (learning rate 1.0)
- **Epochs**: 5
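
As a rough illustration of how these settings fit together, the sketch below wires the batch size, Adadelta learning rate, and epoch count into a minimal training loop. It is not the contents of `train_mnist.py`: the random stand-in data, the placeholder linear model, and the cross-entropy loss are all assumptions chosen so the snippet runs without downloading MNIST.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from torch.utils.data import DataLoader, TensorDataset

# Values taken from the configuration listed above.
BATCH_SIZE = 64
LEARNING_RATE = 1.0
EPOCHS = 5

# Stand-in data (assumption): random tensors shaped like MNIST images.
images = torch.randn(256, 1, 28, 28)
labels = torch.randint(0, 10, (256,))
loader = DataLoader(TensorDataset(images, labels), batch_size=BATCH_SIZE, shuffle=True)

# Placeholder model (assumption), standing in for the CNN described in this card.
model = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 10))
optimizer = torch.optim.Adadelta(model.parameters(), lr=LEARNING_RATE)

for epoch in range(1, EPOCHS + 1):
    model.train()
    running_loss = 0.0
    for batch_images, batch_labels in loader:
        optimizer.zero_grad()
        # Loss function is an assumption; the card does not name one.
        loss = F.cross_entropy(model(batch_images), batch_labels)
        loss.backward()
        optimizer.step()
        running_loss += loss.item()
    print(f"epoch {epoch}: mean train loss {running_loss / len(loader):.4f}")
```

A learning rate of 1.0 is the PyTorch default for Adadelta, which scales each update by running averages of past gradients and updates rather than relying on a small hand-tuned step size.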
## Training Metrics
| Epoch | Train Loss | Test Loss | Test Accuracy |
|:---:|:---:|:---:|:---:|
| 1 | 0.2036 | 0.0441 | 98.40% |
| 2 | 0.0828 | 0.0376 | 98.84% |
| 3 | 0.0622 | 0.0487 | 98.38% |
| 4 | 0.0528 | 0.0301 | 99.05% |
| 5 | 0.0466 | 0.0314 | 99.02% |
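
The table shows that the last epoch is not the best one on either test metric. A small sketch for picking the best epoch from these numbers, with the values copied from the table above (the tuple layout and variable names are illustrative only):

```python
# (epoch, train_loss, test_loss, test_accuracy_percent), copied from the table above.
metrics = [
    (1, 0.2036, 0.0441, 98.40),
    (2, 0.0828, 0.0376, 98.84),
    (3, 0.0622, 0.0487, 98.38),
    (4, 0.0528, 0.0301, 99.05),
    (5, 0.0466, 0.0314, 99.02),
]

# Highest test accuracy and lowest test loss, each as a full table row.
best_by_accuracy = max(metrics, key=lambda row: row[3])
best_by_loss = min(metrics, key=lambda row: row[2])

print(f"best accuracy: epoch {best_by_accuracy[0]} ({best_by_accuracy[3]:.2f}%)")
print(f"lowest test loss: epoch {best_by_loss[0]} ({best_by_loss[2]:.4f})")
```

Both criteria select epoch 4, so a run that saves a checkpoint per epoch might prefer that one over the final weights. Whether this repository keeps per-epoch checkpoints is not stated here.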