dressify-models / README.md
Stylique's picture
Upload README.md with huggingface_hub
4b27c90 verified
---
language: en
license: mit
tags:
- computer-vision
- fashion
- outfit-recommendation
- deep-learning
- resnet
- vision-transformer
---
# Dressify Outfit Recommendation Models
This repository contains the trained models for the Dressify outfit recommendation system.
## Models
### ResNet Item Embedder
- **Architecture**: ResNet50 with custom projection head
- **Purpose**: Generate 512-dimensional embeddings for fashion items
- **Training**: Triplet loss with semi-hard negative mining
- **Input**: Fashion item images (224x224)
- **Output**: L2-normalized 512D embeddings
### ViT Outfit Compatibility Model
- **Architecture**: Vision Transformer encoder
- **Purpose**: Score outfit compatibility from item embeddings
- **Training**: Triplet loss with cosine distance
- **Input**: Variable-length sequence of item embeddings
- **Output**: Compatibility score (0-1)
## Usage
```python
from huggingface_hub import hf_hub_download
import torch
# Download models
resnet_path = hf_hub_download(repo_id="Stylique/dressify-models", filename="resnet_item_embedder_best.pth")
vit_path = hf_hub_download(repo_id="Stylique/dressify-models", filename="vit_outfit_model_best.pth")
# Load models
resnet_model = torch.load(resnet_path)
vit_model = torch.load(vit_path)
```
## Training Details
- **Dataset**: Polyvore Outfits (Stylique/Polyvore)
- **Loss**: Triplet margin loss
- **Optimizer**: AdamW
- **Mixed Precision**: Enabled
- **Hardware**: NVIDIA GPU with CUDA
## Performance
- **ResNet**: ~25M parameters, fast inference
- **ViT**: ~12M parameters, efficient outfit scoring
- **Memory**: Optimized for deployment on Hugging Face Spaces
## Citation
If you use these models in your research, please cite:
```bibtex
@misc{dressify2024,
title={Dressify: Deep Learning for Fashion Outfit Recommendation},
author={Stylique},
year={2024},
url={https://huggingface.co/Stylique/dressify-models}
}
```