convergedmachine's picture
Duplicate from convergingmachine/vcl-face-understanding
5fd97e5 verified
```yaml
---
language: en
license: mit
library_name: pytorch
tags:
- face-recognition
- self-supervised-learning
- contrastive-learning
datasets:
- YFCC-CelebA
- CelebA
---
```
# VCL: Variational Contrastive Learning for Face Understanding
VCL is a robust self-supervised learning method designed specifically for face understanding tasks, combining variational contrastive learning with beta-divergence to effectively handle noisy and unlabeled datasets[1].
## Model Details
### Model Description
**Developed by:** Mehmet Can Yavuz and Berrin Yanikoglu
**Model type:** Self-Supervised Variational Contrastive Learning with Applications to Face Understanding
**Language(s):** Python
**License:** MIT
**Model:** ResNet10t
## Uses
### Direct Use
The model is designed for:
- Face attribute recognition
- Face verification tasks
- Multi-label classification problems
- Learning from noisy and unlabeled datasets
## Model Architecture
The architecture consists of three main components:
- Feature extraction backbone (ResNet10t or VGG11bn)
- Gaussian sampling head for distribution learning
- Contrastive learning framework with augmentations
## Training Details
### Training Data
The model was pretrained on the YFCC-CelebA dataset and you can fine-tune on CelebA dataset.
### Training Procedure
#### Training Hyperparameters
**Training regime:**
- Optimizer: AdamW
- Learning rate: 1e-3
- Weight decay: 0.01
- Batch size: 128
- Temperature: 0.07
- Beta: 0.005
## Evaluation
### Results
Performance on CelebA test set with different pretraining approaches:
| Setting | ResNet10t (1%) | VGG11bn (1%) | ResNet10t (10%) | VGG11bn (10%) |
|---------|----------------|---------------|-----------------|---------------|
| VCL | 0.5836 | 0.5719 | 0.6848 | 0.6796 |
| VCL (beta) | 0.5998 | 0.5958 | 0.7098 | 0.6998 |
## How to Get Started with the Model
```python
# Installation
git clone https://github.com/convergingmachine/VCL
cd VCL
pip install -r requirements.txt
# Training
python train_beta.py
## Citation
```bibtex
@INPROCEEDINGS{10582001,
author={Yavuz, Mehmet Can and Yanikoglu, Berrin},
booktitle={2024 IEEE 18th International Conference on Automatic Face and Gesture Recognition (FG)},
title={Self-Supervised Variational Contrastive Learning with Applications to Face Understanding},
year={2024},
pages={1-9},
doi={10.1109/FG59268.2024.10582001}}
```
## Model Card Contact
For questions about this model, please open an issue in the GitHub repository.