Duplicate from convergingmachine/vcl-face-understanding

5fd97e5 verified 12 months ago

2.49 kB

	```yaml
	---
	language: en
	license: mit
	library_name: pytorch
	tags:
	- face-recognition
	- self-supervised-learning
	- contrastive-learning
	datasets:
	- YFCC-CelebA
	- CelebA
	---
	```
	# VCL: Variational Contrastive Learning for Face Understanding

	VCL is a robust self-supervised learning method designed specifically for face understanding tasks, combining variational contrastive learning with beta-divergence to effectively handle noisy and unlabeled datasets[1].

	## Model Details

	### Model Description

	Developed by: Mehmet Can Yavuz and Berrin Yanikoglu
	Model type: Self-Supervised Variational Contrastive Learning with Applications to Face Understanding
	Language(s): Python
	License: MIT
	Model: ResNet10t

	## Uses

	### Direct Use

	The model is designed for:
	- Face attribute recognition
	- Face verification tasks
	- Multi-label classification problems
	- Learning from noisy and unlabeled datasets

	## Model Architecture

	The architecture consists of three main components:
	- Feature extraction backbone (ResNet10t or VGG11bn)
	- Gaussian sampling head for distribution learning
	- Contrastive learning framework with augmentations

	## Training Details

	### Training Data

	The model was pretrained on the YFCC-CelebA dataset and you can fine-tune on CelebA dataset.

	### Training Procedure

	#### Training Hyperparameters

	Training regime:
	- Optimizer: AdamW
	- Learning rate: 1e-3
	- Weight decay: 0.01
	- Batch size: 128
	- Temperature: 0.07
	- Beta: 0.005

	## Evaluation

	### Results

	Performance on CelebA test set with different pretraining approaches:

	\| Setting \| ResNet10t (1%) \| VGG11bn (1%) \| ResNet10t (10%) \| VGG11bn (10%) \|
	\|---------\|----------------\|---------------\|-----------------\|---------------\|
	\| VCL \| 0.5836 \| 0.5719 \| 0.6848 \| 0.6796 \|
	\| VCL (beta) \| 0.5998 \| 0.5958 \| 0.7098 \| 0.6998 \|

	## How to Get Started with the Model

	```python
	# Installation
	git clone https://github.com/convergingmachine/VCL
	cd VCL
	pip install -r requirements.txt

	# Training
	python train_beta.py


	## Citation

	```bibtex
	@INPROCEEDINGS{10582001,
	author={Yavuz, Mehmet Can and Yanikoglu, Berrin},
	booktitle={2024 IEEE 18th International Conference on Automatic Face and Gesture Recognition (FG)},
	title={Self-Supervised Variational Contrastive Learning with Applications to Face Understanding},
	year={2024},
	pages={1-9},
	doi={10.1109/FG59268.2024.10582001}}
	```

	## Model Card Contact

	For questions about this model, please open an issue in the GitHub repository.