File size: 774 Bytes
1968b67 46c5a00 7ec25da 32d6e9e 7ec25da c1df16d 7ec25da fcf5b2f 7ec25da fcf5b2f 7ec25da |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 |
---
license: cc-by-nc-4.0
language:
- en
---
Pretrained model for <strong>Deepfake Video Detection Using Generative Convolutional Vision Transformer (GenConViT)</strong> paper.
<strong>GenConViT Model Architecture</strong>
The GenConViT model consists of two independent networks and incorporates the following modules:
Autoencoder (AE),
Variational Autoencoder (VAE), and
ConvNeXt-Swin Hybrid layer
GenConViT is trained using Adam optimizer with a learning rate of 0.0001 and weight decay of 0.0001.
GenConViT is trained on the DFDC, FF++, and TM datasets.
GenConViT model has an average accuracy of 95.8% and an AUC value of 99.3% across the tested datasets (DFDC, FF++, and DeepfakeTIMT, Celeb-DF (v2)).
code link: https://github.com/erprogs/GenConViT
|