File size: 6,586 Bytes
81393ba
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
---

license: mit
tags:
- deepfake-detection
- computer-vision
- image-classification
- xceptionnet
- face-forensics
- pytorch
- synthetic-media
- media-forensics
metrics:
- accuracy
- auc
- f1
language:
- en
library_name: pytorch
pipeline_tag: image-classification
---


# FaceForge Detector: State-of-the-Art Deepfake Detection

[![Paper](https://img.shields.io/badge/Paper-Zenodo-blue)](https://doi.org/10.5281/zenodo.18530439)
[![GitHub](https://img.shields.io/badge/GitHub-Repository-black)](https://github.com/Huzaifanasir95/FaceForge)
[![License](https://img.shields.io/badge/License-MIT-green.svg)](https://opensource.org/licenses/MIT)

🎯 **99.33% Accuracy | 0.9995 AUC-ROC | 0.67% Error Rate**

## Model Description

FaceForge Detector is a high-performance deepfake detection model based on XceptionNet architecture that achieves state-of-the-art results on the FaceForensics++ dataset. This model can distinguish authentic faces from AI-generated deepfakes with exceptional accuracy.

**Key Features:**
- πŸš€ 99.33% accuracy on test set (1,500 samples)
- πŸ“Š 0.9995 AUC-ROC score
- ⚑ <200ms inference time per image
- 🎯 Only 2 false negatives out of 750 deepfakes (0.27% miss rate)
- πŸ”§ 22M trainable parameters
- πŸ’» Trained efficiently on CPU (7.5 hours)

## Model Architecture

```

XceptionNet Backbone (20.8M params)

β”œβ”€β”€ Entry Flow

β”œβ”€β”€ Middle Flow (8 blocks)

β”œβ”€β”€ Exit Flow

β”œβ”€β”€ Global Average Pooling

└── Custom Classification Head (1.1M params)

    β”œβ”€β”€ Dropout (p=0.5)

    β”œβ”€β”€ FC(2048 β†’ 512) + ReLU

    β”œβ”€β”€ Dropout (p=0.3)

    └── FC(512 β†’ 2) [Real/Fake]

```

## Performance Metrics

### Test Set Results (1,500 samples)

| Metric | Value |
|--------|-------|
| **Accuracy** | 99.33% |
| **AUC-ROC** | 0.9995 |
| **Precision** | 98.94% |
| **Recall** | 99.73% |
| **F1 Score** | 0.9934 |
| **Specificity** | 98.93% |

### Confusion Matrix

|  | Predicted Real | Predicted Fake |
|--|----------------|----------------|
| **Actual Real** | 742 (TN) | 8 (FP) |
| **Actual Fake** | 2 (FN) | 748 (TP) |

**Total Errors:** 10 out of 1,500 (0.67%)

## Comparison with State-of-the-Art

| Method | Accuracy | AUC-ROC |
|--------|----------|---------|
| MesoNet | 83.1% | 0.847 |
| XceptionNet (base) | 95.3% | 0.969 |
| EfficientNet-B4 | 96.8% | 0.981 |
| Capsule Network | 97.4% | 0.986 |
| Face X-ray | 98.1% | 0.991 |
| **FaceForge (Ours)** | **99.33%** | **0.9995** |

## Usage

### Installation

```bash

pip install torch torchvision timm pillow numpy

```

### Inference

```python

import torch

import timm

from PIL import Image

from torchvision import transforms



# Load model

model = timm.create_model('xception', pretrained=False, num_classes=2)

checkpoint = torch.load('detector_best.pth', map_location='cpu')

model.load_state_dict(checkpoint['model_state_dict'])

model.eval()



# Preprocessing

transform = transforms.Compose([

    transforms.Resize((224, 224)),

    transforms.ToTensor(),

    transforms.Normalize(mean=[0.5, 0.5, 0.5], std=[0.5, 0.5, 0.5])

])



# Inference

def detect_deepfake(image_path):

    image = Image.open(image_path).convert('RGB')

    input_tensor = transform(image).unsqueeze(0)

    

    with torch.no_grad():

        logits = model(input_tensor)

        probs = torch.softmax(logits, dim=1)

        prediction = torch.argmax(probs, dim=1).item()

        confidence = probs[0][prediction].item()

    

    label = "REAL" if prediction == 0 else "FAKE"

    return label, confidence



# Example

label, confidence = detect_deepfake("face.jpg")

print(f"Prediction: {label} (Confidence: {confidence:.2%})")

```

### Gradio Demo

```python

import gradio as gr



def predict(image):

    label, confidence = detect_deepfake(image)

    return f"{label} ({confidence:.2%})"



demo = gr.Interface(

    fn=predict,

    inputs=gr.Image(type="filepath"),

    outputs="text",

    title="FaceForge Deepfake Detector",

    description="Upload a face image to detect if it's real or AI-generated"

)



demo.launch()

```

## Training Details

### Dataset
- **Source:** FaceForensics++ (c40 compression)
- **Training:** 7,000 images (3,500 real + 3,500 fake)
- **Validation:** 1,500 images (750 real + 750 fake)
- **Test:** 1,500 images (750 real + 750 fake)

### Hyperparameters
```yaml

optimizer: AdamW

learning_rate: 1e-4

weight_decay: 1e-4

batch_size: 32

epochs: 10

lr_schedule: Cosine Annealing (1e-4 β†’ 1e-6)

augmentation:

  - Random Horizontal Flip (p=0.5)

  - Color Jitter (brightness=0.2, contrast=0.2, saturation=0.2)

```

### Training Time
- **Total:** 7.52 hours (451.5 minutes)
- **Per Epoch:** ~45 minutes
- **Hardware:** CPU (no GPU required)
- **Best Checkpoint:** Epoch 8 (0.9998 AUC-ROC)

## Limitations

1. **Dataset Scope:** Trained on FaceForensics++ deepfakes; may need fine-tuning for other manipulation methods
2. **Single Frame:** Processes individual images; doesn't leverage temporal information from videos
3. **Compression:** Trained on c40 compression; performance may vary with different quality levels
4. **Domain:** Optimized for face-centric images; may struggle with partial faces or unusual angles

## Ethical Considerations

This model is intended for:
βœ… Research and education
βœ… Content moderation
βœ… Forensic analysis
βœ… Fact-checking

**Not intended for:**
❌ Malicious surveillance
❌ Discriminatory profiling
❌ Invasion of privacy

## Citation

```bibtex

@techreport{nasir2026faceforge,

  title={FaceForge: A Deep Learning Framework for Facial Manipulation Generation and Detection},

  author={Nasir, Huzaifa},

  institution={National University of Computer and Emerging Sciences},

  year={2026},

  doi={10.5281/zenodo.18530439}

}

```

## Links

- πŸ“„ **Paper:** https://doi.org/10.5281/zenodo.18530439
- πŸ’» **Code:** https://github.com/Huzaifanasir95/FaceForge
- 🎨 **Generator Model:** https://huggingface.co/Huzaifanasir95/faceforge-generator
- 🌐 **Interactive Demo:** See notebook `03_detector_and_adversarial.ipynb`

## License

This model is released under CC BY 4.0 license. See LICENSE file for details.

## Author

**Huzaifa Nasir**  
National University of Computer and Emerging Sciences (NUCES)  
Islamabad, Pakistan  
πŸ“§ nasirhuzaifa95@gmail.com

## Acknowledgments

- FaceForensics++ dataset creators
- PyTorch and timm library developers
- XceptionNet architecture (FranΓ§ois Chollet)