File size: 4,359 Bytes
172e665
a6661e0
 
 
 
 
 
 
 
 
 
 
 
 
 
172e665
 
a6661e0
172e665
a6661e0
172e665
a6661e0
172e665
a6661e0
172e665
a6661e0
 
 
 
 
172e665
a6661e0
 
 
172e665
a6661e0
172e665
a6661e0
 
 
 
172e665
a6661e0
172e665
a6661e0
 
172e665
a6661e0
 
172e665
a6661e0
 
 
172e665
a6661e0
172e665
a6661e0
 
 
 
172e665
a6661e0
172e665
a6661e0
 
172e665
a6661e0
 
172e665
a6661e0
172e665
a6661e0
 
 
 
 
 
172e665
a6661e0
172e665
a6661e0
 
 
172e665
a6661e0
 
 
172e665
a6661e0
172e665
a6661e0
 
 
 
172e665
a6661e0
172e665
a6661e0
172e665
a6661e0
172e665
a6661e0
172e665
a6661e0
 
 
172e665
a6661e0
 
 
 
172e665
a6661e0
 
 
 
172e665
a6661e0
172e665
a6661e0
 
 
 
172e665
a6661e0
172e665
a6661e0
172e665
a6661e0
172e665
a6661e0
172e665
a6661e0
172e665
a6661e0
 
 
 
 
 
 
 
 
 
172e665
a6661e0
172e665
a6661e0
172e665
a6661e0
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
---
license: apache-2.0
base_model: dima806/ai_vs_real_image_detection
tags:
- image-classification
- vision
- ai-detection
- deepfake-detection
- vit
datasets:
- CIFAKE
metrics:
- accuracy
- f1
pipeline_tag: image-classification
---

# CapCheck AI Image Detection

Vision Transformer (ViT) fine-tuned for detecting AI-generated images.

## Model Lineage & Attribution

This model builds on the work of others:

| Layer | Model | Author | License |
|-------|-------|--------|---------|
| Base Architecture | [google/vit-base-patch16-224-in21k](https://huggingface.co/google/vit-base-patch16-224-in21k) | Google | Apache 2.0 |
| AI Detection Fine-tune | [dima806/ai_vs_real_image_detection](https://huggingface.co/dima806/ai_vs_real_image_detection) | dima806 | Apache 2.0 |
| This Model | capcheck/ai-image-detection | CapCheck | Apache 2.0 |

**Special thanks to:**
- **Google** for the Vision Transformer (ViT) architecture
- **dima806** for fine-tuning on the CIFAKE dataset for AI image detection

## Model Description

- **Architecture**: ViT-Base (86M parameters)
- **Input Size**: 224x224 pixels
- **Training Data**: CIFAKE dataset (AI-generated vs real images)
- **Task**: Binary classification (Real vs Fake/AI-generated)

## Usage

```python
from transformers import pipeline

detector = pipeline("image-classification", model="capcheck/ai-image-detection")
result = detector("path/to/image.jpg")

# Output:
# [{"label": "Fake", "score": 0.95}, {"label": "Real", "score": 0.05}]
```

## Labels

| Label | Description |
|-------|-------------|
| `Real` | Authentic photograph or real-world image |
| `Fake` | AI-generated or synthetically created image |

## Performance

This model was trained on the CIFAKE dataset. Performance on modern AI generators
(Flux, Midjourney v6, DALL-E 3, Stable Diffusion 3) may vary.

See [dima806's model card](https://huggingface.co/dima806/ai_vs_real_image_detection)
for detailed training metrics.

## Limitations

- Trained primarily on older AI generators (pre-2024)
- May have reduced accuracy on:
  - Very new AI generators not in training data
  - Heavily compressed images (low JPEG quality)
  - Images smaller than 224x224 pixels
- Works best on images with clear subjects

## Intended Use

- Content moderation and authenticity verification
- Research into AI-generated content detection
- Educational purposes

**Not intended for**:
- Making consequential decisions without human review
- Law enforcement evidence without corroborating sources

## Ethical Considerations

- This tool is not 100% accurate - false positives harm legitimate creators
- False negatives can allow misinformation to spread
- Use in conjunction with other verification methods
- Human review is recommended for high-stakes decisions

## Roadmap

### Current Version (v1.0.0)

Base model from dima806's CIFAKE-trained ViT. Solid foundation for AI detection.

### Planned Improvements

**Phase 1: Modern Generator Training**
- Fine-tune on images from Flux, Midjourney v6, DALL-E 3, Stable Diffusion 3
- Target: Reduce false negatives on 2024+ AI generators

**Phase 2: False Positive Reduction**
- Curate dataset of real images commonly flagged as AI
- Photography edge cases: HDR, heavy editing, digital art
- Target: <5% false positive rate

**Phase 3: Continuous Updates**
- Quarterly re-training as new generators emerge
- Community feedback integration
- Benchmark against latest AI generators

### Contributing

We welcome:
- Dataset contributions (properly licensed images)
- Bug reports and false positive/negative examples
- Benchmark results on new generators

Join the discussion: https://huggingface.co/capcheck/ai-image-detection/discussions

## License

Apache 2.0 (inherited from Google ViT and dima806's fine-tuned model)

## Citation

If you use this model, please cite:

```bibtex
@misc{capcheck-ai-detection,
  author = {CapCheck},
  title = {AI Image Detection Model},
  year = {2024},
  publisher = {HuggingFace},
  url = {https://huggingface.co/capcheck/ai-image-detection},
  note = {Based on dima806/ai_vs_real_image_detection, fine-tuned from google/vit-base-patch16-224-in21k}
}
```

## Changelog

### v1.0.0 (Initial Release)

- Published base model from dima806/ai_vs_real_image_detection
- Added proper attribution and documentation
- Established as CapCheck's source of truth for AI image detection