---
language:
- en
license: mit
tags:
- computer-vision
- clip
- deduplication
---

# AI Visual Gallery

This project implements an image deduplication pipeline using visual features extracted by CLIP (Contrastive Language-Image Pre-training).

## Methodology

1. **Feature Extraction**: Images are passed through a Vision Transformer (ViT).
2. **Indexing**: 512-dim feature vectors are cached.
3. **Comparison**: Cosine similarity is used to detect duplicates.

## Hardware Requirements

- RAM: 2GB+ (for caching feature maps)
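
As a rough illustration of the comparison step in the methodology (step 3), the sketch below flags pairs of cached feature vectors whose cosine similarity reaches a cutoff. It is not taken from this project's code: the `find_duplicates` helper, the `threshold` value of 0.95, and the file names are all assumptions, and toy 3-dim vectors stand in for the 512-dim CLIP features.

```python
import math
from itertools import combinations


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def find_duplicates(features, threshold=0.95):
    """Return (name_a, name_b, similarity) for each pair at or above threshold.

    `features` maps an image name to its cached feature vector.
    `threshold` is a hypothetical cutoff, not a value from this project.
    """
    pairs = []
    for (name_a, vec_a), (name_b, vec_b) in combinations(features.items(), 2):
        sim = cosine_similarity(vec_a, vec_b)
        if sim >= threshold:
            pairs.append((name_a, name_b, sim))
    return pairs


# Toy 3-dim vectors standing in for the cached 512-dim features.
features = {
    "a.jpg": [1.0, 0.0, 0.0],
    "a_copy.jpg": [0.99, 0.05, 0.0],
    "b.jpg": [0.0, 1.0, 0.0],
}
duplicates = find_duplicates(features)
for name_a, name_b, sim in duplicates:
    print(f"{name_a} ~ {name_b}: {sim:.3f}")
```

The pairwise loop is quadratic in the number of images, which is fine for a small gallery; a larger collection would likely want vectorized math or an approximate nearest-neighbor index instead.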