PyTorch
TinyCLIP / README.md
cooper_robot
Add release note for v1.3.0
8cb365e
|
Raw
History Blame Contribute Delete
2.47 kB
metadata
library_name: pytorch

tinyclip_logo

TinyCLIP is a compact vision–language model that compresses CLIP through knowledge distillation, enabling efficient image–text representation learning with significantly lower compute and memory requirements.

Original paper: TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance, Wu et al., 2023

TinyCLIP-ViT8M16

This model uses the TinyCLIP variant, optimized for efficient image–text embedding generation while preserving strong zero-shot classification and retrieval performance. It is well suited for applications such as image retrieval, zero-shot classification, multimodal search, and edge vision-language deployments.

Model Configuration:

Model Device Model Link
TinyCLIP-ViT8M16 Image Encoder N1-655 Model_Link
TinyCLIP-ViT8M16 Text Encoder N1-655 Model_Link
TinyCLIP-ViT8M16 Image encoder CV7 Model_Link
TinyCLIP-ViT8M16 Text Encoder CV7 Model_Link
TinyCLIP-ViT8M16 Image encoder CV72 Model_Link
TinyCLIP-ViT8M16 Text Encoder CV72 Model_Link
TinyCLIP-ViT8M16 Image encoder CV75 Model_Link
TinyCLIP-ViT8M16 Text Encoder CV75 Model_Link