# OpenAI-Clip: Optimized for Mobile Deployment
## Multi-modal foundational model for vision and language tasks like image/text similarity and for zero-shot image classification
Contrastive Language-Image Pre-Training (CLIP) uses a ViT-like transformer to extract visual features and a causal language model to extract text features. Both the text and visual features can then be used for a variety of zero-shot learning tasks.
This model is an implementation of OpenAI-Clip found [here](https://github.com/openai/CLIP/).
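
As a minimal sketch of the zero-shot classification flow described above, using the reference PyTorch package from the linked repository (the image path and candidate labels below are placeholders chosen for illustration):

```python
import clip  # pip install git+https://github.com/openai/CLIP.git
import torch
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"

# Load a CLIP checkpoint along with its matching image preprocessing pipeline.
model, preprocess = clip.load("ViT-B/32", device=device)

# Placeholder inputs: any image file and any set of candidate captions.
image = preprocess(Image.open("example.jpg")).unsqueeze(0).to(device)
text = clip.tokenize(["a photo of a cat", "a photo of a dog"]).to(device)

with torch.no_grad():
    # Visual features from the ViT-like image encoder, text features
    # from the causal language model.
    image_features = model.encode_image(image)
    text_features = model.encode_text(text)

    # Image/text similarity: the model returns scaled cosine similarities;
    # a softmax over them yields zero-shot classification probabilities.
    logits_per_image, logits_per_text = model(image, text)
    probs = logits_per_image.softmax(dim=-1).cpu().numpy()

print("Label probabilities:", probs)
```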
This repository provides scripts to run OpenAI-Clip on Qualcomm® devices.
More details on model performance across various devices can be found
[here](https://aihub.qualcomm.com/models/openai_clip).