# My CLIP Video-Text Model

This model was trained on the MSR-VTT dataset using a custom CLIP-based architecture.