# My CLIP Video-Text Model
This model was trained on the MSR-VTT dataset using a custom CLIP-based architecture.
Training now uses an N-pairs margin loss.
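
This README does not spell out the exact form of the loss, so the following is only a minimal sketch of one common reading: a batch of N matched video-text pairs, where every other item in the batch serves as a negative and a hinge penalty applies whenever a negative comes within `margin` of the positive. The function name `n_pairs_margin_loss`, the `margin=0.2` default, and the use of NumPy are assumptions for illustration, not the training code of this model.

```python
import numpy as np


def n_pairs_margin_loss(video_emb, text_emb, margin=0.2):
    """Hypothetical in-batch margin loss; row i of each input is a matched pair."""
    video_emb = np.asarray(video_emb, dtype=float)
    text_emb = np.asarray(text_emb, dtype=float)
    n = video_emb.shape[0]
    if n < 2:
        raise ValueError("need at least 2 pairs to form negatives")

    # L2-normalize so the dot product is cosine similarity.
    v = video_emb / np.linalg.norm(video_emb, axis=1, keepdims=True)
    t = text_emb / np.linalg.norm(text_emb, axis=1, keepdims=True)

    sim = v @ t.T          # sim[i, j] = cos(video i, text j)
    pos = np.diag(sim)     # similarity of each matched pair

    # Video-to-text: each text j != i is a negative for video i.
    cost_v2t = np.maximum(0.0, margin + sim - pos[:, None])
    # Text-to-video: each video i != j is a negative for text j.
    cost_t2v = np.maximum(0.0, margin + sim - pos[None, :])

    # Exclude the diagonal (the positives) and average over all negatives.
    neg_mask = ~np.eye(n, dtype=bool)
    total = cost_v2t[neg_mask].sum() + cost_t2v[neg_mask].sum()
    return float(total / (2 * neg_mask.sum()))
```

Under this reading, the loss is zero once every matched pair scores at least `margin` higher than all of its in-batch negatives, and it grows linearly with each violation.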