video_rag_v4 / README.md
aircrypto's picture
Add trained model and README
c293a0f

My CLIP Video-Text Model

This model was trained on the MSR-VTT dataset using a custom CLIP-based architecture. Now using an N-pairs margin loss for training.