Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
jylins
/
vtsum_blip
like
3
English
cross-modal-video-summarization
video-summarization
video-captioning
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
vtsum_blip
2.97 GB
2 contributors
History:
4 commits
jylins
update readme
f7b4925
almost 2 years ago
.gitattributes
1.52 kB
initial commit
about 2 years ago
README.md
1.43 kB
update readme
almost 2 years ago
vt_clip.pth
1.8 GB
xet
add vt_clip
about 2 years ago
vtsum_tt.pth
582 MB
xet
Initial Commit
about 2 years ago
vtsum_tt_ca.pth
591 MB
xet
Initial Commit
about 2 years ago