OpenGVLab/VideoChat-TPO
Tags: Video-Text-to-Text, Transformers, Safetensors, feature-extraction, custom_code
arxiv: 2412.19326
License: mit
Branch: main
Path: VideoChat-TPO/third_party/cgdetr/cg_detr
331 kB, 3 contributors, History: 1 commit
This model has 1 file scanned as suspicious.
ynhe, init, 16dc4f2, about 1 year ago
Name                        Size       Commit  Updated
__pycache__                 -          init    about 1 year ago
scripts                     -          init    about 1 year ago
__init__.py                 0 Bytes    init    about 1 year ago
attention.py                20.8 kB    init    about 1 year ago
config.py                   16.2 kB    init    about 1 year ago
crossattention.py           21 kB      init    about 1 year ago
inference.py                18.5 kB    init    about 1 year ago
matcher.py                  5.68 kB    init    about 1 year ago
misc.py                     499 Bytes  init    about 1 year ago
model.py                    63.9 kB    init    about 1 year ago
position_encoding.py        4.35 kB    init    about 1 year ago
postprocessing_cg_detr.py   3.85 kB    init    about 1 year ago
span_utils.py               4.04 kB    init    about 1 year ago
start_end_dataset.py        17 kB      init    about 1 year ago
text_encoder.py             1.78 kB    init    about 1 year ago
train.py                    11 kB      init    about 1 year ago
transformer.py              37.7 kB    init    about 1 year ago