Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Lu9876
/
VideoTG_R1
like
0
Video-Text-to-Text
Transformers
Safetensors
yeliudev/VideoMind-Dataset
video-temporal-grounding
multimodal-llm
reinforcement-learning
curriculum-learning
arxiv:
2510.23397
arxiv:
2410.17434
License:
mit
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
VideoTG_R1
/
Intermediate_results
/
partially_annotated_sample_human_300
121 kB
2 contributors
History:
1 commit
Lu9876
Upload folder using huggingface_hub
5677557
verified
4 months ago
didemo_sampled_100_case.json
Safe
39.3 kB
Upload folder using huggingface_hub
4 months ago
internvid_vtime_sampled_100_case.json
Safe
42 kB
Upload folder using huggingface_hub
4 months ago
qvhighlights_sampled_100_case.json
Safe
39.5 kB
Upload folder using huggingface_hub
4 months ago