Video-Text-to-Text
Transformers
Safetensors
video-temporal-grounding
multimodal-llm
reinforcement-learning
curriculum-learning
Instructions to use Lu9876/VideoTG_R1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Lu9876/VideoTG_R1 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("Lu9876/VideoTG_R1", dtype="auto") - Notebooks
- Google Colab
- Kaggle
Enhance model card for VideoTG-R1
#1
by nielsr HF Staff - opened
This PR replaces the placeholder model card with detailed information for the VideoTG-R1 model.
It includes:
- Updating metadata with
pipeline_tag: video-text-to-text,library_name: transformers,datasets: yeliudev/VideoMind-Dataset, and relevant tags likevideo-temporal-grounding,multimodal-llm,reinforcement-learning, andcurriculum-learning. - A link to the paper (VideoTG-R1: Boosting Video Temporal Grounding via Curriculum Reinforcement Learning on Reflected Boundary Annotations).
- A link to the GitHub repository (https://github.com/ldong1111/VideoTG-R1).
- A descriptive abstract and methodology visualizations.
- Comprehensive usage, training, and evaluation instructions with
bashcode snippets directly from the official GitHub README.
This update will significantly improve discoverability and usability for researchers interested in Video Temporal Grounding.
Lu9876 changed pull request status to merged