Video-Text-to-Text
Transformers
Safetensors
English
Chinese
qwen2_5_omni
text-to-audio
omni-modal
video-understanding
multimodal-llm
agent
active-perception
reinforcement-learning
Instructions to use harryhsing/OmniAgent-RL-7B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use harryhsing/OmniAgent-RL-7B with Transformers:
# Load model directly from transformers import AutoProcessor, AutoModelForMultimodalLM processor = AutoProcessor.from_pretrained("harryhsing/OmniAgent-RL-7B") model = AutoModelForMultimodalLM.from_pretrained("harryhsing/OmniAgent-RL-7B") - Notebooks
- Google Colab
- Kaggle
Welcome to the community
The community tab is the place to discuss and collaborate with the HF community!