---
license: apache-2.0
tags:
- vision-language
- video
- internvl
- homework
---
# InterVL-HW1
Trained and exported on 2025-10-13_11-29-14.
- Backbone: InternVLChatModel
- AMP dtype: bfloat16
- Uses video `pixel_values` with temporal mean-pooling in the vision encoder.
- Includes training checkpoint in `checkpoints/`.
> If you trained with a monkey-patched `forward`, the weights themselves are still standard, so you can reuse them with the original InternVLChatModel codebase.
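
As a rough illustration of what temporal mean-pooling means, the sketch below averages per-frame feature vectors over the time axis. It is not the code used to train this model: the function name `temporal_mean_pool`, the list-of-lists layout, and the toy values are all assumptions made for the example, and the card does not say at which point inside the vision encoder the pooling is applied.

```python
from statistics import fmean


def temporal_mean_pool(frame_features):
    """Average per-frame feature vectors over the time axis.

    frame_features: list of T frames, each a list of D floats
    (hypothetical layout, chosen only for this illustration).
    Returns a single list of D floats.
    """
    if not frame_features:
        raise ValueError("need at least one frame")
    dim = len(frame_features[0])
    if any(len(frame) != dim for frame in frame_features):
        raise ValueError("all frames must have the same feature size")
    # zip(*frames) groups the values of each feature dimension across frames,
    # so each group is averaged over time.
    return [fmean(values) for values in zip(*frame_features)]


# Toy input: 3 frames, 2 features per frame.
frames = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
pooled = temporal_mean_pool(frames)
print(pooled)
```

In a real pipeline the same reduction would normally be a single mean over the frame dimension of a tensor; plain lists are used here only to keep the example dependency-free.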