File size: 468 Bytes
ba0cc74
 
 
 
 
 
 
 
 
 
 
6500eee
ba0cc74
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
---
license: apache-2.0
tags:
- vision-language
- video
- internvl
- homework
---

# InterVL-HW1

Trained and exported on 2025-10-13_11-29-14.

- Backbone: InternVLChatModel
- AMP dtype: bfloat16
- Uses video pixel_values with temporal mean-pooling in vision encoder.
- Includes training checkpoint in `checkpoints/`.

> If you trained with a monkey-patched forward, runtime weights are still standard. You can reuse them with the original InternVLChatModel codebase.