sanps
/

fVLM-135M

Image-Text-to-Text

vision-language

video-understanding

foveated-attention

Model card Files Files and versions

fVLM-135M / model_code

52.7 kB

Ctrl+K

Ctrl+K

1 contributor

History: 1 commit

sanps's picture

Upload fVLM-135M: Foveated Vision-Language Model (Stage 3 DPO)

6d320d6 verified 4 months ago

__init__.py

258 Bytes
Upload fVLM-135M: Foveated Vision-Language Model (Stage 3 DPO) 4 months ago
encoder.py

15.5 kB
Upload fVLM-135M: Foveated Vision-Language Model (Stage 3 DPO) 4 months ago
foveated_vlm.py

37 kB
Upload fVLM-135M: Foveated Vision-Language Model (Stage 3 DPO) 4 months ago