Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
sanps
/
fVLM-1.7B
like
0
Image-Text-to-Text
PyTorch
Safetensors
English
foveated_vlm
vision-language
video-understanding
foveated-attention
multimodal
smollm2
dinov2
Eval Results (legacy)
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
fVLM-1.7B
14.3 GB
1 contributor
History:
13 commits
sanps
Upload README.md with huggingface_hub
77b40f5
verified
4 days ago
configs
Upload fVLM-1.7B: Foveated Vision-Language Model (Stage 3 DPO)
6 days ago
model_code
Upload fVLM-1.7B: Foveated Vision-Language Model (Stage 3 DPO)
6 days ago
.gitattributes
Safe
1.52 kB
initial commit
6 days ago
README.md
9.27 kB
Upload README.md with huggingface_hub
4 days ago
benchmark.py
Safe
18.4 kB
Upload benchmark.py
5 days ago
benchmark_results.json
14.1 kB
Upload benchmark_results.json with huggingface_hub
4 days ago
checkpoint.pt
pickle
Detected Pickle imports (4)
"torch.FloatStorage"
,
"collections.OrderedDict"
,
"torch.BFloat16Storage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
10.5 GB
xet
Upload stage 3 (DPO) checkpoint (step 2593)
5 days ago
config.json
Safe
504 Bytes
Upload fVLM-1.7B: Foveated Vision-Language Model (Stage 3 DPO)
6 days ago
data.py
Safe
28.8 kB
Upload data.py
5 days ago
logger.py
Safe
10.1 kB
Upload logger.py
5 days ago
model.py
Safe
42.4 kB
Upload model.py
5 days ago
model.safetensors
3.72 GB
xet
Upload fVLM-1.7B: Foveated Vision-Language Model (Stage 3 DPO)
6 days ago
train.py
Safe
39.5 kB
Upload train.py
5 days ago