Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
sbintuitions
/
sarashina2.2-vision-3b
like
17
Follow
SB Intuitions
309
Image-to-Text
Transformers
Safetensors
Japanese
English
sarashina2_vision
text-generation
multimodal
vision-language
custom_code
arxiv:
5 papers
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
sarashina2.2-vision-3b
7.61 GB
Ctrl+K
Ctrl+K
1 contributor
History:
4 commits
toshi-456
Update config.json
3d4feec
verified
2 months ago
.gitattributes
Safe
1.57 kB
Upload 16 files
5 months ago
LICENSE
Safe
1.07 kB
Upload 16 files
5 months ago
README.md
Safe
6.95 kB
Upload README.md
5 months ago
chat_template.json
Safe
1.12 kB
Upload 16 files
5 months ago
config.json
Safe
1.53 kB
Update config.json
2 months ago
configuration_sarashina2_vision.py
Safe
2.92 kB
Upload 16 files
5 months ago
generation_config.json
Safe
133 Bytes
Upload 16 files
5 months ago
model.safetensors
Safe
7.6 GB
xet
Upload 16 files
5 months ago
modeling_sarashina2_vision.py
Safe
11.7 kB
Upload 16 files
5 months ago
preprocessor_config.json
Safe
646 Bytes
Upload 16 files
5 months ago
processing_sarashina2_vision.py
Safe
24 kB
Upload 16 files
5 months ago
processor_config.json
Safe
152 Bytes
Upload 16 files
5 months ago
sample.jpg
Safe
819 kB
xet
Upload 16 files
5 months ago
special_tokens_map.json
Safe
968 Bytes
Upload 16 files
5 months ago
tokenizer.json
Safe
6.72 MB
Upload 16 files
5 months ago
tokenizer.model
Safe
1.83 MB
xet
Upload 16 files
5 months ago
tokenizer_config.json
Safe
5.05 kB
Upload 16 files
5 months ago