sbintuitions/sarashina2.2-ocr
Image-to-Text
Transformers
Safetensors
Japanese
English
sarashina2_vision
text-generation
multimodal
ocr
document-understanding
vision-language
custom_code
arxiv: 2503.09208
License: mit
main
sarashina2.2-ocr/preprocessor_config.json
Commit History
Initial commit
be3a9b8
toshi-456 committed 8 days ago