Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
sbintuitions
/
sarashina2.2-ocr
like
25
Follow
SB Intuitions
292
Image-to-Text
Transformers
Safetensors
Japanese
English
sarashina2_vision
text-generation
multimodal
ocr
document-understanding
vision-language
custom_code
arxiv:
2503.09208
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
sarashina2.2-ocr
Commit History
Update README.md
eafb8d4
verified
tkmtakada-sbint
commited on
14 days ago
Update readme.md
7d9d23f
verified
tkmtakada-sbint
commited on
18 days ago
update readme
6e0fb4b
verified
tkmtakada-sbint
commited on
18 days ago
Separate bbox image from table
f03a34c
verified
tkmtakada-sbint
commited on
18 days ago
Update README.md
b0eb834
verified
toshi-456
commited on
20 days ago
Initial commit
be3a9b8
toshi-456
commited on
21 days ago