PubTables-1M: Towards comprehensive table extraction from unstructured
documents
Paper
•
2110.00061
•
Published
•
3
Optimized Table Tokenization for Table Structure Recognition
Paper
•
2305.03393
•
Published
•
1
Qwen3-VL Technical Report
Paper
•
2511.21631
•
Published
•
148
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
Paper
•
2510.14528
•
Published
•
111
PaddlePaddle/PaddleOCR-VL
Image-Text-to-Text
•
1.0B
•
Updated
•
15.9k
•
1.44k
DeepSeek-OCR: Contexts Optical Compression
Paper
•
2510.18234
•
Published
•
86
Image-Text-to-Text
•
3B
•
Updated
•
3.4M
•
3.03k
HunyuanOCR Technical Report
Paper
•
2511.19575
•
Published
•
22
Image-Text-to-Text
•
1.0B
•
Updated
•
874k
•
703
DocReward: A Document Reward Model for Structuring and Stylizing
Paper
•
2510.11391
•
Published
•
27
SynthDoc: Bilingual Documents Synthesis for Visual Document
Understanding
Paper
•
2408.14764
•
Published
OmniLayout: Enabling Coarse-to-Fine Learning with LLMs for Universal
Document Layout Generation
Paper
•
2510.26213
•
Published
•
9
MonkeyOCR v1.5 Technical Report: Unlocking Robust Document Parsing for Complex Patterns
Paper
•
2511.10390
•
Published
Structured Document Translation via Format Reinforcement Learning
Paper
•
2512.05100
•
Published
•
1