NVIDIA Nemotron Collection Open, Production-ready Enterprise Models. Nvidia Open Model license. • 5 items • Updated 5 days ago • 66
Article LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR By lightonai and 2 others • 3 days ago • 45
From Pixels to Words -- Towards Native Vision-Language Primitives at Scale Paper • 2510.14979 • Published 10 days ago • 64
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper • 2510.14528 • Published 10 days ago • 67
ERNIE 4.5 Collection A collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 26 items • Updated Sep 24 • 174
PaddleOCR-VL Collection Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model • 3 items • Updated 9 days ago • 16
Evaluating Arabic Large Language Models: A Survey of Benchmarks, Methods, and Gaps Paper • 2510.13430 • Published 11 days ago • 1
Article mem-agent: Equipping LLM Agents with Memory Using RL By driaforall and 1 other • 17 days ago • 30
Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play Paper • 2509.25541 • Published 27 days ago • 136
BERT Hash Nano Models Collection Set of BERT models with a modified embeddings layer • 3 items • Updated 20 days ago • 8
Language Detection Collection StaticVectors models to detect language. Exports of FastText that run in NumPy without needing FastText • 2 items • Updated Sep 18 • 4
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published 20 days ago • 446
Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer Paper • 2510.06590 • Published 19 days ago • 69
mmBERT: a modern multilingual encoder Collection mmBERT is trained on 3T tokens from over 1800 languages, showing SoTA scores on benchmarks and exceptional low-resource performance • 16 items • Updated Sep 9 • 45
Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 By manu • Jul 5, 2024 • 297