Running on Zero Agents 5 SAM-3 vs SAM-3-LiteText 🖼 5 Compare text‑guided image segmentation with two SAM‑3 models
SAM3-LiteText: An Anatomical Study of the SAM3 Text Encoder for Efficient Vision-Language Segmentation Paper • 2602.12173 • Published Feb 12 • 3
ibm-granite/granite-embedding-311m-multilingual-r2 Feature Extraction • 0.3B • Updated 2 days ago • 24.1k • • 85
Running Featured 77 Nemotron 3 Nano WebGPU ⚛ 77 A compact reasoning-capable model running in your browser.
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published Feb 20, 2025 • 164
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 895
jina-embeddings-v5-text Collection Our 5th-gen embeddings: two lightweight multilingual models with SOTA performance in retrieval, matching, clustering, and classification. • 29 items • Updated Feb 27 • 39
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 18 items • Updated 1 day ago • 294
view article Article ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases QuentinJG • Nov 5, 2025 • 64
Contextual AI Reranker v2 Collection Family of instruction-following multilingual rerankers on the cost/performance Pareto frontier across public and customer benchmarks • 9 items • Updated 28 days ago • 11
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding Paper • 2202.13669 • Published Feb 28, 2022 • 3
view article Article How We Built a Semantic Highlight Model To Save Token Cost for RAG zilliz • Jan 15 • 67