Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation Paper โข 2411.19331 โข Published Nov 28, 2024 โข 5
view article Article AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models By imomayiz and 4 others โข Sep 16 โข 18
YanoljaNEXT-Rosetta Collection Translation Model for JSON-Structured Data โข 3 items โข Updated Sep 3 โข 8
MobileCLIP2 Collection MobileCLIP2: Mobile-friendly image-text models with SOTA zero-shot capabilities trained on DFNDR-2B โข 37 items โข Updated Sep 18 โข 56
FastVLM Collection Efficient Vision Encoding for Vision Language Models โข 9 items โข Updated Sep 2 โข 103
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 โข 13 items โข Updated Aug 21 โข 366
view article Article Open-Source Handwritten Signature Detection Model By samuellimabraz โข Mar 14 โข 119
OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. โข 8 items โข Updated Nov 23, 2024 โข 88
timm tiny test models Collection A collection of very small (~300-500k parameter) models at 160x160 resolution, for testing purposes. Trained on ImageNet-1k. โข 13 items โข Updated Sep 19 โข 5
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second Paper โข 2410.02073 โข Published Oct 2, 2024 โข 41