EdgeCrafter: Compact ViTs for Edge Dense Prediction via Task-Specialized Distillation Paper β’ 2603.18739 β’ Published Mar 19 β’ 11
FakeParts: a New Family of AI-Generated DeepFakes Paper β’ 2508.21052 β’ Published Aug 28, 2025 β’ 8
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper β’ 2508.18265 β’ Published Aug 25, 2025 β’ 224
InternVL3.5 Collection This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). β’ 45 items β’ Updated Mar 2 β’ 110
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 ariG23498, lusxvr, andito, sergiopaniego, merve, pcuenq, reach-vb β’ May 21, 2025 β’ 258
view article Article SmolVLM Grows Smaller β Introducing the 256M & 500M Models! +1 andito, mfarre, merve β’ Jan 23, 2025 β’ 192
view article Article SigLIP 2: A better multilingual vision language encoder +1 ariG23498, merve, qubvel-hf β’ Feb 21, 2025 β’ 216
MUAD: Multiple Uncertainties for Autonomous Driving, a benchmark for multiple uncertainty types and tasks Paper β’ 2203.01437 β’ Published Mar 2, 2022 β’ 1