InternVL3.5-Flash Collection InternVL3.5-Flash is a fast variant of InternVL3.5 using semantic aware dynamic high-resolution strategy. • 9 items • Updated 16 days ago • 6
ERNIE 4.5 Collection collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 26 items • Updated Sep 24 • 174
ProRank: Prompt Warmup via Reinforcement Learning for Small Language Models Reranking Paper • 2506.03487 • Published Jun 4 • 4
Describe Anything Collection Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 9 days ago • 58
view article Article Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub By nvidia and 11 others • Jun 27 • 29
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15, 2024 • 189
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 Mar 26 • 168
Unsloth 4-bit Dynamic Quants Collection Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit • 28 items • Updated about 2 hours ago • 87
π_0: A Vision-Language-Action Flow Model for General Robot Control Paper • 2410.24164 • Published Oct 31, 2024 • 29