sunnyg's Collections
Vision

updated Nov 16
  • facebook/sapiens

    Updated Sep 20, 2024 • 134 • 243

  • microsoft/TRELLIS-image-large

    Image-to-3D • Updated Dec 6, 2024 • 2.49M • 606

  • vikhyatk/moondream2

    Image-Text-to-Text • 2B • Updated Sep 23 • 2.02M • 1.35k

  • allenai/olmOCR-7B-0225-preview

    Image-to-Text • 8B • Updated Aug 19 • 5.99k • 706

  • microsoft/OmniParser-v2.0

    Updated Mar 28 • 807 • 1.31k

  • CohereLabs/aya-vision-32b

    Image-Text-to-Text • 33B • Updated Oct 30 • 307 • 217

  • reducto/RolmOCR

    Image-to-Text • 8B • Updated Apr 2 • 2.76k • 568

  • ByteDance-Seed/BAGEL-7B-MoT

    Any-to-Any • 15B • Updated 18 days ago • 936 • 1.17k

  • Hcompany/Holo1-7B

    Image-Text-to-Text • 8B • Updated Jun 10 • 407 • 224

  • CohereLabs/command-a-vision-07-2025

    Image-Text-to-Text • 112B • Updated Oct 30 • 40k • 85

  • apple/FastVLM-7B

    Text Generation • 8B • Updated Sep 3 • 804 • 264

  • moondream/moondream3-preview

    Image-Text-to-Text • 9B • Updated Oct 9 • 6.24k • 527

  • facebook/seamless-interaction

    Updated Jul 14 • 32.6k • 164

  • yonigozlan/EdgeTAM-hf

    Mask Generation • 13.9M • Updated Nov 6 • 5.77k • 67

  • zai-org/WebVIA-Agent

    Image-Text-to-Text • 10B • Updated Nov 12 • 56 • 16