xlalex's Collections
encoder
data
svg
video
interleaved
ocr
3d
world model
omni
infra
synthesis
perception
survey
RL
critic
speech full duplex
agent
self-play

encoder

updated 9 days ago

  • OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

    Paper • 2505.04601 • Published May 7 • 28