facebook/dinov3-vit7b16-pretrain-lvd1689m Image Feature Extraction • 7B • Updated Aug 19 • 26.4k • 195
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21 • 398
openai/whisper-large-v3 Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 4.9M • 5.17k
Runtime error • Multi Voice TTS (English/Chinese/Japanese) • 150 • [Chinese/English/Japanese] multilingual text-to-speech
Running • Featured • Qwen2.5 Omni 7B Demo • 362 • Generate text and speech from text, audio, images, and videos
openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 4.57M • 2.71k