mistralai/Voxtral-Mini-4B-Realtime-2602 Automatic Speech Recognition β’ 4B β’ Updated Mar 11 β’ 1.11M β’ 875
Running on Zero Agents Featured 1.95k Qwen3-TTS Demo π 1.95k Generate speech from text using voice design, cloning or presets
Running Featured 131 Ministral WebGPU β‘ 131 Frontier multimodal AI, running entirely in your browser.
Running on CPU Upgrade Agents 1.02k Open VLM Leaderboard π 1.02k VLMEvalKit Evaluation Results Collection
Running on Zero MCP 413 Multimodal OCR π 413 Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR
docling-project/SmolDocling-256M-preview Image-Text-to-Text β’ 0.3B β’ Updated Sep 17, 2025 β’ 24.1k β’ 1.61k
Running on Zero Agents Featured 1.78k Dia 1.6B π― 1.78k Generate realistic dialogue from a script, using Dia!