view article Article Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks 3 days ago • 12
VibeVoice Collection Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 7 items • Updated 4 days ago • 134
view article Article Building for an Open Future - our new partnership with Google Cloud 11 days ago • 45
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5 • 238
view article Article Llasa Goes RL: Training LLaSA with GRPO for Improved Prosody and Expressiveness 18 days ago • 10
Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation Paper • 2510.06961 • Published Oct 8 • 4
view article Article huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning 28 days ago • 67
view article Article LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR Oct 23 • 60
view article Article High-Quality Datasets for Far-Field ASR (Treble Technologies x Hugging Face) Oct 13 • 16
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders Jul 9 • 718