view article Article Process Reinforcement through Implicit Rewards By ganqu and 1 other β’ Jan 3 β’ 31
βοΈ Liquid Nanos Collection Library of task-specific models: https://www.liquid.ai/blog/introducing-liquid-nanos-frontier-grade-performance-on-everyday-devices β’ 21 items β’ Updated 4 days ago β’ 88
view article Article AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models By imomayiz and 4 others β’ Sep 16 β’ 18
Parakeet Collection NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. β’ 12 items β’ Updated 13 days ago β’ 45
view article Article Building Conversational AI: A Deep Dive into Voice Agent Architectures and Best Practices By abdeljalilELmajjodi β’ Sep 2 β’ 9
view article Article Introducing Marvis TTS: Real-Time Streaming Speech Synthesis By prince-canuma and 1 other β’ Aug 27 β’ 14
view article Article Luth: Efficient French Specialization for Small Language Models By MaxLSB and 1 other β’ Aug 11 β’ 17
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper β’ 2508.18265 β’ Published Aug 25 β’ 202
view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels Aug 18 β’ 85
view article Article Rank-Stabilized LoRA: Unlocking the Potential of LoRA Fine-Tuning By damjan-k β’ Feb 20, 2024 β’ 29
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi β’ 15 items β’ Updated Apr 18 β’ 240