view article Article The Optimal Architecture for Small Language Models codelion • Dec 26, 2025 • 121
GLM-4.7-Flash-SynthLabs Collection SYNTH-like reasoning for GLM 4.7 Flash model • 4 items • Updated Jan 22 • 2
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders thomwolf, matthieu-lapeyre • Jul 9, 2025 • 800
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times Paper • 2512.16093 • Published Dec 18, 2025 • 97
view article Article We Got Claude to Fine-Tune an Open Source LLM burtenshaw, evalstate • Dec 4, 2025 • 626
STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis Paper • 2506.06276 • Published Jun 6, 2025 • 26
Real-time Vision Models Collection A collection of real-time detectors. • 20 items • Updated Feb 18 • 23
LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence Paper • 2509.03505 • Published Sep 3, 2025 • 7
Tiny Reasoning Language Model Collection Collection dedicated to the development of the Tiny Reasoning Language Model (trlm) • 7 items • Updated Jan 26 • 7
view article Article mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL driaforall • Sep 11, 2025 • 26
view article Article RexBERT: Encoders for a brave new world of E-Commerce thebajajra • Sep 20, 2025 • 50
🎯 Liquid Nanos Collection Library of task-specific models: https://www.liquid.ai/blog/introducing-liquid-nanos-frontier-grade-performance-on-everyday-devices • 26 items • Updated Apr 8 • 114
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face +3 abidlabs, znation, nouamanetazi, sasha, qgallouedec • Jul 29, 2025 • 223
view article Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez • Sep 11, 2025 • 188
view article Article Welcome EmbeddingGemma, Google's new efficient embedding model +4 tomaarsen, Xenova, alvarobartt, ariG23498, pcuenq, sergiopaniego • Sep 4, 2025 • 274
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 15 items • Updated Mar 10 • 645
view article Article Training and Finetuning Sparse Embedding Models with Sentence Transformers tomaarsen, arthurbresnu • Jul 1, 2025 • 138
Lingshu MLLMs Collection Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning • 5 items • Updated Apr 8 • 21