Running on Zero • Featured • Qwen Image Layered 🚀 • 347 • Decompose an image into layers and export as PPTX or ZIP
Running • The Jagged AI Frontier is a Data Frontier 🧭 • 14 • Why AI capabilities are shaped by data availability
mlx-community/SmolVLM2-500M-Video-Instruct-mlx Video-Text-to-Text • Updated Feb 20, 2025 • 1.31k • 18
Article • Welcome GPT OSS, the new open-source model family from OpenAI! • +10 • Aug 5, 2025 • 508
Article • TimeScope: How Long Can Your Video Large Multimodal Model Go? • +2 • Jul 23, 2025 • 46
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens Paper • 2506.17218 • Published Jun 20, 2025 • 29
Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance Paper • 2502.06145 • Published Feb 10, 2025 • 18
Article • nanoVLM: The simplest repository to train your VLM in pure PyTorch • +5 • May 21, 2025 • 247