Toto: Time Series Optimized Transformer for Observability Paper • 2407.07874 • Published Jul 10, 2024 • 34
A decoder-only foundation model for time-series forecasting Paper • 2310.10688 • Published Oct 14, 2023 • 7
Running on CPU Upgrade Featured 2.37k The Smol Training Playbook 📚 Featured 2.37k The secrets to building world-class LLMs
Running Featured 168 Gradio Hackathon Registration Winter 25 📝 Featured 168 Gradio Agents & MCP Hackathon Winter 2025 Registration Page
Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization Paper • 2509.23202 • Published Sep 27 • 27
Made with Jean Zay Collection Work performed using Jean Zay Supercomputer resources from GENCI-IDRIS • 4 items • Updated 26 days ago
AION-1: Omnimodal Foundation Model for Astronomical Sciences Paper • 2510.17960 • Published Oct 20 • 28