MARS: Enabling Autoregressive Models Multi-Token Generation Paper • 2604.07023 • Published 6 days ago • 34
Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning Paper • 2604.04746 • Published 6 days ago • 67
Query-Kontext: An Unified Multimodal Model for Image Generation and Editing Paper • 2509.26641 • Published Sep 30, 2025 • 4
Gated Condition Injection without Multimodal Attention: Towards Controllable Linear-Attention Transformers Paper • 2603.27666 • Published 16 days ago • 18
Running on Zero Featured 10 StyleRenderer 🎨 10 Generate stylized video from game G‑buffer inputs
Salt: Self-Consistent Distribution Matching with Cache-Aware Training for Fast Video Generation Paper • 2604.03118 • Published 11 days ago • 6
SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing Paper • 2604.04911 • Published 8 days ago • 35
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 8 days ago • 104
Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision Paper • 2604.04934 • Published 8 days ago • 42