Latent Diffusion Model without Variational Autoencoder Paper • 2510.15301 • Published 14 days ago • 48
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published 24 days ago • 457
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 216
The Hyperfitting Phenomenon: Sharpening and Stabilizing LLMs for Open-Ended Text Generation Paper • 2412.04318 • Published Dec 5, 2024 • 1
OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning Paper • 2509.01644 • Published Sep 1 • 33
Predicting the Order of Upcoming Tokens Improves Language Modeling Paper • 2508.19228 • Published Aug 26 • 22
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published Aug 7 • 177
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence Paper • 2507.21046 • Published Jul 28 • 81
Pixels, Patterns, but No Poetry: To See The World like Humans Paper • 2507.16863 • Published Jul 21 • 68
Should We Still Pretrain Encoders with Masked Language Modeling? Paper • 2507.00994 • Published Jul 1 • 78