Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities Paper • 2507.06261 • Published Jul 7, 2025 • 64 • 4
Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs Paper • 2506.14245 • Published Jun 17, 2025 • 45 • 8
It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization Paper • 2504.13173 • Published Apr 17, 2025 • 18 • 4
PixelFlow: Pixel-Space Generative Models with Flow Paper • 2504.07963 • Published Apr 10, 2025 • 18 • 6
PixelFlow: Pixel-Space Generative Models with Flow Paper • 2504.07963 • Published Apr 10, 2025 • 18 • 6
Continuous Diffusion Model for Language Modeling Paper • 2502.11564 • Published Feb 17, 2025 • 53 • 4
ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features Paper • 2502.04320 • Published Feb 6, 2025 • 36 • 3
BTS: Harmonizing Specialized Experts into a Generalist LLM Paper • 2502.00075 • Published Jan 31, 2025 • 1 • 1
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming Paper • 2501.18837 • Published Jan 31, 2025 • 10 • 5
VideoRAG: Retrieval-Augmented Generation over Video Corpus Paper • 2501.05874 • Published Jan 10, 2025 • 75 • 6
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8, 2025 • 287 • 44
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published Dec 13, 2024 • 108 • 8