Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning Paper • 2505.07538 • Published May 12
Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens Paper • 2504.14666 • Published Apr 20
Expanding the Action Space of LLMs to Reason Beyond Language Paper • 2510.07581 • Published 26 days ago • 7
Expanding the Action Space of LLMs to Reason Beyond Language Paper • 2510.07581 • Published 26 days ago • 7
view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 243
Exploring Diffusion Time-steps for Unsupervised Representation Learning Paper • 2401.11430 • Published Jan 21, 2024 • 1
Exploring Diffusion Time-steps for Unsupervised Representation Learning Paper • 2401.11430 • Published Jan 21, 2024 • 1