Augmenting Unsupervised Reinforcement Learning with Self-Reference Paper • 2311.09692 • Published Nov 16, 2023 • 1
Provable General Function Class Representation Learning in Multitask Bandits and MDPs Paper • 2205.15701 • Published May 31, 2022
EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones Paper • 2211.09703 • Published Nov 17, 2022
Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing Paper • 2407.08770 • Published Jul 11, 2024 • 21
How Far is Video Generation from World Model: A Physical Law Perspective Paper • 2411.02385 • Published Nov 4, 2024 • 34
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Paper • 2504.13837 • Published Apr 18, 2025 • 135