Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense Paper • 2510.07242 • Published 21 days ago • 30
PosterGen: Aesthetic-Aware Paper-to-Poster Generation via Multi-Agent LLMs Paper • 2508.17188 • Published Aug 24 • 17
Flora: Low-Rank Adapters Are Secretly Gradient Compressors Paper • 2402.03293 • Published Feb 5, 2024 • 6