Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models Paper • 2508.21365 • Published Aug 29 • 28
TALKPLAY: Multimodal Music Recommendation with Large Language Models Paper • 2502.13713 • Published Feb 19 • 3
gradientai/Llama-3-8B-Instruct-Gradient-1048k Text Generation • 8B • Updated Oct 29, 2024 • 14.3k • 678
Illustrating Reinforcement Learning from Human Feedback (RLHF) Article • Dec 9, 2022 • 367