arxiv:2509.02534
Jason Weston
spermwhale
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
10 days ago
The Alignment Waltz: Jointly Training Agents to Collaborate for Safety
upvoted
a
paper
26 days ago
The Era of Real-World Human Interaction: RL from User Conversations
upvoted
a
paper
about 2 months ago
The Majority is not always right: RL training for solution aggregation
Organizations
None yet