arxiv:2508.20478
Phoebe13
AI & ML interests
None yet
Recent Activity
upvoted a paper 28 days ago
Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards
authored a paper about 2 months ago
Video-MTR: Reinforced Multi-Turn Reasoning for Long Video Understanding
upvoted a paper about 2 months ago
Video-MTR: Reinforced Multi-Turn Reasoning for Long Video Understanding
Organizations
None yet