X's picture

2

X

Phoebe13

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 28 days ago

Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards

authored a paper about 2 months ago

Video-MTR: Reinforced Multi-Turn Reasoning for Long Video Understanding

upvoted a paper about 2 months ago

Video-MTR: Reinforced Multi-Turn Reasoning for Long Video Understanding

View all activity

Organizations

None yet

Papers 1

arxiv:2508.20478

models 15

Phoebe13/Video-MTR

Visual Question Answering • 8B • Updated Sep 3 • 38 • 7

Phoebe13/Qwen-2.5-7B-Instruct_Explore0.5_30k_stage234_v1.2_ev_handcomp_simple_with_handtype

Phoebe13/Qwen-2.5-7B-Instruct_Explore0.5_30k_stage234_v1.2_ev_handcomp_simple

Phoebe13/Qwen-2.5-7B-Instruct_Explore0.5_30k_stage234_ev_handcomp_simple

Phoebe13/Qwen-2.5-7B-Instruct_Explore0.25_12k_stage234_ev_handcomp_simple

Phoebe13/Qwen-2.5-7B-Instruct-Poker-30k_stage234_ev-by-handcomp-simple

Phoebe13/Qwen-2.5-7B-Instruct-Poker-30k_stage1234_ev-by-handcomp-simple

Phoebe13/Qwen-2.5-7B-Instruct-Poker-16k_stage234_ev-by-handcomp-simple

Phoebe13/Qwen-2.5-7B-Instruct-Poker-ev-by-handcomp-simple

Phoebe13/Qwen-2.5-7B-Poker-RL-StrictFormat-ev-by-handcomp-simple

datasets 0

None public yet