-
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
Paper • 2509.07980 • Published • 100 -
Robot Learning from a Physical World Model
Paper • 2511.07416 • Published • 28 -
MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning
Paper • 2511.06805 • Published • 12 -
GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms
Paper • 2511.17592 • Published • 117
Harihara Valliappan
HarishValliappan
·
AI & ML interests
None yet
Recent Activity
updated
a collection
1 day ago
RL
updated
a collection
3 days ago
RL
upvoted
a
paper
3 days ago
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
Organizations
None yet