arxiv:2505.22172
xiang huang
xianghuang
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
ChARM: Character-based Act-adaptive Reward Modeling for Advanced
Role-Playing Language Agents
upvoted
a
paper
about 2 months ago
Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM
Reasoning
upvoted
a
paper
about 2 months ago
AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via
Reinforcement Learning
Organizations
None yet