arxiv:2510.25726
Yuzhen Huang
yuzhen17
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 7 hours ago
Scaling Latent Reasoning via Looped Language Models
authored
a paper
about 7 hours ago
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic,
and Long-Horizon Task Execution
upvoted
a
paper
about 13 hours ago
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic,
and Long-Horizon Task Execution