ARPO - a dongguanting Collection

dongguanting 's Collections

ARPO

updated Dec 20, 2025

The official datasets and model checkpoints of ARPO

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26, 2025 • 161
dongguanting/Qwen3-8B-ARPO-DeepSearch

8B • Updated Jul 29, 2025 • 15 • 2
dongguanting/QwQ-32B-ARPO-DeepSearch

33B • Updated Dec 20, 2025 • 3 • 1
dongguanting/Qwen3-14B-ARPO-DeepSearch

Text Generation • 15B • Updated Aug 12, 2025 • 6 • 5
dongguanting/ARPO-RL-DeepSearch-1K

Viewer • Updated Oct 17, 2025 • 1.07k • 218 • 6
dongguanting/Qwen2.5-7B-ARPO

Text Generation • 8B • Updated Aug 19, 2025 • 16 • 2
dongguanting/Llama3.1-8B-ARPO

Text Generation • 8B • Updated Aug 12, 2025 • 8 • 1
dongguanting/Qwen2.5-3B-ARPO

Text Generation • 3B • Updated Aug 12, 2025 • 35 • 3
dongguanting/ARPO-SFT-54K

Viewer • Updated Oct 17, 2025 • 54.6k • 423 • 15
dongguanting/ARPO-RL-Reasoning-10K

Viewer • Updated Oct 17, 2025 • 10k • 218 • 4