ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4 • Reinforcement Learning • 15B • Updated Feb 13 • 3.77k • 816
Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent • Paper • 2312.08926 • Published Dec 14, 2023 • 10