mradermacher/Self-Certainty-Qwen3-1.7B-Base-MATH-GGUF Reinforcement Learning • 2B • Updated Oct 11, 2025 • 163 • 1