phoenixbai's picture

phoenixbai

phoenixbai

·

phoenixbai

AI & ML interests

None yet

Organizations

None yet

upvoted 5 papers 5 months ago

Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch

Paper • 2512.02395 • Published Dec 2, 2025 • 51

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 94

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 268

Soft Adaptive Policy Optimization

Paper • 2511.20347 • Published Nov 25, 2025 • 43

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 106

upvoted an article 6 months ago

Article

Why Did MiniMax M2 End Up as a Full Attention Model?

MiniMax-AI

•

Oct 30, 2025

• 80

upvoted a paper about 1 year ago

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published Apr 3, 2025 • 58

upvoted an article about 1 year ago

Article

Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚

Isayoften

•

Aug 26, 2024

• 91