2 15 8

zpysky1125

pyzhao

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

MiniMaxAI/MiniMax-M2.1

liked a dataset 4 days ago

MiniMaxAI/VIBE

upvoted a collection 11 days ago

VTP

View all activity

Organizations

upvoted a collection 11 days ago

VTP

Collection

Towards Scalable Pre-training of Visual Tokenizers for Generation • 4 items • Updated 11 days ago • 39

upvoted a paper 11 days ago

Towards Scalable Pre-training of Visual Tokenizers for Generation

Paper • 2512.13687 • Published 12 days ago • 96

upvoted 3 articles about 2 months ago

Article

What makes good reasoning data

Oct 30

•

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30

•

Article

Why Did MiniMax M2 End Up as a Full Attention Model?

Oct 30

•

upvoted a paper 4 months ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8 • 79

upvoted a paper 6 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 273

upvoted 3 papers 7 months ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30 • 143

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Paper • 2505.19641 • Published May 26 • 68

One RL to See Them All: Visual Triple Unified Reinforcement Learning

Paper • 2505.18129 • Published May 23 • 61

upvoted 2 papers 8 months ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12 • 82

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

Paper • 2505.07916 • Published May 12 • 134

upvoted a paper 9 months ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published Mar 28 • 45

upvoted a paper 12 months ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 103

upvoted an article over 1 year ago

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Mar 20, 2024

•

106

zpysky1125

AI & ML interests

Recent Activity

Organizations

pyzhao's activity

What makes good reasoning data

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Why Did MiniMax M2 End Up as a Full Attention Model?

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models