PeijieDong

pprp

https://pprp.github.io

AI & ML interests

Model Compression; Large Language Models

Recent Activity

liked a model 1 day ago

nvidia/Nemotron-Flash-1B

liked a dataset 17 days ago

Idavidrein/gpqa

liked a dataset 22 days ago

nvidia/Llama-Nemotron-VLM-Dataset-v1

Organizations

None yet

upvoted 2 papers about 1 month ago

Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights

Paper • 2512.01816 • Published Dec 1, 2025 • 88

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 282

upvoted a paper 3 months ago

Recon-Act: A Self-Evolving Multi-Agent Browser-Use System via Web Reconnaissance, Tool Generation, and Task Execution

Paper • 2509.21072 • Published Sep 25, 2025 • 15

upvoted a paper 4 months ago

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Paper • 2509.07980 • Published Sep 9, 2025 • 101

upvoted a paper 5 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 259

upvoted a paper 7 months ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14, 2025 • 300

upvoted a collection 7 months ago

Small Language Models (SLMs)

Collection

<3B • 33 items • Updated Jun 11, 2025 • 1

upvoted a paper 7 months ago

Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression

Paper • 2505.19433 • Published May 26, 2025 • 5

upvoted a collection 8 months ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 179

upvoted a paper 9 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14, 2025 • 306

upvoted a paper 11 months ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published Feb 10, 2025 • 58

upvoted an article 11 months ago

Article

Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel

May 2, 2022

upvoted a paper 11 months ago

Mediator: Memory-efficient LLM Merging with Less Parameter Conflicts and Uncertainty Based Routing

Paper • 2502.04411 • Published Feb 6, 2025 • 4

upvoted an article 12 months ago

Article

Token Merging for fast LLM inference : Background and first trials with Mistral

Apr 30, 2024

upvoted 6 papers about 1 year ago

Should We Really Edit Language Models? On the Evaluation of Edited Language Models

Paper • 2410.18785 • Published Oct 24, 2024 • 7

FlatQuant: Flatness Matters for LLM Quantization

Paper • 2410.09426 • Published Oct 12, 2024 • 15

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Paper • 2410.10819 • Published Oct 14, 2024 • 7

LPZero: Language Model Zero-cost Proxy Search from Zero

Paper • 2410.04808 • Published Oct 7, 2024 • 2

Benchmarking Agentic Workflow Generation

Paper • 2410.07869 • Published Oct 10, 2024 • 29

PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs

Paper • 2410.05265 • Published Oct 7, 2024 • 33