lee dong ryeol

drlee1

DONGRYEOLLEE1

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

moonshotai/Kimi-Linear-48B-A3B-Instruct

Organizations

None yet

upvoted a paper 3 days ago

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published 9 days ago • 62

upvoted a paper 7 days ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published 8 days ago • 201

upvoted a paper 14 days ago

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published 16 days ago • 110

upvoted a paper 15 days ago

Efficient Long-context Language Model Training by Core Attention Disaggregation

Paper • 2510.18121 • Published 17 days ago • 117

upvoted a paper about 2 months ago

Memp: Exploring Agent Procedural Memory

Paper • 2508.06433 • Published Aug 8 • 34

upvoted 3 papers 2 months ago

upvoted an article 3 months ago

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

Article • Aug 9 • 43

upvoted 4 papers 3 months ago

Efficient Agents: Building Effective Agents While Reducing Cost

Paper • 2508.02694 • Published Jul 24 • 85

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2 • 236

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 307

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 259

upvoted a paper 5 months ago

Large Language Models for Data Synthesis

Paper • 2505.14752 • Published May 20 • 49

upvoted 3 papers 6 months ago

CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images

Paper • 2310.16825 • Published Oct 25, 2023 • 36

R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training

Paper • 2505.00358 • Published May 1 • 26

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21 • 88

upvoted 3 papers 7 months ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 120

Efficient Pretraining Length Scaling

Paper • 2504.14992 • Published Apr 21 • 20

Antidistillation Sampling

Paper • 2504.13146 • Published Apr 17 • 59
