Sangwoo Park PRO

Jackson0018

https://psw0021.github.io/

AI & ML interests

natural language processing/Reinforcement Learning

Recent Activity

published a dataset 7 days ago

Jackson0018/Raw_Train_Dataset_Semantic_Scholar

published a dataset 7 days ago

Jackson0018/Final_Train_Set

updated a dataset 7 days ago

Jackson0018/Final_Train_Set

View all activity

Organizations

upvoted 4 papers 20 days ago

CWM: An Open-Weights LLM for Research on Code Generation with World Models

Paper • 2510.02387 • Published Sep 30 • 7

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published 24 days ago • 173

Front-Loading Reasoning: The Synergy between Pretraining and Post-Training Data

Paper • 2510.03264 • Published Sep 26 • 23

Revisiting the Uniform Information Density Hypothesis in LLM Reasoning Traces

Paper • 2510.06953 • Published 29 days ago • 7

upvoted 3 papers 21 days ago

upvoted a paper 25 days ago

Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs

Paper • 2510.09201 • Published 27 days ago • 48

upvoted a paper 28 days ago

When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs

Paper • 2510.07499 • Published 29 days ago • 48

upvoted 2 papers about 1 month ago

Rethinking Reward Models for Multi-Domain Test-Time Scaling

Paper • 2510.00492 • Published Oct 1 • 27

ACON: Optimizing Context Compression for Long-horizon LLM Agents

Paper • 2510.00615 • Published Oct 1 • 31

upvoted a collection 4 months ago

Qwen3

Collection

84 items • Updated Aug 6 • 1.39k

upvoted 2 papers 4 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 308

FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait

Paper • 2412.01064 • Published Dec 2, 2024 • 47

upvoted a paper 5 months ago

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published May 23 • 81

upvoted 2 papers 6 months ago

System Prompt Optimization with Meta-Learning

Paper • 2505.09666 • Published May 14 • 71

UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities

Paper • 2504.20734 • Published Apr 29 • 61

upvoted 2 papers 7 months ago

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published Apr 24 • 120

T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models

Paper • 2504.04718 • Published Apr 7 • 42

upvoted a paper 8 months ago

Silent Branding Attack: Trigger-free Data Poisoning Attack on Text-to-Image Diffusion Models

Paper • 2503.09669 • Published Mar 12 • 35

Sangwoo Park PRO

AI & ML interests

Recent Activity

Organizations

Jackson0018's activity