Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment Article • Published Feb 11 • 87
Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense Paper • 2510.07242 • Published Oct 8 • 30
Understanding Language Prior of LVLMs by Contrasting Chain-of-Embedding Paper • 2509.23050 • Published Sep 27 • 14
Infusing Theory of Mind into Socially Intelligent LLM Agents Paper • 2509.22887 • Published Sep 26 • 5
LUMINA: Detecting Hallucinations in RAG System with Context-Knowledge Signals Paper • 2509.21875 • Published Sep 26 • 9
Clean First, Align Later: Benchmarking Preference Data Cleaning for Reliable LLM Alignment Paper • 2509.23564 • Published Sep 28 • 7
MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems Paper • 2505.18943 • Published May 25 • 24
DaWin: Training-free Dynamic Weight Interpolation for Robust Adaptation Paper • 2410.03782 • Published Oct 3, 2024 • 1
Towards Calibrated Robust Fine-Tuning of Vision-Language Models Paper • 2311.01723 • Published Nov 3, 2023
Learning Fair Representation via Distributional Contrastive Disentanglement Paper • 2206.08743 • Published Jun 17, 2022
Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach Paper • 2502.00577 • Published Feb 1