Ki-Ung song's picture

15 5 8

Ki-Ung song

sk851

·

https://kiungsong.github.io

KiUngSong

AI & ML interests

Generative model / Multimodal

Recent Activity

upvoted a paper about 1 month ago

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

upvoted a paper 6 months ago

Seeing Voices: Generating A-Roll Video from Audio with Mirage

authored a paper 8 months ago

Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features

View all activity

Organizations

upvoted a paper about 1 month ago

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

Paper • 2510.05684 • Published Oct 7 • 139

upvoted a paper 6 months ago

Seeing Voices: Generating A-Roll Video from Audio with Mirage

Paper • 2506.08279 • Published Jun 9 • 27

authored a paper 8 months ago

Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features

Paper • 2504.00557 • Published Apr 1 • 15