Ki-Ung song's picture

15 5 8

Ki-Ung song

sk851

·

https://kiungsong.github.io

KiUngSong

AI & ML interests

Generative model / Multimodal

Recent Activity

upvoted a paper about 1 month ago

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

upvoted a paper 6 months ago

Seeing Voices: Generating A-Roll Video from Audio with Mirage

authored a paper 8 months ago

Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features

View all activity

Organizations

upvoted a paper about 1 month ago

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

Paper • 2510.05684 • Published Oct 7 • 139

upvoted a paper 6 months ago

Seeing Voices: Generating A-Roll Video from Audio with Mirage

Paper • 2506.08279 • Published Jun 9 • 27

upvoted a collection about 1 year ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 646

upvoted a collection over 1 year ago

Llama-3.1

4 items • Updated Oct 3, 2024 • 6

upvoted a collection almost 2 years ago

Gemma release

Groups the Gemma models released by the Google team. • 40 items • Updated Jul 10 • 345