Kyu Song

kyunocap

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals

upvoted a paper about 1 month ago

UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions

liked a Space about 1 month ago

HuggingFaceTB/smol-training-playbook

Organizations

None yet

upvoted 5 papers about 1 month ago

Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals

Paper • 2510.27684 • Published Oct 31 • 22

UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions

Paper • 2511.03334 • Published Nov 5 • 51

upvoted a paper about 2 months ago

Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing

Paper • 2510.19808 • Published Oct 22 • 28

upvoted 6 papers 2 months ago

Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation

Paper • 2510.01284 • Published Sep 30 • 33

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26 • 184

ReviewScore: Misinformed Peer Review Detection with Large Language Models

Paper • 2509.21679 • Published Sep 25 • 63

Seedream 4.0: Toward Next-generation Multimodal Image Generation

Paper • 2509.20427 • Published Sep 24 • 80

SD3.5-Flash: Distribution-Guided Distillation of Generative Flows

Paper • 2509.21318 • Published Sep 25 • 10

Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24 • 98

upvoted 2 papers 3 months ago

DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Paper • 2509.16117 • Published Sep 19 • 21

RewardDance: Reward Scaling in Visual Generation

Paper • 2509.08826 • Published Sep 10 • 73

upvoted 6 papers 4 months ago

Waver: Wave Your Way to Lifelike Video Generation

Paper • 2508.15761 • Published Aug 21 • 34

EdgeFusion: On-Device Text-to-Image Generation

Paper • 2404.11925 • Published Apr 18, 2024 • 23

DINOv3

Paper • 2508.10104 • Published Aug 13 • 285

Cut2Next: Generating Next Shot via In-Context Tuning

Paper • 2508.08244 • Published Aug 11 • 13

Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off

Paper • 2508.04825 • Published Aug 6 • 58

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 263
