1 70 12

Taekyung Ki

taekyungki

https://taekyungki.github.io

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

Visual Diffusion Models are Geometric Solvers

upvoted a paper 9 days ago

WithAnyone: Towards Controllable and ID Consistent Image Generation

upvoted a paper 9 days ago

VISTA: A Test-Time Self-Improving Video Generation Agent

View all activity

Organizations

upvoted a paper 8 days ago

Visual Diffusion Models are Geometric Solvers

Paper • 2510.21697 • Published 10 days ago • 18

upvoted 5 papers 9 days ago

upvoted a paper 21 days ago

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published 21 days ago • 160

upvoted 2 papers 22 days ago

TAG:Tangential Amplifying Guidance for Hallucination-Resistant Diffusion Sampling

Paper • 2510.04533 • Published 29 days ago • 47

Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs

Paper • 2510.09201 • Published 25 days ago • 47

upvoted a paper 24 days ago

Reinforcing Diffusion Models by Direct Group Preference Optimization

Paper • 2510.08425 • Published 25 days ago • 11

upvoted 5 papers about 1 month ago

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published Oct 2 • 91

ACON: Optimizing Context Compression for Long-horizon LLM Agents

Paper • 2510.00615 • Published Oct 1 • 31

OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models

Paper • 2509.17627 • Published Sep 22 • 65

A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning

Paper • 2509.15937 • Published Sep 19 • 20

Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification

Paper • 2509.15591 • Published Sep 19 • 45

upvoted a paper about 2 months ago

Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis

Paper • 2509.09595 • Published Sep 11 • 48

liked a dataset about 2 months ago

IVLLab/MultiDialog

Updated Aug 29, 2024 • 577 • 26

upvoted a paper about 2 months ago

HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Paper • 2509.08519 • Published Sep 10 • 126

liked a dataset about 2 months ago

ZiqiaoPeng/DualTalk_Dataset

Updated Jul 12 • 80 • 3

upvoted a paper 2 months ago

TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis

Paper • 2508.13618 • Published Aug 19 • 17

Taekyung Ki

AI & ML interests

Recent Activity

Organizations

taekyungki's activity