9 22 10

Ziqi Huang

Ziqi

https://ziqihuangg.github.io/

AI & ML interests

Computer Vision, Generative Model, Image Generation, Video Generation, World Model

Recent Activity

upvoted a paper 9 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

upvoted a paper 14 days ago

Exploring MLLM-Diffusion Information Transfer with MetaCanvas

liked a Space 20 days ago

worldbench/WorldLens

View all activity

Organizations

upvoted a paper 9 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 10 days ago • 61

upvoted a paper 14 days ago

Exploring MLLM-Diffusion Information Transfer with MetaCanvas

Paper • 2512.11464 • Published 20 days ago • 12

liked a Space 20 days ago

WorldLens

🥇

Duplicate this leaderboard to initialize your own!

authored 3 papers about 1 month ago

upvoted a paper about 1 month ago

PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image

Paper • 2511.13648 • Published Nov 17, 2025 • 52

upvoted a paper about 2 months ago

Simulating the Visual World with Artificial Intelligence: A Roadmap

Paper • 2511.08585 • Published Nov 11, 2025 • 29

commented a paper about 2 months ago

Simulating the Visual World with Artificial Intelligence: A Roadmap

Paper • 2511.08585 • Published Nov 11, 2025 • 29 •

authored a paper 2 months ago

The Quest for Generalizable Motion Generation: Data, Model, and Evaluation

Paper • 2510.26794 • Published Oct 30, 2025 • 26

upvoted a paper 2 months ago

The Quest for Generalizable Motion Generation: Data, Model, and Evaluation

Paper • 2510.26794 • Published Oct 30, 2025 • 26

upvoted a paper 3 months ago

RealDPO: Real or Not Real, that is the Preference

Paper • 2510.14955 • Published Oct 16, 2025 • 6

commented a paper 3 months ago

RealDPO: Real or Not Real, that is the Preference

Paper • 2510.14955 • Published Oct 16, 2025 • 6 •

upvoted a paper 3 months ago

Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark

Paper • 2510.13759 • Published Oct 15, 2025 • 9

authored 6 papers 3 months ago

VBench: Comprehensive Benchmark Suite for Video Generative Models

Paper • 2311.17982 • Published Nov 29, 2023 • 9

Talk-to-Edit: Fine-Grained Facial Editing via Dialog

Paper • 2109.04425 • Published Sep 9, 2021

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Paper • 2501.08453 • Published Jan 14, 2025 • 1

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Paper • 2506.21356 • Published Jun 26, 2025 • 22

Cut2Next: Generating Next Shot via In-Context Tuning

Paper • 2508.08244 • Published Aug 11, 2025 • 13

CineScale: Free Lunch in High-Resolution Cinematic Visual Generation

Paper • 2508.15774 • Published Aug 21, 2025 • 20

Ziqi Huang

AI & ML interests

Recent Activity

Organizations

Ziqi's activity

WorldLens