CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation Paper • 2506.10890 • Published Jun 12
What Makes for Text to 360-degree Panorama Generation with Stable Diffusion? Paper • 2505.22129 • Published May 28
Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens Paper • 2503.08377 • Published Mar 11
MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization Paper • 2504.00999 • Published Apr 1
Long-Context Autoregressive Video Modeling with Next-Frame Prediction Paper • 2503.19325 • Published Mar 25
ROICtrl: Boosting Instance Control for Visual Generation Paper • 2411.17949 • Published Nov 27, 2024
Described Object Detection: Liberating Object Detection with Flexible Expressions Paper • 2307.12813 • Published Jul 24, 2023