Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training Paper ⢠2509.26625 ⢠Published Sep 30 ⢠43
PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models Paper ⢠2412.18608 ⢠Published Dec 24, 2024 ⢠18
PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models Paper ⢠2412.18608 ⢠Published Dec 24, 2024 ⢠18 ⢠2
Running on T4 109 109 CountGD_Multi-Modal_Open-World_Counting š Count objects in images using text, visual examples, or both
SHAP-EDITOR: Instruction-guided Latent 3D Editing in Seconds Paper ⢠2312.09246 ⢠Published Dec 14, 2023 ⢠9