Vchitect

non-profit

https://vchitect.intern-ai.org.cn/

AI & ML interests

generative models, video generation

Recent Activity

weepiess2383 authored a paper about 3 hours ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Jianxiong updated a model 3 days ago

Vchitect/LongVie2

Jianxiong new activity 3 days ago

Vchitect/LongVie2:Create README.md

Papers

Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark

weepiess2383

authored a paper about 3 hours ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 3 days ago • 60

Jianxiong

updated a model 3 days ago

Vchitect/LongVie2

Image-to-Video • Updated 3 days ago • 19

Jianxiong

in Vchitect/LongVie2 3 days ago

Create README.md

#1 opened 5 days ago by

Add model card for LongVie 2

#2 opened 5 days ago by

ynhe

updated a dataset 3 days ago

Vchitect/VBench-2.0_human_annotation

Preview • Updated 3 days ago • 100 • 1

ynhe

updated a dataset 4 days ago

Vchitect/VBench_human_annotation

Preview • Updated 4 days ago • 42 • 1

Jianxiong

authored 2 papers 7 days ago

Veila: Panoramic LiDAR Generation from a Monocular RGB Image

Paper • 2508.03690 • Published Aug 5

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Paper • 2512.13604 • Published 10 days ago • 70

zhengli1013

authored a paper 8 days ago

OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and Manipulation

Paper • 2512.08294 • Published 17 days ago • 17

ChenyangSi

authored 7 papers 10 days ago

CoS: Chain-of-Shot Prompting for Long Video Understanding

Paper • 2502.06428 • Published Feb 10 • 10

FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model

Paper • 2507.01953 • Published Jul 2 • 18

LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation

Paper • 2508.03694 • Published Aug 5 • 51

SpineBench: A Clinically Salient, Level-Aware Benchmark Powered by the SpineMed-450k Corpus

Paper • 2510.03160 • Published Oct 3 • 4

RealDPO: Real or Not Real, that is the Preference

Paper • 2510.14955 • Published Oct 16 • 6

DiverseAR: Boosting Diversity in Bitwise Autoregressive Image Generation

Paper • 2512.02931 • Published 23 days ago

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Paper • 2512.13604 • Published 10 days ago • 70

Jianxiong

submitted a paper to Daily Papers 10 days ago

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Paper • 2512.13604 • Published 10 days ago • 70

Jianxiong

published a model 10 days ago

Vchitect/LongVie2

Image-to-Video • Updated 3 days ago • 19

zhengli1013

authored a paper 18 days ago

EditThinker: Unlocking Iterative Reasoning for Any Image Editor

Paper • 2512.05965 • Published 20 days ago • 38

syingxi

updated a dataset 19 days ago

Vchitect/VBench_sampled_video

Viewer • Updated 19 days ago • 200 • 3.95k • 1