--- license: apache-2.0 pipeline_tag: any-to-any --- HaploOmni: Unified Single Transformer for Multimodal Video Understanding and Generation Paper: https://arxiv.org/pdf/2506.02975 Code: https://github.com/Tencent/HaploVLM/tree/main/haploomni