metadata
license: apache-2.0
pipeline_tag: any-to-any
HaploOmni: Unified Single Transformer for Multimodal Video Understanding and Generation
Paper: https://arxiv.org/pdf/2506.02975
Code: https://github.com/Tencent/HaploVLM/tree/main/haploomni
license: apache-2.0
pipeline_tag: any-to-any
HaploOmni: Unified Single Transformer for Multimodal Video Understanding and Generation
Paper: https://arxiv.org/pdf/2506.02975
Code: https://github.com/Tencent/HaploVLM/tree/main/haploomni