Scaling Spatial Intelligence with Multimodal Foundation Models Paper • 2511.13719 • Published Nov 17, 2025 • 46
Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos Paper • 2501.13826 • Published Jan 23, 2025 • 23
LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models Paper • 2407.12772 • Published Jul 17, 2024 • 35
OtterHD: A High-Resolution Multi-modality Model Paper • 2311.04219 • Published Nov 7, 2023 • 34
MIMIC-IT: Multi-Modal In-Context Instruction Tuning Paper • 2306.05425 • Published Jun 8, 2023 • 11