Monet: Reasoning in Latent Visual Space Beyond Images and Language Paper • 2511.21395 • Published 12 days ago • 15
MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs Paper • 2511.07250 • Published 28 days ago • 17
When Modalities Conflict: How Unimodal Reasoning Uncertainty Governs Preference Dynamics in MLLMs Paper • 2511.02243 • Published Nov 4 • 24
MorphoBench: A Benchmark with Difficulty Adaptive to Model Reasoning Paper • 2510.14265 • Published Oct 16 • 19
AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration Paper • 2510.10395 • Published Oct 12 • 29