A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos
Paper
•
2512.16978
•
Published
•
4
Natural Language Processing, Machine Learning, and Computer Vision
A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos
Robust and Calibrated Detection of Authentic Multimedia Content