VADER: Towards Causal Video Anomaly Understanding with Relation-Aware Large Language Models Paper • 2511.07299 • Published 14 days ago • 4 • 3
DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning Paper • 2510.15110 • Published Oct 16 • 15 • 3
TC-LoRA: Temporally Modulated Conditional LoRA for Adaptive Diffusion Control Paper • 2510.09561 • Published Oct 10 • 7 • 2
Temporal Prompting Matters: Rethinking Referring Video Object Segmentation Paper • 2510.07319 • Published Oct 8 • 2 • 2
LEAML: Label-Efficient Adaptation to Out-of-Distribution Visual Tasks for Multimodal Large Language Models Paper • 2510.03232 • Published Oct 3 • 1 • 2
V2V-GoT: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multimodal Large Language Models and Graph-of-Thoughts Paper • 2509.18053 • Published Sep 22 • 3 • 3
V2V-LLM: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multi-Modal Large Language Models Paper • 2502.09980 • Published Feb 14 • 5 • 4
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks Paper • 2501.08326 • Published Jan 14 • 33 • 2
EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation Paper • 2410.21271 • Published Oct 28, 2024 • 7 • 2