LSVOS 2025 Challenge Report: Recent Advances in Complex Video Object Segmentation Paper • 2510.11063 • Published Oct 13 • 1
On the Faithfulness of Visual Thinking: Measurement and Enhancement Paper • 2510.23482 • Published Oct 27
Bypass Back-propagation: Optimization-based Structural Pruning for Large Language Models via Policy Gradient Paper • 2406.10576 • Published Jun 15, 2024
PP-MobileSeg: Explore the Fast and Accurate Semantic Segmentation Model on Mobile Devices Paper • 2304.05152 • Published Apr 11, 2023
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper • 2510.14528 • Published Oct 16 • 98
Beyond Appearance: Geometric Cues for Robust Video Instance Segmentation Paper • 2507.05948 • Published Jul 8 • 1
Learn from Downstream and Be Yourself in Multimodal Large Language Model Fine-Tuning Paper • 2411.10928 • Published Nov 17, 2024
A Survey of Safety on Large Vision-Language Models: Attacks, Defenses and Evaluations Paper • 2502.14881 • Published Feb 14 • 2
Geometric Knowledge-Guided Localized Global Distribution Alignment for Federated Learning Paper • 2503.06457 • Published Mar 9
A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment Paper • 2504.15585 • Published Apr 22 • 12
Backdoor Cleaning without External Guidance in MLLM Fine-tuning Paper • 2505.16916 • Published May 22 • 17
Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model Paper • 2503.04543 • Published Mar 6 • 1
Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning Paper • 2507.17512 • Published Jul 23 • 36
MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching Paper • 2501.11299 • Published Jan 20
MOS: A Low Latency and Lightweight Framework for Face Detection, Landmark Localization, and Head Pose Estimation Paper • 2110.10953 • Published Oct 21, 2021
Dual Structure-Aware Image Filterings for Semi-supervised Medical Image Segmentation Paper • 2312.07264 • Published Dec 12, 2023