Encoder-Free Human Motion Understanding via Structured Motion Descriptions Paper • 2604.21668 • Published 12 days ago • 2
Encoder-Free Human Motion Understanding via Structured Motion Descriptions Paper • 2604.21668 • Published 12 days ago • 2
Encoder-Free Human Motion Understanding via Structured Motion Descriptions Paper • 2604.21668 • Published 12 days ago • 2
Benchmarking and Mechanistic Analysis of Vision-Language Models for Cross-Depiction Assembly Instruction Alignment Paper • 2604.00913 • Published Apr 1 • 4
Benchmarking and Mechanistic Analysis of Vision-Language Models for Cross-Depiction Assembly Instruction Alignment Paper • 2604.00913 • Published Apr 1 • 4
Benchmarking and Mechanistic Analysis of Vision-Language Models for Cross-Depiction Assembly Instruction Alignment Paper • 2604.00913 • Published Apr 1 • 4 • 3
Benchmarking and Mechanistic Analysis of Vision-Language Models for Cross-Depiction Assembly Instruction Alignment Paper • 2604.00913 • Published Apr 1 • 4
NanoVDR: Distilling a 2B Vision-Language Retriever into a 70M Text-Only Encoder for Visual Document Retrieval Paper • 2603.12824 • Published Mar 13 • 5