Everyday Physics in Korean Contexts: A Culturally Grounded Physical Reasoning Benchmark • Paper • 2509.17807 • Published Sep 22
Are Vision-Language Models Safe in the Wild? A Meme-Based Benchmark Study • Paper • 2505.15389 • Published May 21