ParaStudent: Generating and Evaluating Realistic Student Code by Teaching LLMs to Struggle Paper • 2507.12674 • Published Jul 16
Puzzled by Puzzles: When Vision-Language Models Can't Take a Hint Paper • 2505.23759 • Published May 29 • 5
LISAT: Language-Instructed Segmentation Assistant for Satellite Imagery Paper • 2505.02829 • Published May 5
See, Say, and Segment: Teaching LMMs to Overcome False Premises Paper • 2312.08366 • Published Dec 13, 2023
Visual Haystacks: Answering Harder Questions About Sets of Images Paper • 2407.13766 • Published Jul 18, 2024 • 2
CLAIR-A: Leveraging Large Language Models to Judge Audio Captions Paper • 2409.12962 • Published Sep 19, 2024 • 2
Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling Paper • 2504.13169 • Published Apr 17 • 39
Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence Paper • 2305.14334 • Published May 23, 2023 • 1