PICABench: How Far Are We from Physically Realistic Image Editing? Paper • 2510.17681 • Published 12 days ago • 61
TrajSelector: Harnessing Latent Representations for Efficient and Effective Best-of-N in Large Reasoning Model Paper • 2510.16449 • Published 14 days ago • 34
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models Paper • 2407.04693 • Published Jul 5, 2024 • 3
LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking Paper • 2407.04020 • Published Jul 4, 2024 • 4
Understanding Visual Feature Reliance through the Lens of Complexity Paper • 2407.06076 • Published Jul 8, 2024 • 7
Training Task Experts through Retrieval Based Distillation Paper • 2407.05463 • Published Jul 7, 2024 • 10
PAS: Data-Efficient Plug-and-Play Prompt Augmentation System Paper • 2407.06027 • Published Jul 8, 2024 • 11
Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images Paper • 2407.06191 • Published Jul 8, 2024 • 14
InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct Paper • 2407.05700 • Published Jul 8, 2024 • 14
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale Paper • 2407.05282 • Published Jul 7, 2024 • 15
Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction Paper • 2407.03651 • Published Jul 4, 2024 • 18
ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation Paper • 2407.06135 • Published Jul 8, 2024 • 23
Learning Action and Reasoning-Centric Image Editing from Videos and Simulations Paper • 2407.03471 • Published Jul 3, 2024 • 31
LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages Paper • 2407.05975 • Published Jul 8, 2024 • 37
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? Paper • 2407.04842 • Published Jul 5, 2024 • 56