Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails • Paper • 2510.04860 • Published Oct 6
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models • Paper • 2410.10139 • Published Oct 14, 2024