DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation Paper • 2510.14949 • Published Oct 16 • 5
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making Paper • 2410.07166 • Published Oct 9, 2024 • 3