BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages Paper • 2406.09948 • Published Jun 14, 2024 • 2
Are they lovers or friends? Evaluating LLMs' Social Reasoning in English and Korean Dialogues Paper • 2510.19028 • Published Oct 21, 2025 • 7
Open Korean Historical Corpus: A Millennia-Scale Diachronic Collection of Public Domain Texts Paper • 2510.24541 • Published Oct 28, 2025
MUG-Eval: A Proxy Evaluation Framework for Multilingual Generation Capabilities in Any Language Paper • 2505.14395 • Published May 20, 2025 • 6
When Does Classical Chinese Help? Quantifying Cross-Lingual Transfer in Hanja and Kanbun Paper • 2411.04822 • Published Nov 7, 2024
LLM-C3MOD: A Human-LLM Collaborative System for Cross-Cultural Hate Speech Moderation Paper • 2503.07237 • Published Mar 10, 2025
HERITAGE: An End-to-End Web Platform for Processing Korean Historical Documents in Hanja Paper • 2501.11951 • Published Jan 21, 2025
CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean Paper • 2403.06412 • Published Mar 11, 2024 • 3
CSRT: Evaluation and Analysis of LLMs using Code-Switching Red-Teaming Dataset Paper • 2406.15481 • Published Jun 17, 2024 • 1