James-WYang/LIDR_M0_Meta-Llama-3-8B-Instruct_en_es_ru_de_fr
8B
All checkpoints for our work "Language Imbalance Driven Rewarding for Multilingual Self-improving" (https://arxiv.org/abs/2410.08964).