Massively Multilingual Adaptation of Large Language Models Using Bilingual Translation Data Paper • 2506.00469 • Published May 31
EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models Paper • 2409.17892 • Published Sep 26, 2024
ObscuraCoder: Powering Efficient Code LM Pre-Training Via Obfuscation Grounding Paper • 2504.00019 • Published Mar 27
IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code Generators Paper • 2403.03894 • Published Mar 6, 2024
Triple-Encoders: Representations That Fire Together, Wire Together Paper • 2402.12332 • Published Feb 19, 2024
Imagination is All You Need! Curved Contrastive Learning for Abstract Sequence Modeling Utilized on Long Short-Term Dialogue Planning Paper • 2211.07591 • Published Nov 14, 2022
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions Paper • 2406.15877 • Published Jun 22, 2024