InfoXLM: An Information-Theoretic Framework for Cross-Lingual Language Model Pre-Training Paper • 2007.07834 • Published Jul 15, 2020
XLM-T: Scaling up Multilingual Machine Translation with Pretrained Cross-lingual Transformer Encoders Paper • 2012.15547 • Published Dec 31, 2020
FlashBack:Efficient Retrieval-Augmented Language Modeling for Long Context Inference Paper • 2405.04065 • Published May 7, 2024
Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment Paper • 2106.06381 • Published Jun 11, 2021
On the Representation Collapse of Sparse Mixture of Experts Paper • 2204.09179 • Published Apr 20, 2022 • 1
Language Is Not All You Need: Aligning Perception with Language Models Paper • 2302.14045 • Published Feb 27, 2023
Think Only When You Need with Large Hybrid-Reasoning Models Paper • 2505.14631 • Published May 20 • 20
The Era of Agentic Organization: Learning to Organize with Language Models Paper • 2510.26658 • Published 25 days ago • 26