Multi-Agent Evolve: LLM Self-Improve through Co-evolution Paper • 2510.23595 • Published 8 days ago • 8
Redco: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs Paper • 2310.16355 • Published Oct 25, 2023
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena Paper • 2306.05685 • Published Jun 9, 2023 • 37
Toward Inference-optimal Mixture-of-Expert Large Language Models Paper • 2404.02852 • Published Apr 3, 2024
LLM360 K2: Building a 65B 360-Open-Source Large Language Model from Scratch Paper • 2501.07124 • Published Jan 13
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective Paper • 2506.14965 • Published Jun 17 • 49
Efficient Long-context Language Model Training by Core Attention Disaggregation Paper • 2510.18121 • Published 15 days ago • 117
Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs Paper • 2510.11062 • Published 22 days ago • 26
GTAlign: Game-Theoretic Alignment of LLM Assistants for Mutual Welfare Paper • 2510.08872 • Published 25 days ago • 2
TrimLLM: Progressive Layer Dropping for Domain-Specific LLMs Paper • 2412.11242 • Published Dec 15, 2024 • 1
ReFoRCE: A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration Paper • 2502.00675 • Published Feb 2 • 2
GameArena: Evaluating LLM Reasoning through Live Computer Games Paper • 2412.06394 • Published Dec 9, 2024 • 1
ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization Paper • 2406.05981 • Published Jun 10, 2024 • 16
When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models Paper • 2406.07368 • Published Jun 11, 2024 • 2
Efficiently Serving LLM Reasoning Programs with Certaindex Paper • 2412.20993 • Published Dec 30, 2024 • 37
Efficiently Serving LLM Reasoning Programs with Certaindex Paper • 2412.20993 • Published Dec 30, 2024 • 37