Ruobing Xie's picture

1 22 6

Ruobing Xie

Ruobing-Xie

·

https://ruobingxie.github.io/

AI & ML interests

Recommender System; Large Language Model; Natural Language Processing; Information Retrieval

Recent Activity

upvoted an article 20 days ago

Why Did MiniMax M2 End Up as a Full Attention Model?

upvoted a paper 2 months ago

Why Language Models Hallucinate

upvoted a paper 4 months ago

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

View all activity

Organizations

None yet

authored a paper 6 months ago

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

Paper • 2505.22653 • Published May 28 • 66

authored a paper 10 months ago

Autonomy-of-Experts Models

Paper • 2501.13074 • Published Jan 22 • 44

authored a paper 11 months ago

Scaling Laws for Floating Point Quantization Training

Paper • 2501.02423 • Published Jan 5 • 26

authored 9 papers about 1 year ago

HMoE: Heterogeneous Mixture of Experts for Language Modeling

Paper • 2408.10681 • Published Aug 20, 2024 • 10

Advancing LLM Reasoning Generalists with Preference Trees

Paper • 2404.02078 • Published Apr 2, 2024 • 46

PhD: A Prompted Visual Hallucination Evaluation Dataset

Paper • 2403.11116 • Published Mar 17, 2024 • 3

Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication

Paper • 2402.18439 • Published Feb 28, 2024 • 1

AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors

Paper • 2308.10848 • Published Aug 21, 2023 • 1

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs

Paper • 2307.16789 • Published Jul 31, 2023 • 101

Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models

Paper • 2403.08281 • Published Mar 13, 2024

Boosting Inference Efficiency: Unleashing the Power of Parameter-Shared Pre-trained Language Models

Paper • 2310.12818 • Published Oct 19, 2023

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Paper • 2411.02265 • Published Nov 4, 2024 • 25

authored a paper over 1 year ago

Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence

Paper • 2407.07061 • Published Jul 9, 2024 • 27