
Yuquan Xie

xieyuquan

xieyuquanxx

AI & ML interests

LLM, multi-modal

Recent Activity

authored a paper 5 days ago

Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy

authored a paper 5 days ago

Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts

authored a paper 5 days ago

Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills


Organizations

Collections 5


Papers 5

spaces 1

狼人杀Agent示例 (Werewolf Agent Example)

🚀

Create and compete with AI Agents in Werewolf and Spy games

models 3

datasets 3

xieyuquan/AppApksForAndroid

Updated 6 days ago • 3

xieyuquan/google_apps_step3000_historyimageFalse_uitars_actionspace

Viewer • Updated Mar 18 • 3k • 29 • 1

xieyuquan/google_apps_step3000_historyimageFalse

Viewer • Updated Mar 18 • 3k • 14

Collections 5

Secrets of RLHF in Large Language Models Part II: Reward Modeling

Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments

Understanding and Diagnosing Deep Reinforcement Learning

A Simple and Effective L_2 Norm-Based Strategy for KV Cache Compression

VoCo-LLaMA: Towards Vision Compression with Large Language Models

models 3

xieyuquan/Optimus3-Policy

xieyuquan/Optimus3-Task-Router

xieyuquan/Optimus3-32B-SFT
