1 9 3

Jixuan Chen

Mayome

https://chenjix.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper 22 days ago

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

authored a paper 22 days ago

MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents

authored a paper 22 days ago

OpenCUA: Open Foundations for Computer-Use Agents

View all activity

Organizations

authored 6 papers 22 days ago

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

Paper • 2411.07763 • Published Nov 12, 2024 • 2

upvoted a paper 3 months ago

TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents

Paper • 2602.07274 • Published Feb 6 • 210

upvoted 3 papers 5 months ago

MMGR: Multi-Modal Generative Reasoning

Paper • 2512.14691 • Published Dec 16, 2025 • 121

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published Dec 3, 2025 • 159

SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds

Paper • 2512.01078 • Published Nov 30, 2025 • 34

upvoted a paper 7 months ago

VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos

Paper • 2510.19488 • Published Oct 22, 2025 • 21

upvoted a paper 9 months ago

OpenCUA: Open Foundations for Computer-Use Agents

Paper • 2508.09123 • Published Aug 12, 2025 • 33

liked a Space 10 months ago

RISEBench Gallery

👀

A Gallery of Generation Results on RISEBench

updated a dataset 11 months ago

xlangai/ubuntu_osworld_file_cache

Updated about 13 hours ago • 1.11M • 15

upvoted a paper 12 months ago

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis

Paper • 2505.13227 • Published May 19, 2025 • 46

authored a paper 12 months ago

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis

Paper • 2505.13227 • Published May 19, 2025 • 46

liked a Space 12 months ago

Open LMM Subjective Leaderboard

🌎

VLMEvalKit Subjectivce Benchmark Results

authored a paper about 1 year ago

Wan: Open and Advanced Large-Scale Video Generative Models

Paper • 2503.20314 • Published Mar 26, 2025 • 61

upvoted a paper about 1 year ago

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Paper • 2503.19990 • Published Mar 25, 2025 • 35

liked a Space over 1 year ago

Open LMM Reasoning Leaderboard

🥇

A Leaderboard that demonstrates LMM reasoning capabilities

Jixuan Chen

AI & ML interests

Recent Activity

Organizations

Mayome's activity

RISEBench Gallery

Open LMM Subjective Leaderboard

Open LMM Reasoning Leaderboard