Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Ruisheng Cao's picture
21 3

Ruisheng Cao

rshcao
Gargaz's profile picture 21world's profile picture SteveSHEN's profile picture
·
https://rhythmcao.github.io
  • RuishengC49326
  • rhythmcao

AI & ML interests

NLP, multi-modal agents, semantic-parsing, text-to-SQL, code-generation, dialogue-system

Recent Activity

updated a dataset 25 days ago
xlangai/spider2v-trajs
new activity 2 months ago
SWE-bench-Live/SWE-bench-Live:Missing Docker Image
new activity 3 months ago
Qwen/Qwen3-Coder-480B-A35B-Instruct:Update chat_template and tool_parser
View all activity

Organizations

XLang NLP Lab's profile picture OpenDFM's profile picture SJTU Cross Media Language Intelligence Lab's profile picture

authored 5 papers over 1 year ago

Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding

Paper • 2402.18262 • Published Feb 28, 2024

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Paper • 2407.10956 • Published Jul 15, 2024 • 7

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Paper • 2404.07972 • Published Apr 11, 2024 • 50

Mobile-Env: An Evaluation Platform and Benchmark for Interactive Agents in LLM Era

Paper • 2305.08144 • Published May 14, 2023 • 1

CSS: A Large-scale Cross-schema Chinese Text-to-SQL Medical Dataset

Paper • 2305.15891 • Published May 25, 2023 • 1
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs