Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Wenyuan Zhang's picture
1 3 1

Wenyuan Zhang

WYRipple
·
https://scholar.google.com/citations?user=5weUrvgAAAAJ&hl=zh-CN
  • WYRipple

AI & ML interests

LLM Social Agent, Role-playing-LLM, Dialogue

Organizations

None yet

WYRipple 's collections 2

SOTOPIA-Ω Checkpoints
ACL 2025 (main) paper -- SOTOPIA-Ω: Dynamic Strategy Injection Learning and Social Instruction Following Evaluation for Social Agents.
  • WYRipple/sotopia-omega_qwen2.5-7B-DSI

    Updated Feb 19
  • WYRipple/sotopia-omega_mistral-7B-DSI

    Updated Feb 19
  • WYRipple/sotopia-omega_llama3_8B_DSI

    Updated Feb 19 • 1
Benchmark paper
  • S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models

    Paper • 2504.10368 • Published Apr 14 • 21
  • Revealing the Challenge of Detecting Character Knowledge Errors in LLM Role-Playing

    Paper • 2409.11726 • Published Sep 18, 2024
SOTOPIA-Ω Checkpoints
ACL 2025 (main) paper -- SOTOPIA-Ω: Dynamic Strategy Injection Learning and Social Instruction Following Evaluation for Social Agents.
  • WYRipple/sotopia-omega_qwen2.5-7B-DSI

    Updated Feb 19
  • WYRipple/sotopia-omega_mistral-7B-DSI

    Updated Feb 19
  • WYRipple/sotopia-omega_llama3_8B_DSI

    Updated Feb 19 • 1
Benchmark paper
  • S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models

    Paper • 2504.10368 • Published Apr 14 • 21
  • Revealing the Challenge of Detecting Character Knowledge Errors in LLM Role-Playing

    Paper • 2409.11726 • Published Sep 18, 2024
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs