Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
QIUSI ZHAN's picture
2 4

QIUSI ZHAN

qiusizhan
·

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago
Removing RLHF Protections in GPT-4 via Fine-Tuning
authored a paper about 1 month ago
LLM Agents can Autonomously Hack Websites
authored a paper about 1 month ago
ConFiguRe: Exploring Discourse-level Chinese Figures of Speech
View all activity

Organizations

University of Illinois at Urbana-Champaign's profile picture uiuc-backdoor-attack's profile picture

authored 6 papers about 1 month ago

Removing RLHF Protections in GPT-4 via Fine-Tuning

Paper • 2311.05553 • Published Nov 9, 2023

LLM Agents can Autonomously Hack Websites

Paper • 2402.06664 • Published Feb 6, 2024 • 3

ConFiguRe: Exploring Discourse-level Chinese Figures of Speech

Paper • 2209.07678 • Published Sep 16, 2022

Teams of LLM Agents can Exploit Zero-Day Vulnerabilities

Paper • 2406.01637 • Published Jun 2, 2024 • 2

InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents

Paper • 2403.02691 • Published Mar 5, 2024

Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning

Paper • 2510.27623 • Published Oct 31 • 12
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs