QIUSI ZHAN's picture

2 4

QIUSI ZHAN

qiusizhan

·

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago

Removing RLHF Protections in GPT-4 via Fine-Tuning

authored a paper about 1 month ago

LLM Agents can Autonomously Hack Websites

authored a paper about 1 month ago

ConFiguRe: Exploring Discourse-level Chinese Figures of Speech

View all activity

Organizations

authored 6 papers about 1 month ago

Removing RLHF Protections in GPT-4 via Fine-Tuning

Paper • 2311.05553 • Published Nov 9, 2023

LLM Agents can Autonomously Hack Websites

Paper • 2402.06664 • Published Feb 6, 2024 • 3

ConFiguRe: Exploring Discourse-level Chinese Figures of Speech

Paper • 2209.07678 • Published Sep 16, 2022

Teams of LLM Agents can Exploit Zero-Day Vulnerabilities

Paper • 2406.01637 • Published Jun 2, 2024 • 2

InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents

Paper • 2403.02691 • Published Mar 5, 2024

Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning

Paper • 2510.27623 • Published Oct 31 • 12