- CrossWordBench: Evaluating the Reasoning Capabilities of LLMs and LVLMs with Controllable Puzzle Generation (arXiv:2504.00043, published Mar 30)
- Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks (arXiv:2401.17263, published Jan 30, 2024)
- GUARD: Role-playing to Generate Natural-language Jailbreakings to Test Guideline Adherence of Large Language Models (arXiv:2402.03299, published Feb 5, 2024)