
TianshengHuang

https://huangtiansheng.github.io/
  • huangtiansheng
  • tiansheng-huang-5661a8293

AI & ML interests

LLM safety

Recent Activity

upvoted a paper 23 days ago
AgentReview: Exploring Peer Review Dynamics with LLM Agents
upvoted a paper 23 days ago
Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents
upvoted a paper 25 days ago
Large Reasoning Models Learn Better Alignment from Flawed Thinking

Organizations

Georgia Institute of Technology

commented on 2 papers 9 months ago

Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation

Paper • 2501.17433 • Published Jan 29 • 10 • 3
