Anthony Peng's picture

3 9 6

Anthony Peng

AnthonyPeng

·

https://shengyun-peng.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 19 days ago

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

upvoted a paper 19 days ago

Agent Learning via Early Experience

upvoted a paper 22 days ago

Tree-based Dialogue Reinforced Policy Optimization for Red-Teaming Attacks

View all activity

Organizations

upvoted 2 papers 19 days ago

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

Paper • 2510.08240 • Published 19 days ago • 41

Agent Learning via Early Experience

Paper • 2510.08558 • Published 19 days ago • 249

upvoted 2 papers 22 days ago

Tree-based Dialogue Reinforced Policy Optimization for Red-Teaming Attacks

Paper • 2510.02286 • Published 26 days ago • 28

AgentReview: Exploring Peer Review Dynamics with LLM Agents

Paper • 2406.12708 • Published Jun 18, 2024 • 8

upvoted 3 papers 24 days ago

Large Reasoning Models Learn Better Alignment from Flawed Thinking

Paper • 2510.00938 • Published 27 days ago • 57

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 172

RobArch: Designing Robust Architectures against Adversarial Attacks

Paper • 2301.03110 • Published Jan 8, 2023 • 1

upvoted a paper 11 months ago

CompCap: Improving Multimodal Large Language Models with Composite Captions

Paper • 2412.05243 • Published Dec 6, 2024 • 20

upvoted a collection 12 months ago

Embedding Model Datasets

A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 70 items • Updated 6 days ago • 147