adamsecada
's Collections
Favorites
updated
Bootstrapping Language Models with DPO Implicit Rewards
Paper
•
2406.09760
•
Published
•
40
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code
Intelligence
Paper
•
2406.11931
•
Published
•
65
Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs
Paper
•
2406.14544
•
Published
•
35
Instruction Pre-Training: Language Models are Supervised Multitask
Learners
Paper
•
2406.14491
•
Published
•
95
Mixture-of-Agents Enhances Large Language Model Capabilities
Paper
•
2406.04692
•
Published
•
59
CRAG -- Comprehensive RAG Benchmark
Paper
•
2406.04744
•
Published
•
48
Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for
Large Language Models
Paper
•
2406.12644
•
Published
•
5
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs
with Nothing
Paper
•
2406.08464
•
Published
•
71
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs
Paper
•
2404.16873
•
Published
•
29
LLM Agents can Autonomously Hack Websites
Paper
•
2402.06664
•
Published
•
3
Negotiating with LLMS: Prompt Hacks, Skill Gaps, and Reasoning Deficits
Paper
•
2312.03720
•
Published
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of
LLMs through a Global Scale Prompt Hacking Competition
Paper
•
2311.16119
•
Published
•
2
On the Exploitability of Instruction Tuning
Paper
•
2306.17194
•
Published
•
9
Teams of LLM Agents can Exploit Zero-Day Vulnerabilities
Paper
•
2406.01637
•
Published
•
2
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems
Paper
•
2407.01370
•
Published
•
89
Imagine yourself: Tuning-Free Personalized Image Generation
Paper
•
2409.13346
•
Published
•
70
Training Language Models to Self-Correct via Reinforcement Learning
Paper
•
2409.12917
•
Published
•
140
LLMs + Persona-Plug = Personalized LLMs
Paper
•
2409.11901
•
Published
•
34
Seed-Music: A Unified Framework for High Quality and Controlled Music
Generation
Paper
•
2409.09214
•
Published
•
53
Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise
Paper
•
2410.03017
•
Published
•
29
Unbounded: A Generative Infinite Game of Character Life Simulation
Paper
•
2410.18975
•
Published
•
37
A Survey of Small Language Models
Paper
•
2410.20011
•
Published
•
46
SocialGPT: Prompting LLMs for Social Relation Reasoning via Greedy
Segment Optimization
Paper
•
2410.21411
•
Published
•
19
The Danger of Overthinking: Examining the Reasoning-Action Dilemma in
Agentic Tasks
Paper
•
2502.08235
•
Published
•
58
MAPS: A Multi-Agent Framework Based on Big Seven Personality and
Socratic Guidance for Multimodal Scientific Problem Solving
Paper
•
2503.16905
•
Published
•
54
Efficient Agents: Building Effective Agents While Reducing Cost
Paper
•
2508.02694
•
Published
•
85
ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning
in LLMs
Paper
•
2506.15211
•
Published
•
37
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent
Distillation and Agentic RL
Paper
•
2508.13167
•
Published
•
127
Prompt Orchestration Markup Language
Paper
•
2508.13948
•
Published
•
48
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic
Data and Scalable Reinforcement Learning
Paper
•
2509.13305
•
Published
•
88
Less is More: Recursive Reasoning with Tiny Networks
Paper
•
2510.04871
•
Published
•
451
Agent Learning via Early Experience
Paper
•
2510.08558
•
Published
•
246