Agents - a macpaw-research Collection

macpaw-research 's Collections

Agents

Agents

updated Sep 17

All about agents including models, datasets, evals

Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published Mar 20 • 95
Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 166
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 300
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20, 2024 • 159
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 135
Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory

Paper • 2504.19413 • Published Apr 28 • 20
The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity

Paper • 2506.06941 • Published Jun 7 • 15
Small Language Models are the Future of Agentic AI

Paper • 2506.02153 • Published Jun 2 • 21
MLE-STAR: Machine Learning Engineering Agent via Search and Targeted Refinement

Paper • 2506.15692 • Published May 27 • 2
τ-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains

Paper • 2406.12045 • Published Jun 17, 2024 • 9