Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
macpaw-research 's Collections
NER Datasets
Agents

Agents

updated Sep 17

All about agents including models, datasets, evals

Upvote
-

  • Survey on Evaluation of LLM-based Agents

    Paper • 2503.16416 • Published Mar 20 • 95

  • Qwen2.5-Omni Technical Report

    Paper • 2503.20215 • Published Mar 26 • 166

  • Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

    Paper • 2504.01990 • Published Mar 31 • 300

  • LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

    Paper • 2403.13372 • Published Mar 20, 2024 • 159

  • Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

    Paper • 2504.13837 • Published Apr 18 • 135

  • Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory

    Paper • 2504.19413 • Published Apr 28 • 20

  • The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity

    Paper • 2506.06941 • Published Jun 7 • 15

  • Small Language Models are the Future of Agentic AI

    Paper • 2506.02153 • Published Jun 2 • 21

  • MLE-STAR: Machine Learning Engineering Agent via Search and Targeted Refinement

    Paper • 2506.15692 • Published May 27 • 2

  • τ-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains

    Paper • 2406.12045 • Published Jun 17, 2024 • 9
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs