- 
	
	
	
Evolving Deeper LLM Thinking
Paper • 2501.09891 • Published • 115 - 
	
	
	
PaSa: An LLM Agent for Comprehensive Academic Paper Search
Paper • 2501.10120 • Published • 52 - 
	
	
	
Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong
Paper • 2501.09775 • Published • 33 - 
	
	
	
ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario
Paper • 2501.10132 • Published • 22 
Collections
Discover the best community collections!
Collections including paper arxiv:2501.10893 
						
					
				- 
	
	
	
Agentless: Demystifying LLM-based Software Engineering Agents
Paper • 2407.01489 • Published • 64 - 
	
	
	
Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
Paper • 2410.12361 • Published - 
	
	
	
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
Paper • 2501.10893 • Published • 26 
- 
	
	
	
Video Creation by Demonstration
Paper • 2412.09551 • Published • 9 - 
	
	
	
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
Paper • 2412.07589 • Published • 48 - 
	
	
	
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation
Paper • 2412.06531 • Published • 72 - 
	
	
	
APOLLO: SGD-like Memory, AdamW-level Performance
Paper • 2412.05270 • Published • 38 
- 
	
	
	
VILA^2: VILA Augmented VILA
Paper • 2407.17453 • Published • 41 - 
	
	
	
Octopus v4: Graph of language models
Paper • 2404.19296 • Published • 118 - 
	
	
	
Octo-planner: On-device Language Model for Planner-Action Agents
Paper • 2406.18082 • Published • 48 - 
	
	
	
Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models
Paper • 2408.15518 • Published • 42 
- 
	
	
	
Agentless: Demystifying LLM-based Software Engineering Agents
Paper • 2407.01489 • Published • 64 - 
	
	
	
Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
Paper • 2410.12361 • Published - 
	
	
	
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
Paper • 2501.10893 • Published • 26 
- 
	
	
	
Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks
Paper • 2501.11733 • Published • 28 - 
	
	
	
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
Paper • 2501.10893 • Published • 26 - 
	
	
	
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
Paper • 2510.04618 • Published • 113 
- 
	
	
	
Agent Laboratory: Using LLM Agents as Research Assistants
Paper • 2501.04227 • Published • 94 - 
	
	
	
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper • 2501.05366 • Published • 102 - 
	
	
	
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
Paper • 2501.11425 • Published • 108 - 
	
	
	
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
Paper • 2501.10893 • Published • 26 
- 
	
	
	
LinFusion: 1 GPU, 1 Minute, 16K Image
Paper • 2409.02097 • Published • 34 - 
	
	
	
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion
Paper • 2409.11406 • Published • 27 - 
	
	
	
Diffusion Models Are Real-Time Game Engines
Paper • 2408.14837 • Published • 126 - 
	
	
	
Segment Anything with Multiple Modalities
Paper • 2408.09085 • Published • 22 
- 
	
	
	
End-to-End Goal-Driven Web Navigation
Paper • 1602.02261 • Published - 
	
	
	
Learning Language Games through Interaction
Paper • 1606.02447 • Published - 
	
	
	
Naturalizing a Programming Language via Interactive Learning
Paper • 1704.06956 • Published - 
	
	
	
Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration
Paper • 1802.08802 • Published • 1 
- 
	
	
	
Evolving Deeper LLM Thinking
Paper • 2501.09891 • Published • 115 - 
	
	
	
PaSa: An LLM Agent for Comprehensive Academic Paper Search
Paper • 2501.10120 • Published • 52 - 
	
	
	
Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong
Paper • 2501.09775 • Published • 33 - 
	
	
	
ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario
Paper • 2501.10132 • Published • 22 
- 
	
	
	
Agentless: Demystifying LLM-based Software Engineering Agents
Paper • 2407.01489 • Published • 64 - 
	
	
	
Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
Paper • 2410.12361 • Published - 
	
	
	
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
Paper • 2501.10893 • Published • 26 
- 
	
	
	
Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks
Paper • 2501.11733 • Published • 28 - 
	
	
	
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
Paper • 2501.10893 • Published • 26 - 
	
	
	
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
Paper • 2510.04618 • Published • 113 
- 
	
	
	
Agentless: Demystifying LLM-based Software Engineering Agents
Paper • 2407.01489 • Published • 64 - 
	
	
	
Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
Paper • 2410.12361 • Published - 
	
	
	
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
Paper • 2501.10893 • Published • 26 
- 
	
	
	
Agent Laboratory: Using LLM Agents as Research Assistants
Paper • 2501.04227 • Published • 94 - 
	
	
	
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper • 2501.05366 • Published • 102 - 
	
	
	
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
Paper • 2501.11425 • Published • 108 - 
	
	
	
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
Paper • 2501.10893 • Published • 26 
- 
	
	
	
Video Creation by Demonstration
Paper • 2412.09551 • Published • 9 - 
	
	
	
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
Paper • 2412.07589 • Published • 48 - 
	
	
	
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation
Paper • 2412.06531 • Published • 72 - 
	
	
	
APOLLO: SGD-like Memory, AdamW-level Performance
Paper • 2412.05270 • Published • 38 
- 
	
	
	
LinFusion: 1 GPU, 1 Minute, 16K Image
Paper • 2409.02097 • Published • 34 - 
	
	
	
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion
Paper • 2409.11406 • Published • 27 - 
	
	
	
Diffusion Models Are Real-Time Game Engines
Paper • 2408.14837 • Published • 126 - 
	
	
	
Segment Anything with Multiple Modalities
Paper • 2408.09085 • Published • 22 
- 
	
	
	
VILA^2: VILA Augmented VILA
Paper • 2407.17453 • Published • 41 - 
	
	
	
Octopus v4: Graph of language models
Paper • 2404.19296 • Published • 118 - 
	
	
	
Octo-planner: On-device Language Model for Planner-Action Agents
Paper • 2406.18082 • Published • 48 - 
	
	
	
Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models
Paper • 2408.15518 • Published • 42 
- 
	
	
	
End-to-End Goal-Driven Web Navigation
Paper • 1602.02261 • Published - 
	
	
	
Learning Language Games through Interaction
Paper • 1606.02447 • Published - 
	
	
	
Naturalizing a Programming Language via Interactive Learning
Paper • 1704.06956 • Published - 
	
	
	
Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration
Paper • 1802.08802 • Published • 1