-
Endless Terminals: Scaling RL Environments for Terminal Agents
Paper • 2601.16443 • Published • 19 -
Linear representations in language models can change dramatically over a conversation
Paper • 2601.20834 • Published • 21 -
Scaling Embeddings Outperforms Scaling Experts in Language Models
Paper • 2601.21204 • Published • 104 -
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability
Paper • 2601.18778 • Published • 43
Collections
Discover the best community collections!
Collections including paper arxiv:2601.20975
-
Open Deep Search: Democratizing Search with Open-source Reasoning Agents
Paper • 2503.20201 • Published • 48 -
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
Paper • 2503.19470 • Published • 19 -
Spacer: Towards Engineered Scientific Inspiration
Paper • 2508.17661 • Published • 32 -
DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks
Paper • 2509.01396 • Published • 58
-
LongCat-Flash-Thinking-2601 Technical Report
Paper • 2601.16725 • Published • 181 -
DeepSeek-OCR 2: Visual Causal Flow
Paper • 2601.20552 • Published • 70 -
Linear representations in language models can change dramatically over a conversation
Paper • 2601.20834 • Published • 21 -
BMAM: Brain-inspired Multi-Agent Memory Framework
Paper • 2601.20465 • Published • 5
-
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 70 -
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
Paper • 2502.06060 • Published • 37 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195 -
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100
-
Endless Terminals: Scaling RL Environments for Terminal Agents
Paper • 2601.16443 • Published • 19 -
Linear representations in language models can change dramatically over a conversation
Paper • 2601.20834 • Published • 21 -
Scaling Embeddings Outperforms Scaling Experts in Language Models
Paper • 2601.21204 • Published • 104 -
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability
Paper • 2601.18778 • Published • 43
-
LongCat-Flash-Thinking-2601 Technical Report
Paper • 2601.16725 • Published • 181 -
DeepSeek-OCR 2: Visual Causal Flow
Paper • 2601.20552 • Published • 70 -
Linear representations in language models can change dramatically over a conversation
Paper • 2601.20834 • Published • 21 -
BMAM: Brain-inspired Multi-Agent Memory Framework
Paper • 2601.20465 • Published • 5
-
Open Deep Search: Democratizing Search with Open-source Reasoning Agents
Paper • 2503.20201 • Published • 48 -
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
Paper • 2503.19470 • Published • 19 -
Spacer: Towards Engineered Scientific Inspiration
Paper • 2508.17661 • Published • 32 -
DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks
Paper • 2509.01396 • Published • 58
-
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 70 -
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
Paper • 2502.06060 • Published • 37 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195 -
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100