Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2507.02592

Useful Resources

about 1 month ago

WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization

Paper • 2507.15061 • Published Jul 20 • 59
WebDancer: Towards Autonomous Information Seeking Agency

Paper • 2505.22648 • Published May 28 • 33
ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization

Paper • 2509.13313 • Published Sep 16 • 77
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning

Paper • 2509.13305 • Published Sep 16 • 88

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Paper • 2504.12626 • Published Apr 17 • 51
Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 306
Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 258
DINOv3

Paper • 2508.10104 • Published Aug 13 • 274

Search indexing

WebDancer: Towards Autonomous Information Seeking Agency

Paper • 2505.22648 • Published May 28 • 33
WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 120

WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 120

Agentic & Multi-turn Chat

Literature for evaluating agents and multi-turn chat. Blogs: https://arize.com/blog/prompt-learning-using-english-feedback-to-optimize-llm-systems

CodeACT: Code Adaptive Compute-efficient Tuning Framework for Code LLMs

Paper • 2408.02193 • Published Aug 5, 2024 • 1
google/frames-benchmark

Viewer • Updated Oct 15, 2024 • 824 • 11.6k • 229
gaia-benchmark/GAIA

Viewer • Updated 1 day ago • 932 • 6.54k • 471
callanwu/WebWalkerQA

Viewer • Updated Sep 8 • 14.3k • 5.65k • 44

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30 • 274
Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 262
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 236
A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 257

WebAgent for Information Seeking built by Tongyi Lab

WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization

Paper • 2507.15061 • Published Jul 20 • 59
WebWalker: Benchmarking LLMs in Web Traversal

Paper • 2501.07572 • Published Jan 13 • 22
WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 120
WebDancer: Towards Autonomous Information Seeking Agency

Paper • 2505.22648 • Published May 28 • 33

WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 120

RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents

Paper • 2507.03112 • Published Jul 3 • 31
A Survey on Vision-Language-Action Models: An Action Tokenization Perspective

Paper • 2507.01925 • Published Jul 2 • 38
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens

Paper • 2506.17218 • Published Jun 20 • 29
WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 120

VideoDeepResearch: Long Video Understanding With Agentic Tool Using

Paper • 2506.10821 • Published Jun 12 • 19
Jan-nano Technical Report

Paper • 2506.22760 • Published Jun 28 • 9
MMSearch-R1: Incentivizing LMMs to Search

Paper • 2506.20670 • Published Jun 25 • 64
WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 120

Useful Resources

about 1 month ago

WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization

Paper • 2507.15061 • Published Jul 20 • 59
WebDancer: Towards Autonomous Information Seeking Agency

Paper • 2505.22648 • Published May 28 • 33
ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization

Paper • 2509.13313 • Published Sep 16 • 77
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning

Paper • 2509.13305 • Published Sep 16 • 88

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30 • 274
Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 262
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 236
A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 257

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Paper • 2504.12626 • Published Apr 17 • 51
Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 306
Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 258
DINOv3

Paper • 2508.10104 • Published Aug 13 • 274

WebAgent for Information Seeking built by Tongyi Lab

WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization

Paper • 2507.15061 • Published Jul 20 • 59
WebWalker: Benchmarking LLMs in Web Traversal

Paper • 2501.07572 • Published Jan 13 • 22
WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 120
WebDancer: Towards Autonomous Information Seeking Agency

Paper • 2505.22648 • Published May 28 • 33

Search indexing

WebDancer: Towards Autonomous Information Seeking Agency

Paper • 2505.22648 • Published May 28 • 33
WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 120

WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 120

WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 120

RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents

Paper • 2507.03112 • Published Jul 3 • 31
A Survey on Vision-Language-Action Models: An Action Tokenization Perspective

Paper • 2507.01925 • Published Jul 2 • 38
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens

Paper • 2506.17218 • Published Jun 20 • 29
WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 120

Agentic & Multi-turn Chat

Literature for evaluating agents and multi-turn chat. Blogs: https://arize.com/blog/prompt-learning-using-english-feedback-to-optimize-llm-systems

CodeACT: Code Adaptive Compute-efficient Tuning Framework for Code LLMs

Paper • 2408.02193 • Published Aug 5, 2024 • 1
google/frames-benchmark

Viewer • Updated Oct 15, 2024 • 824 • 11.6k • 229
gaia-benchmark/GAIA

Viewer • Updated 1 day ago • 932 • 6.54k • 471
callanwu/WebWalkerQA

Viewer • Updated Sep 8 • 14.3k • 5.65k • 44

VideoDeepResearch: Long Video Understanding With Agentic Tool Using

Paper • 2506.10821 • Published Jun 12 • 19
Jan-nano Technical Report

Paper • 2506.22760 • Published Jun 28 • 9
MMSearch-R1: Incentivizing LMMs to Search

Paper • 2506.20670 • Published Jun 25 • 64
WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 120

Previous
1
2
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs