Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents Paper • 2509.06917 • Published Sep 8 • 41 • 7
Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning Paper • 2508.19828 • Published Aug 27 • 6 • 1
Clio: Privacy-Preserving Insights into Real-World AI Use Paper • 2412.13678 • Published Dec 18, 2024 • 1 • 1
Conformal Prediction of Classifiers with Many Classes based on Noisy Labels Paper • 2501.12749 • Published Jan 22 • 1 • 1
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs Paper • 2507.05687 • Published Jul 8 • 27 • 2
Dynamic Chunking for End-to-End Hierarchical Sequence Modeling Paper • 2507.07955 • Published Jul 10 • 25 • 4
Towards Effective Extraction and Evaluation of Factual Claims Paper • 2502.10855 • Published Feb 15 • 3 • 2
A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce Paper • 2504.11343 • Published Apr 15 • 19 • 6
Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model Paper • 2504.05594 • Published Apr 8 • 11 • 3
Manify: A Python Library for Learning Non-Euclidean Representations Paper • 2503.09576 • Published Mar 12 • 1 • 1
Adaptive Graph of Thoughts: Test-Time Adaptive Reasoning Unifying Chain, Tree, and Graph Structures Paper • 2502.05078 • Published Feb 7 • 3 • 1
CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models Paper • 2502.16614 • Published Feb 23 • 27 • 3
CODESIM: Multi-Agent Code Generation and Problem Solving through Simulation-Driven Planning and Debugging Paper • 2502.05664 • Published Feb 8 • 24 • 3
SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator Paper • 2412.12094 • Published Dec 16, 2024 • 11 • 5
Stream of Search (SoS): Learning to Search in Language Paper • 2404.03683 • Published Apr 1, 2024 • 31 • 1