Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards Paper • 2601.06021 • Published 8 days ago • 39
zai-org/AutoGLM-Phone-9B-Multilingual Image-Text-to-Text • 934k • Updated 10 days ago • 11.9k • • 219
Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR Paper • 2509.02522 • Published Sep 2, 2025 • 25
IPBench: Benchmarking the Knowledge of Large Language Models in Intellectual Property Paper • 2504.15524 • Published Apr 22, 2025 • 3
VCM: Vision Concept Modeling Based on Implicit Contrastive Learning with Vision-Language Instruction Fine-Tuning Paper • 2504.19627 • Published Apr 28, 2025
CLaSp: In-Context Layer Skip for Self-Speculative Decoding Paper • 2505.24196 • Published May 30, 2025 • 12
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8, 2025 • 199
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8, 2025 • 199