Metal-Sci: A Scientific Compute Benchmark for Evolutionary LLM Kernel Search on Apple Silicon Paper • 2605.09708 • Published 15 days ago • 5
Metal-Sci: A Scientific Compute Benchmark for Evolutionary LLM Kernel Search on Apple Silicon Paper • 2605.09708 • Published 15 days ago • 5
Discovering Agentic Safety Specifications from 1-Bit Danger Signals Paper • 2604.23210 • Published 30 days ago • 4
Discovering Agentic Safety Specifications from 1-Bit Danger Signals Paper • 2604.23210 • Published 30 days ago • 4
STEM Agent: A Self-Adapting, Tool-Enabled, Extensible Architecture for Multi-Protocol AI Agent Systems Paper • 2603.22359 • Published Mar 22 • 4
Cooperation and Exploitation in LLM Policy Synthesis for Sequential Social Dilemmas Paper • 2603.19453 • Published Mar 19 • 6
Cooperation and Exploitation in LLM Policy Synthesis for Sequential Social Dilemmas Paper • 2603.19453 • Published Mar 19 • 6
Multimodal Models 🔀 Collection A collection of multimodal models developed by the Komorebi AI team • 3 items • Updated Sep 23, 2025 • 2
Specification Self-Correction: Mitigating In-Context Reward Hacking Through Test-Time Refinement Paper • 2507.18742 • Published Jul 24, 2025 • 6
Merging Improves Self-Critique Against Jailbreak Attacks Paper • 2406.07188 • Published Jun 11, 2024 • 4
Configurable Safety Tuning of Language Models with Synthetic Preference Data Paper • 2404.00495 • Published Mar 30, 2024 • 2
Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs Paper • 2402.08005 • Published Feb 12, 2024 • 1