Guiding a Diffusion Transformer with the Internal Dynamics of Itself • Paper • 2512.24176 • Published 3 days ago • 7 • 3
Geometry-Aware Optimization for Respiratory Sound Classification: Enhancing Sensitivity with SAM-Optimized Audio Spectrogram Transformers • Paper • 2512.22564 • Published 6 days ago • 5 • 3
SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time • Paper • 2512.25075 • Published 2 days ago • 8 • 2
Scaling Open-Ended Reasoning to Predict the Future • Paper • 2512.25070 • Published 2 days ago • 13 • 3
JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation • Paper • 2512.22905 • Published 5 days ago • 10 • 3
GaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction • Paper • 2512.25073 • Published 2 days ago • 24 • 3
Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process • Paper • 2512.23988 • Published 4 days ago • 10 • 3
Valori: A Deterministic Memory Substrate for AI Systems • Paper • 2512.22280 • Published 8 days ago • 3 • 3
Factorized Learning for Temporally Grounded Video-Language Models • Paper • 2512.24097 • Published 3 days ago • 5 • 3
Figure It Out: Improving the Frontier of Reasoning with Active Visual Thinking • Paper • 2512.24297 • Published 3 days ago • 5 • 2
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models • Paper • 2512.24618 • Published 3 days ago • 57 • 3
Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems • Paper • 2512.24385 • Published 3 days ago • 7 • 3
A unified framework for detecting point and collective anomalies in operating system logs via collaborative transformers • Paper • 2512.23380 • Published 4 days ago • 22 • 3
PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation • Paper • 2512.24551 • Published 3 days ago • 14 • 4
BEDA: Belief Estimation as Probabilistic Constraints for Performing Strategic Dialogue Acts • Paper • 2512.24885 • Published 2 days ago • 4 • 3
AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents • Paper • 2512.23343 • Published 4 days ago • 19 • 3
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem • Paper • 2512.24873 • Published 2 days ago • 36 • 3