Collections
Discover the best community collections!
Collections including paper arxiv:2310.08864
-
π_0: A Vision-Language-Action Flow Model for General Robot Control
Paper • 2410.24164 • Published • 29 -
Magma: A Foundation Model for Multimodal AI Agents
Paper • 2502.13130 • Published • 58 -
Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Paper • 2310.08864 • Published • 2 -
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
Paper • 2502.13143 • Published • 31
-
SIMPL: A Simple and Efficient Multi-agent Motion Prediction Baseline for Autonomous Driving
Paper • 2402.02519 • Published -
Mixtral of Experts
Paper • 2401.04088 • Published • 161 -
Optimal Transport Aggregation for Visual Place Recognition
Paper • 2311.15937 • Published -
GOAT: GO to Any Thing
Paper • 2311.06430 • Published • 16
-
DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning
Paper • 2504.07128 • Published • 86 -
Byte Latent Transformer: Patches Scale Better Than Tokens
Paper • 2412.09871 • Published • 108 -
BitNet b1.58 2B4T Technical Report
Paper • 2504.12285 • Published • 75 -
FAST: Efficient Action Tokenization for Vision-Language-Action Models
Paper • 2501.09747 • Published • 27
-
ChatGPT Robotics
🤖62 -
openvla/openvla-7b
Image-Text-to-Text • 8B • Updated • 284k • 156 -
RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation
Paper • 2311.01455 • Published • 30 -
Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Paper • 2310.08864 • Published • 2
-
DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning
Paper • 2504.07128 • Published • 86 -
Byte Latent Transformer: Patches Scale Better Than Tokens
Paper • 2412.09871 • Published • 108 -
BitNet b1.58 2B4T Technical Report
Paper • 2504.12285 • Published • 75 -
FAST: Efficient Action Tokenization for Vision-Language-Action Models
Paper • 2501.09747 • Published • 27
-
π_0: A Vision-Language-Action Flow Model for General Robot Control
Paper • 2410.24164 • Published • 29 -
Magma: A Foundation Model for Multimodal AI Agents
Paper • 2502.13130 • Published • 58 -
Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Paper • 2310.08864 • Published • 2 -
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
Paper • 2502.13143 • Published • 31
-
ChatGPT Robotics
🤖62 -
openvla/openvla-7b
Image-Text-to-Text • 8B • Updated • 284k • 156 -
RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation
Paper • 2311.01455 • Published • 30 -
Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Paper • 2310.08864 • Published • 2
-
SIMPL: A Simple and Efficient Multi-agent Motion Prediction Baseline for Autonomous Driving
Paper • 2402.02519 • Published -
Mixtral of Experts
Paper • 2401.04088 • Published • 161 -
Optimal Transport Aggregation for Visual Place Recognition
Paper • 2311.15937 • Published -
GOAT: GO to Any Thing
Paper • 2311.06430 • Published • 16