Li Peiyan's picture

2 3

Li Peiyan

LPY

·

https://github.com/LPY1219

AI & ML interests

Embodied AI

Recent Activity

upvoted a paper 17 days ago

Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs

published a model 3 months ago

LPY/test

upvoted a paper 4 months ago

GR-3 Technical Report

View all activity

Organizations

authored 3 papers 5 months ago

Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models

Paper • 2412.14058 • Published Dec 18, 2024 • 1

MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

Paper • 2502.10391 • Published Feb 14 • 34

BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models

Paper • 2506.07961 • Published Jun 9 • 11