Peking University

university

https://www.pku.edu.cn/

Activity Feed Request to join this org

AI & ML interests

nlu

Recent Activity

Skywalker0410 submitted a paper 1 day ago

AffordanceVLA: A Vision-Language-Action Model Empowering Action Generation through Affordance-Aware Understanding

jianzongwu submitted a paper 2 days ago

LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing

yunyangge submitted a paper 10 days ago

OSP-Next: Efficient High-Quality Video Generation with Sparse Sequence Parallelism, HiF8 Quantization, and Reinforcement Learning

View all activity

Papers

AffordanceVLA: A Vision-Language-Action Model Empowering Action Generation through Affordance-Aware Understanding

LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing

View all Papers

Skywalker0410

submitted a paper to Daily Papers 1 day ago

AffordanceVLA: A Vision-Language-Action Model Empowering Action Generation through Affordance-Aware Understanding

Paper • 2606.06155 • Published 3 days ago • 8

jianzongwu

submitted a paper to Daily Papers 2 days ago

LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing

Paper • 2606.06042 • Published 3 days ago • 21

yunyangge

submitted a paper to Daily Papers 10 days ago

OSP-Next: Efficient High-Quality Video Generation with Sparse Sequence Parallelism, HiF8 Quantization, and Reinforcement Learning

Paper • 2605.28691 • Published 11 days ago • 24

sjj118

submitted a paper to Daily Papers 18 days ago

RT-Splatting: Joint Reflection-Transmission Modeling with Gaussian Splatting

Paper • 2605.18263 • Published 20 days ago • 9

yfdeng10

submitted a paper to Daily Papers 19 days ago

StableVLA: Towards Robust Vision-Language-Action Models without Extra Data

Paper • 2605.18287 • Published 20 days ago • 15

fxmeng

submitted a paper to Daily Papers 20 days ago

GQLA: Group-Query Latent Attention for Hardware-Adaptive Large Language Model Decoding

Paper • 2605.15250 • Published 24 days ago • 13

IvanTang

submitted a paper to Daily Papers 23 days ago

VGGT-Edit: Feed-forward Native 3D Scene Editing with Residual Field Prediction

Paper • 2605.15186 • Published 24 days ago • 26

SteveZeyuZhang

submitted a paper to Daily Papers 24 days ago

PresentAgent-2: Towards Generalist Multimodal Presentation Agents

Paper • 2605.11363 • Published 26 days ago • 8

SteveZeyuZhang

submitted a paper to Daily Papers 25 days ago

Lite3R: A Model-Agnostic Framework for Efficient Feed-Forward 3D Reconstruction

Paper • 2605.11354 • Published 26 days ago • 1

N2048M

submitted a paper to Daily Papers about 1 month ago

Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models

Paper • 2604.26951 • Published Apr 29 • 48

SteveZeyuZhang

submitted a paper to Daily Papers about 2 months ago

UniMesh: Unifying 3D Mesh Understanding and Generation

Paper • 2604.17472 • Published Apr 19 • 11

rajkumarrawal

submitted a paper to Daily Papers about 2 months ago

Context-Value-Action Architecture for Value-Driven Large Language Model Agents

Paper • 2604.05939 • Published Apr 7 • 10

zbhpku

submitted a paper to Daily Papers 2 months ago

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published Mar 27 • 366

xishushu

submitted a paper to Daily Papers 2 months ago

Know3D: Prompting 3D Generation with Knowledge from Vision-Language Models

Paper • 2603.22782 • Published Mar 24 • 20

zyh200727

submitted a paper to Daily Papers 2 months ago

PEARL: Personalized Streaming Video Understanding Model

Paper • 2603.20422 • Published Mar 20 • 40

lyl010221-pku

submitted a paper to Daily Papers 3 months ago

Look Before Acting: Enhancing Vision Foundation Representations for Vision-Language-Action Models

Paper • 2603.15618 • Published Mar 16 • 21

SteveZeyuZhang

submitted a paper to Daily Papers 3 months ago

MWM: Mobile World Models for Action-Conditioned Consistent Prediction

Paper • 2603.07799 • Published Mar 8

SteveZeyuZhang

submitted 3 papers to Daily Papers 4 months ago

StereoAdapter-2: Globally Structure-Consistent Underwater Stereo Depth Estimation

Paper • 2602.16915 • Published Feb 18

MoRL: Reinforced Reasoning for Unified Motion Understanding and Generation

Paper • 2602.14534 • Published Feb 16 • 3

Light4D: Training-Free Extreme Viewpoint 4D Video Relighting

Paper • 2602.11769 • Published Feb 12 • 2

AI & ML interests

Recent Activity

Papers

Team members 1

PekingUniversity's activity