GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction Paper • 2305.18752 • Published May 30, 2023
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models Paper • 2311.17043 • Published Nov 28, 2023
Focal Sparse Convolutional Networks for 3D Object Detection Paper • 2204.12463 • Published Apr 26, 2022
RL-GPT: Integrating Reinforcement Learning and Code-as-policy Paper • 2402.19299 • Published Feb 29, 2024
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models Paper • 2403.18814 • Published Mar 27, 2024
Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models Paper • 2505.24164 • Published May 30, 2025
DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World Paper • 2506.24102 • Published Jun 30, 2025
How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective Paper • 2509.18905 • Published Sep 23, 2025
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs Paper • 2510.18876 • Published Oct 2025
Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations Paper • 2510.23607 • Published Oct 2025
VITA-E: Natural Embodied Interaction with Concurrent Seeing, Hearing, Speaking, and Acting Paper • 2510.21817 • Published Oct 2025
Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding Paper • 2504.10465 • Published Apr 14, 2025
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency Paper • 2502.09621 • Published Feb 13, 2025
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition Paper • 2412.09501 • Published Dec 12, 2024