Yanwei Li's picture

2 9 7

Yanwei Li

YanweiLi

·

AI & ML interests

None yet

Recent Activity

authored a paper 4 days ago

GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction

authored a paper 4 days ago

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

authored a paper 4 days ago

LISA: Reasoning Segmentation via Large Language Model

View all activity

Organizations

None yet

upvoted 3 papers 6 days ago

FARMER: Flow AutoRegressive Transformer over Pixels

Paper • 2510.23588 • Published 7 days ago • 55

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Paper • 2510.23607 • Published 7 days ago • 170

VITA-E: Natural Embodied Interaction with Concurrent Seeing, Hearing, Speaking, and Acting

Paper • 2510.21817 • Published 13 days ago • 41

upvoted a paper 6 months ago

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 152

upvoted a paper 11 months ago

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Paper • 2412.09501 • Published Dec 12, 2024 • 48

upvoted a paper about 1 year ago

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6, 2024 • 60

upvoted a paper over 1 year ago

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

Paper • 2403.18814 • Published Mar 27, 2024 • 47

upvoted 2 collections over 1 year ago

MGM-Data

Official data collection for the paper "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" • 2 items • Updated Apr 21, 2024 • 7

MGM

Official model collection for the paper "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" • 13 items • Updated May 3, 2024 • 47