Yuqi Wang

Greenbean

AI & ML interests

None yet

Recent Activity

upvoted a paper 15 days ago

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

Paper • 2510.15870 • Published 18 days ago • 86

upvoted a paper 21 days ago

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Paper • 2510.09608 • Published 25 days ago • 49

upvoted a paper about 2 months ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8 • 78

upvoted 3 papers 3 months ago

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26 • 156

Step-Audio 2 Technical Report

Paper • 2507.16632 • Published Jul 22 • 72

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Paper • 2507.16812 • Published Jul 22 • 63

upvoted 4 papers 6 months ago

G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning

Paper • 2505.13426 • Published May 19 • 13

Thinkless: LLM Learns When to Think

Paper • 2505.13379 • Published May 19 • 50

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 152

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12 • 82

upvoted 8 papers 9 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 207

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 165

BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models

Paper • 2502.07346 • Published Feb 11 • 54

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published Feb 10 • 132

Teaching Language Models to Critique via Reinforcement Learning

Paper • 2502.03492 • Published Feb 5 • 24

Scaling Pre-training to One Hundred Billion Data for Vision Language Models

Paper • 2502.07617 • Published Feb 11 • 29

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Paper • 2502.06394 • Published Feb 10 • 89

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published Feb 3 • 68

upvoted 2 papers over 1 year ago

Wolf: Captioning Everything with a World Summarization Framework

Paper • 2407.18908 • Published Jul 26, 2024 • 32

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 104