1 14

linjianman

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

MADD: Multi-Agent Drug Discovery Orchestra

upvoted a paper 20 days ago

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

upvoted a paper 20 days ago

OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models

View all activity

Organizations

None yet

upvoted a paper 17 days ago

MADD: Multi-Agent Drug Discovery Orchestra

Paper • 2511.08217 • Published 20 days ago • 55

upvoted 2 papers 20 days ago

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

Paper • 2511.04962 • Published 24 days ago • 52

OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models

Paper • 2506.03135 • Published Jun 3 • 39

upvoted 2 papers about 2 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 265

Revisiting Modeling and Evaluation Approaches in Speech Emotion Recognition: Considering Subjectivity of Annotators and Ambiguity of Emotions

Paper • 2510.05934 • Published Oct 7 • 2

upvoted a paper 2 months ago

VCRL: Variance-based Curriculum Reinforcement Learning for Large Language Models

Paper • 2509.19803 • Published Sep 24 • 118

upvoted a paper 3 months ago

Video-MTR: Reinforced Multi-Turn Reasoning for Long Video Understanding

Paper • 2508.20478 • Published Aug 28 • 17

upvoted 2 papers 4 months ago

nablaNABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published Jul 17 • 123

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 312

upvoted 2 papers 5 months ago

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

Paper • 2506.23918 • Published Jun 30 • 88

Geometry-Editable and Appearance-Preserving Object Compositon

Paper • 2505.20914 • Published May 27 • 6

upvoted an article 5 months ago

Article

Vision Language Models (Better, faster, stronger)

May 12

•

567

upvoted 2 papers 6 months ago

Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence

Paper • 2505.23747 • Published May 29 • 68

ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development

Paper • 2506.05010 • Published Jun 5 • 79

commented a paper about 1 year ago

ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion

Paper • 2403.18818 • Published Mar 27, 2024 • 28 •

linjianman

AI & ML interests

Recent Activity

Organizations

linjianman's activity

Vision Language Models (Better, faster, stronger)