1 12 88

Maojia Song

OrangeEye

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

LLMs Can't Handle Peer Pressure: Crumbling under Multi-Agent Social Interactions

liked a Space 13 days ago

HuggingFaceTB/smol-training-playbook

upvoted a paper about 1 month ago

Demystifying deep search: a holistic evaluation with hint-free multi-hop questions and factorised metrics

View all activity

Organizations

upvoted a paper 11 days ago

LLMs Can't Handle Peer Pressure: Crumbling under Multi-Agent Social Interactions

Paper • 2508.18321 • Published Aug 24 • 2

upvoted a paper about 1 month ago

Demystifying deep search: a holistic evaluation with hint-free multi-hop questions and factorised metrics

Paper • 2510.05137 • Published Oct 1 • 4

upvoted a paper about 2 months ago

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16 • 115

upvoted a paper 3 months ago

WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent

Paper • 2508.05748 • Published Aug 7 • 138

upvoted an article 5 months ago

Article

🦸🏻#1: Open-endedness and AI Agents – A Path from Generative to Creative AI?

Dec 25, 2024

•

upvoted a paper 7 months ago

Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling

Paper • 2504.13169 • Published Apr 17 • 39

upvoted an article 8 months ago

Article

The N Implementation Details of RLHF with PPO

Oct 24, 2023

•

upvoted 3 collections 9 months ago

upvoted a paper about 1 year ago

M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework

Paper • 2411.06176 • Published Nov 9, 2024 • 45

upvoted a collection over 1 year ago

Direct Preference Optimization Datasets

Collection

Datasets suitable for DPO based on having 'chosen', 'rejected', and 'prompt' columns. Created using librarian-bots/dataset-column-search-api • 5520 items • Updated Apr 6 • 7