Suzie Oh's picture

Suzie Oh

ohsuz

·

ohsuz

AI & ML interests

None yet

Recent Activity

liked a dataset about 1 hour ago

ibm-research/ToolRM-train-data

liked a Space 2 days ago

huggingface-KREW/Ko-AgentBench

liked a dataset 10 days ago

ibm-research/fc-reward-bench

View all activity

Organizations

upvoted a paper 17 days ago

ToolRL: Reward is All Tool Learning Needs

Paper • 2504.13958 • Published Apr 16 • 48

upvoted a paper 18 days ago

Demystifying Reinforcement Learning in Agentic Reasoning

Paper • 2510.11701 • Published 19 days ago • 31

upvoted 2 collections 19 days ago

— UI is a good thing 💅 —

cool spaces with a cool UI, what could be better? • 5 items • Updated May 5 • 25

[NeurIPS 2025] RPC Resources

Sampled Reasoning Paths for NeurIPS 2025 Paper: A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning • 6 items • Updated 9 days ago • 8

upvoted a paper 22 days ago

Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought

Paper • 2510.04230 • Published 27 days ago • 26

upvoted a collection about 1 month ago

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 46 items • Updated Sep 10 • 131

upvoted a paper about 1 month ago

New Trends for Modern Machine Translation with Large Reasoning Models

Paper • 2503.10351 • Published Mar 13 • 25

upvoted a collection about 2 months ago

AceReason

Math and Code reasoning model trained through reinforcement learning (RL) • 7 items • Updated 10 days ago • 18

upvoted a collection 2 months ago

Hermes 4 Collection

11 items • Updated Sep 8 • 70

upvoted a collection 3 months ago

Tool Use Reasoning

A collection of tool use reasoning dataset in Hermes format • 5 items • Updated Jul 23 • 8

upvoted 2 papers 3 months ago

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7 • 126

Don't Overthink It: A Survey of Efficient R1-style Large Reasoning Models

Paper • 2508.02120 • Published Aug 4 • 19

upvoted a collection 5 months ago

[New] AI Technologies & Services

'OpenFree AI' 커뮤니티: https://discord.gg/openfreeai • 182 items • Updated 7 days ago • 21

upvoted a paper 5 months ago

NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning

Paper • 2504.13941 • Published Apr 15 • 11

upvoted 4 collections 5 months ago

Multimodal Reasoning

130 items • Updated 6 days ago • 33

Papers to Read

208 items • Updated Aug 24 • 10

Multimodal LLM

313 items • Updated 5 days ago • 39

VisionLM

1701 items • Updated 3 days ago • 127

upvoted 2 papers 5 months ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published Apr 10 • 132

Eagle 2: Building Post-Training Data Strategies from Scratch for Frontier Vision-Language Models

Paper • 2501.14818 • Published Jan 20 • 9