-
MorphoBench: A Benchmark with Difficulty Adaptive to Model Reasoning
Paper • 2510.14265 • Published • 19 -
DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning
Paper • 2510.15110 • Published • 15 -
MemMamba: Rethinking Memory Patterns in State Space Model
Paper • 2510.03279 • Published • 72 -
Learning Optimal Predictive Checklists
Paper • 2112.01020 • Published • 1
Weston
SirBubblesIII
AI & ML interests
data quality over quantity all the way
Recent Activity
updated
a collection
10 days ago
Cool stuff
liked
a model
10 days ago
vitalune/nanochat-d10-filtered-500m
updated
a collection
30 days ago
Cool stuff