Menlo Research

Team

company

Verified

https://www.menlo.ai

menloresearch

Recent Activity

alandao in Menlo/Jan-nano-128k about 14 hours ago

Scaling • 🔥 1 • #11 opened about 15 hours ago by ziryeg

bachvudinh updated a dataset 6 days ago

Menlo/R-Horizon-vision-train-ready

Viewer • Updated 6 days ago • 2.28k • 15

bachvudinh published a dataset 6 days ago

Menlo/R-Horizon-vision-train-ready

Viewer • Updated 6 days ago • 2.28k • 15

bachvudinh updated a dataset 19 days ago

Menlo/Instruction-Synthetic-v0.4

Viewer • Updated 19 days ago • 20.4k • 32

bachvudinh published a dataset 19 days ago

Menlo/Instruction-Synthetic-v0.4

Viewer • Updated 19 days ago • 20.4k • 32

bachvudinh updated a dataset 19 days ago

Menlo/Instruction-Synthetic-v0.3

Viewer • Updated 19 days ago • 25.2k • 61

alandao authored a paper 4 months ago

Jan-nano Technical Report

Paper • 2506.22760 • Published Jun 28 • 9

alandao posted an update 4 months ago

Don’t give up 🔥

Do you know what I was planning to do this time last week?

I was preparing to write a report declaring that Jan-nano was a failed project because the benchmark results didn’t meet expectations.

But I thought: it can’t be. When I loaded the model into the app, its performance clearly felt better. So why were the benchmark results worse?

That’s when I reviewed the entire benchmark codebase and realized something fundamental: whether a benchmark runs the model in an agentic setup or a workflow-based one introduces a huge gap and a lot of variation in the results. Jan-nano was trained with an agentic setup, so it simply can’t be benchmarked using a rigid workflow-based method.

I made the necessary changes, and the model ended up performing even better than before the issues arose. It turns out the previous benchmarking method conflicted with the way the model was trained.

What if I had given up? That would’ve meant 1.5 months of training and a huge amount of company resources wasted.

But now this is officially the biggest and most successful release for the whole team, all thanks to Jan-nano.

Menlo/Jan-nano