AI & ML interests

None defined yet.

Recent Activity

alandao  new activity about 14 hours ago
Menlo/Jan-nano-128k:Scaling
bachvudinh  updated a dataset 6 days ago
Menlo/R-Horizon-vision-train-ready
bachvudinh  published a dataset 6 days ago
Menlo/R-Horizon-vision-train-ready
View all activity

alandao 
in Menlo/Jan-nano-128k about 14 hours ago

Scaling

🔥 1
2
#11 opened about 15 hours ago by
ziryeg
alandao 
posted an update 4 months ago
view post
Post
1337
Don’t give up 🔥

Do you know what I was planning to do this time last week?

I was preparing to write a report declaring that Jan Nano was a failed project because the benchmark results didn’t meet expectations.

But I thought — it can’t be. When loading the model into the app, the performance clearly felt better. So why were the benchmark results worse?

That’s when I reviewed the entire benchmark codebase and realized something fundamental: agentic or workflow-based approaches introduce a huge gap and variation when benchmarking. Jan-nano was trained with an agentic setup — it simply can’t be benchmarked using a rigid workflow-based method.

I made the necessary changes, and the model ended up performing even better than before the issues arose. Turns out the previous benchmarking method conflicted with the way the model was trained.

What if I had given up? That would’ve meant 1.5 months of training and a huge amount of company resources wasted.

But now, this is officially the most successful and biggest release for the whole team — all thanks to Jan-nano.

Menlo/Jan-nano