3 8 6

gggg

justin6667

AI & ML interests

None yet

Recent Activity

liked a model 2 months ago

baidu/ERNIE-4.5-VL-424B-A47B-PT

liked a model 2 months ago

baidu/ERNIE-4.5-21B-A3B-Thinking

upvoted a paper 3 months ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

View all activity

Organizations

None yet

liked 2 models 2 months ago

baidu/ERNIE-4.5-VL-424B-A47B-PT

Image-Text-to-Text • 424B • Updated 9 days ago • 1.97k • 99

baidu/ERNIE-4.5-21B-A3B-Thinking

Text Generation • 22B • Updated 9 days ago • 14.3k • • 766

upvoted a paper 3 months ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24 • 80

liked a Space 7 months ago

Gemini Balance

🐨

Display a loading screen with a spinner

updated a Space 7 months ago

Gb Hahha

🌍

baipiao

published a Space 7 months ago

Gb Hahha

🌍

baipiao

liked a model 9 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 713k • • 12.9k

liked a Space 9 months ago

README

📈

upvoted an article 9 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

•

886

upvoted a paper 10 months ago

The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization

Paper • 2403.17031 • Published Mar 24, 2024 • 6

New activity in deepseek-ai/DeepSeek-V3-Base 11 months ago

应该把字节、阿里、百度的钱和显卡都分给deepseek，不然浪费资源啊

🤗 11

#23 opened 11 months ago by

eatcosmos

liked a dataset about 1 year ago

princeton-nlp/SWE-bench_bm25_50k_llama

Viewer • Updated Apr 15, 2024 • 2.29k • 137 • 6

upvoted 3 collections over 1 year ago

upvoted 2 papers over 1 year ago

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

Paper • 2405.19327 • Published May 29, 2024 • 48

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Paper • 2404.02258 • Published Apr 2, 2024 • 107

updated a model almost 2 years ago

justin6667/vit-base-patch16-224-in21k-finetuned-lora-food101

Image Classification • 85.9M • Updated Feb 15, 2024 • 3

gggg

AI & ML interests

Recent Activity

Organizations

justin6667's activity

Gemini Balance

Gb Hahha

Gb Hahha

README

Open-R1: a fully open reproduction of DeepSeek-R1

应该把字节、阿里、百度的钱和显卡都分给deepseek，不然浪费资源啊