Article: Prefill and Decode for Concurrent Requests - Optimizing LLM Performance • By tngtech • Apr 16 • 51
Collection: Quantization Spaces on the Hub ⚡ • Spaces that let you quantize models directly on the Hub • 4 items • Updated Nov 4, 2024 • 6
Collection: Reasoning Router • Explores routing for hybrid models between “Thinking” (accurate) and “Non-Thinking” (fast) modes, using open models (Qwen3) • 8 items • Updated Sep 25 • 2
Collection: Scaling Test-Time Compute with Open Models • Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6 • 27
Article: Use Models from the Hugging Face Hub in LM Studio • By yagilb • Nov 28, 2024 • 140
Collection: 🪐 SmolLM • A series of smol LLMs (135M, 360M, and 1.7B). We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5 • 237