Edit Models filters

Apps

Inference Providers

HF Inference API

Misc

speculative-decoding

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

46

Full-text search

Active filters: speculative-decoding

mradermacher/GLM-4.5-DRAFT-0.6B-v3.0-i1-GGUF

0.6B • Updated Aug 8 • 307 • 1

taobao-mnn/Qwen3-VL-2B-Instruct-Eagle3

Text Generation • 0.1B • Updated 11 days ago • 52 • 2

mradermacher/DeepSeek-R1-DRAFT-0.5B-v1.0-GGUF

0.5B • Updated Jul 11 • 436

mradermacher/DeepSeek-V3-0324-DRAFT-0.5B-v1.0-GGUF

0.5B • Updated Jul 11 • 249

Gapeleon/DeepSeek-R1-0528-CODER-DRAFT-0.6B-v1.0-Q4_K_M-GGUF

0.6B • Updated Jun 10 • 7

Goldenwert/multitoken-gpt2-metamathqa

Text Generation • Updated Jun 10 • 1

mradermacher/DeepSeek-V3-0324-CODER-DRAFT-0.6B-v1.0-GGUF

0.6B • Updated Jul 11 • 245

mradermacher/DeepSeek-R1-0528-CODER-DRAFT-0.6B-v1.0-GGUF

0.6B • Updated Jul 11 • 366

corupta/dseek-draft-test

0.8B • Updated Jun 13 • 7

nm-testing/eagle-llama3.1-8b-instruct

0.3B • Updated Jul 9 • 4

nm-testing/hass-llama3.1-8b-layernorms

0.3B • Updated Jul 9 • 4

mradermacher/DeepSeek-R1-0528-CODER-DRAFT-0.6B-v1.1-GGUF

0.6B • Updated Jul 11 • 210

mradermacher/DeepSeek-V3-0324-CODER-DRAFT-0.6B-v1.1-GGUF

0.6B • Updated Jul 11 • 226

mradermacher/DeepSeek-R1-DRAFT-0.6B-v2.0-GGUF

0.6B • Updated Jul 20 • 149

mradermacher/DeepSeek-V3-DRAFT-0.6B-v2.0-GGUF

0.6B • Updated Jul 22 • 119 • 1

jukofyork/GLM-4.5-DRAFT-0.6B-v3.0

0.6B • Updated Aug 9 • 48 • 3

jukofyork/GLM-4.5-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated Aug 9 • 471 • 17

mradermacher/GLM-4.5-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated Aug 8 • 233

jukofyork/DeepSeek-R1-DRAFT-0.6B-v3.0

0.6B • Updated Aug 10 • 2 • 1

jukofyork/DeepSeek-R1-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated Aug 9 • 82

mradermacher/DeepSeek-R1-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated Aug 10 • 169

mradermacher/DeepSeek-R1-DRAFT-0.6B-v3.0-i1-GGUF

0.6B • Updated Aug 10 • 203

jukofyork/DeepSeek-V3-DRAFT-0.6B-v3.0

0.6B • Updated Aug 10 • 3

jukofyork/DeepSeek-V3-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated Aug 10 • 64

jukofyork/Qwen3-0.6B-YaRN-GGUF

0.8B • Updated Aug 10 • 72 • 3

jukofyork/Kimi-K2-Instruct-DRAFT-0.6B-v3.0

0.7B • Updated Aug 11 • 4 • 1

jukofyork/Kimi-K2-Instruct-DRAFT-0.6B-v3.0-GGUF

0.7B • Updated Aug 11 • 106

jukofyork/Qwen3-Coder-Instruct-DRAFT-0.75B-GGUF

0.8B • Updated Aug 11 • 400 • 5

mradermacher/DeepSeek-V3-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated Aug 12 • 164

mradermacher/DeepSeek-V3-DRAFT-0.6B-v3.0-i1-GGUF

0.6B • Updated Aug 12 • 364