Edit Models filters

Apps

Inference Providers

HF Inference API

Misc

speculative-decoding

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

44

Full-text search

Active filters: speculative-decoding

taobao-mnn/Qwen3-VL-2B-Instruct-Eagle3

Text Generation • 0.1B • Updated 6 days ago • 41 • 2

mradermacher/DeepSeek-R1-DRAFT-0.5B-v1.0-GGUF

0.5B • Updated Jul 11 • 385

mradermacher/DeepSeek-V3-0324-DRAFT-0.5B-v1.0-GGUF

0.5B • Updated Jul 11 • 243

Gapeleon/DeepSeek-R1-0528-CODER-DRAFT-0.6B-v1.0-Q4_K_M-GGUF

0.6B • Updated Jun 10 • 7

Goldenwert/multitoken-gpt2-metamathqa

Text Generation • Updated Jun 10 • 1

mradermacher/DeepSeek-V3-0324-CODER-DRAFT-0.6B-v1.0-GGUF

0.6B • Updated Jul 11 • 224

mradermacher/DeepSeek-R1-0528-CODER-DRAFT-0.6B-v1.0-GGUF

0.6B • Updated Jul 11 • 333

corupta/dseek-draft-test

0.8B • Updated Jun 13 • 2

nm-testing/eagle-llama3.1-8b-instruct

0.3B • Updated Jul 9 • 4

nm-testing/hass-llama3.1-8b-layernorms

0.3B • Updated Jul 9 • 4

mradermacher/DeepSeek-R1-0528-CODER-DRAFT-0.6B-v1.1-GGUF

0.6B • Updated Jul 11 • 188

mradermacher/DeepSeek-V3-0324-CODER-DRAFT-0.6B-v1.1-GGUF

0.6B • Updated Jul 11 • 204

mradermacher/DeepSeek-R1-DRAFT-0.6B-v2.0-GGUF

0.6B • Updated Jul 20 • 219

mradermacher/DeepSeek-V3-DRAFT-0.6B-v2.0-GGUF

0.6B • Updated Jul 22 • 138 • 1

jukofyork/GLM-4.5-DRAFT-0.6B-v3.0

0.6B • Updated Aug 9 • 46 • 3

jukofyork/GLM-4.5-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated Aug 9 • 405 • 17

mradermacher/GLM-4.5-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated Aug 8 • 335

mradermacher/GLM-4.5-DRAFT-0.6B-v3.0-i1-GGUF

0.6B • Updated Aug 8 • 443

jukofyork/DeepSeek-R1-DRAFT-0.6B-v3.0

0.6B • Updated Aug 10 • 3 • 1

jukofyork/DeepSeek-R1-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated Aug 9 • 73

mradermacher/DeepSeek-R1-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated Aug 10 • 156

mradermacher/DeepSeek-R1-DRAFT-0.6B-v3.0-i1-GGUF

0.6B • Updated Aug 10 • 192

jukofyork/DeepSeek-V3-DRAFT-0.6B-v3.0

0.6B • Updated Aug 10 • 4

jukofyork/DeepSeek-V3-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated Aug 10 • 53

jukofyork/Qwen3-0.6B-YaRN-GGUF

0.8B • Updated Aug 10 • 83 • 3

jukofyork/Kimi-K2-Instruct-DRAFT-0.6B-v3.0

0.7B • Updated Aug 11 • 4 • 1

jukofyork/Kimi-K2-Instruct-DRAFT-0.6B-v3.0-GGUF

0.7B • Updated Aug 11 • 84

jukofyork/Qwen3-Coder-Instruct-DRAFT-0.75B-GGUF

0.8B • Updated Aug 11 • 380 • 5

mradermacher/DeepSeek-V3-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated Aug 12 • 151

mradermacher/DeepSeek-V3-DRAFT-0.6B-v3.0-i1-GGUF

0.6B • Updated Aug 12 • 323