Active filters: cuda
Text Generation
• 8B • Updated • 68.9k
• 694
ussoewwin/Flash-Attention-2_for_Windows
Updated • 97
atomicmilkshake/llama-cpp-turboquant-binaries
Multilingual-Multimodal-NLP/IndustrialCoder
Text Generation
• 32B • Updated • 248
• 62
Text Generation
• 4B • Updated • 6.19k
• 43
ValiantLabs/gemma-4-E2B-it-ShiningValiant3
Image-Text-to-Text
• 5B • Updated • 136
• 4
Text Generation
• Updated • 9
• 23
CalderaAI/13B-Ouroboros-GPTQ4bit-128g-CUDA
Text Generation
• Updated • 8
marcorez8/llama-cpp-python-windows-blackwell-cuda
ValiantLabs/Qwen3-8B-ShiningValiant3
Text Generation
• 8B • Updated • 24
• 3
mradermacher/Qwen3-8B-ShiningValiant3-GGUF
8B • Updated • 3.53k
• 2
mradermacher/Qwen3-8B-ShiningValiant3-i1-GGUF
8B • Updated • 2.24k
• 2
ValiantLabs/Qwen3-1.7B-ShiningValiant3
Text Generation
• 2B • Updated • 9
• 5
mradermacher/Qwen3-1.7B-ShiningValiant3-GGUF
2B • Updated • 163
mradermacher/Qwen3-1.7B-ShiningValiant3-i1-GGUF
2B • Updated • 341
ValiantLabs/Qwen3-4B-ShiningValiant3
Text Generation
• 4B • Updated • 16
• 7
sequelbox/Qwen3-8B-PlumEsper
Text Generation
• 8B • Updated • 5
sequelbox/Qwen3-4B-PlumEsper
Text Generation
• 4B • Updated • 4
mradermacher/Qwen3-Shining-Lucy-CODER-3.5B-Brainstorm20x-e32-GGUF
3B • Updated • 334
• 2
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-mix2-GGUF
2B • Updated • 118
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-GGUF
2B • Updated • 213
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-mix2-i1-GGUF
2B • Updated • 514
• 1
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-i1-GGUF
2B • Updated • 211
mradermacher/Qwen3-Shining-Lucy-CODER-3.5B-Brainstorm20x-e32-i1-GGUF
3B • Updated • 326
• 1
mradermacher/Qwen3-Shining-Valiant-Instruct-Fast-CODER-Reasoning-2.4B-GGUF
2B • Updated • 147
mradermacher/Qwen3-Shining-Valiant-Instruct-Fast-CODER-Reasoning-2.4B-i1-GGUF
2B • Updated • 192
mradermacher/Qwen3-Shining-Valiant-Instruct-CODER-Reasoning-2.7B-GGUF
3B • Updated • 169
mradermacher/Qwen3-Shining-Valiant-Instruct-CODER-Reasoning-2.7B-i1-GGUF
3B • Updated • 492
mradermacher/Qwen3-Shining-Lucy-CODER-3.4B-Brainstorm20x-e32-GGUF
3B • Updated • 452