Text Generation
Transformers
GGUF
English
Merge
programming
code generation
code
coding
coder
chat
brainstorm
qwen
qwen3
qwencoder
brainstorm20x
esper
esper-3
valiant
valiant-labs
qwen-3
qwen-3-8b
8b
reasoning
code-instruct
python
javascript
dev-ops
jenkins
terraform
scripting
powershell
azure
aws
gcp
cloud
problem-solving
architect
engineer
developer
creative
analytical
expert
rationality
conversational
instruct
llama-cpp
gguf-my-repo
| license: apache-2.0 | |
| base_model: DavidAU/Qwen3-Esper3-Reasoning-CODER-Instruct-12B-Brainstorm20x | |
| language: | |
| - en | |
| pipeline_tag: text-generation | |
| tags: | |
| - merge | |
| - programming | |
| - code generation | |
| - code | |
| - coding | |
| - coder | |
| - chat | |
| - brainstorm | |
| - qwen | |
| - qwen3 | |
| - qwencoder | |
| - brainstorm20x | |
| - esper | |
| - esper-3 | |
| - valiant | |
| - valiant-labs | |
| - qwen-3 | |
| - qwen-3-8b | |
| - 8b | |
| - reasoning | |
| - code-instruct | |
| - python | |
| - javascript | |
| - dev-ops | |
| - jenkins | |
| - terraform | |
| - scripting | |
| - powershell | |
| - azure | |
| - aws | |
| - gcp | |
| - cloud | |
| - problem-solving | |
| - architect | |
| - engineer | |
| - developer | |
| - creative | |
| - analytical | |
| - expert | |
| - rationality | |
| - conversational | |
| - instruct | |
| - llama-cpp | |
| - gguf-my-repo | |
| datasets: | |
| - sequelbox/Titanium2.1-DeepSeek-R1 | |
| - sequelbox/Tachibana2-DeepSeek-R1 | |
| - sequelbox/Raiden-DeepSeek-R1 | |
| library_name: transformers | |
| # Triangle104/Qwen3-Esper3-Reasoning-CODER-Instruct-12B-Brainstorm20x-Q5_K_M-GGUF | |
| This model was converted to GGUF format from [`DavidAU/Qwen3-Esper3-Reasoning-CODER-Instruct-12B-Brainstorm20x`](https://huggingface.co/DavidAU/Qwen3-Esper3-Reasoning-CODER-Instruct-12B-Brainstorm20x) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space. | |
| Refer to the [original model card](https://huggingface.co/DavidAU/Qwen3-Esper3-Reasoning-CODER-Instruct-12B-Brainstorm20x) for more details on the model. | |
| --- | |
| This model contains Brainstorm 20x, combined with ValiantLabs's 8B General / Coder (instruct model): | |
| https://huggingface.co/ValiantLabs/Qwen3-8B-Esper3 | |
| Information on the 8B model below, followed by Brainstorm 20x adapter (by DavidAU) and then a complete help section for running LLM / AI models. | |
| The Brainstorm adapter improves code generation, and unique code solving abilities. | |
| This model requires: | |
| - Jinja (embedded) or CHATML template | |
| - Max context of 40k. | |
| Settings used for testing (suggested): | |
| - Temp .3 to .7 | |
| - Rep pen 1.05 to 1.1 | |
| - Topp .8 , minp .05 | |
| - Topk 20 | |
| - No system prompt. | |
| FOR CODING: | |
| Higher temps: .6 to .9 (even over 1) work better for more complex coding / especially with more restrictions. | |
| This model will respond well to both detailed instructions and step by step refinement and additions to code. | |
| As this is an instruct model, it will also benefit from a detailed system prompt too. | |
| --- | |
| ## Use with llama.cpp | |
| Install llama.cpp through brew (works on Mac and Linux) | |
| ```bash | |
| brew install llama.cpp | |
| ``` | |
| Invoke the llama.cpp server or the CLI. | |
| ### CLI: | |
| ```bash | |
| llama-cli --hf-repo Triangle104/Qwen3-Esper3-Reasoning-CODER-Instruct-12B-Brainstorm20x-Q5_K_M-GGUF --hf-file qwen3-esper3-reasoning-coder-instruct-12b-brainstorm20x-q5_k_m.gguf -p "The meaning to life and the universe is" | |
| ``` | |
| ### Server: | |
| ```bash | |
| llama-server --hf-repo Triangle104/Qwen3-Esper3-Reasoning-CODER-Instruct-12B-Brainstorm20x-Q5_K_M-GGUF --hf-file qwen3-esper3-reasoning-coder-instruct-12b-brainstorm20x-q5_k_m.gguf -c 2048 | |
| ``` | |
| Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well. | |
| Step 1: Clone llama.cpp from GitHub. | |
| ``` | |
| git clone https://github.com/ggerganov/llama.cpp | |
| ``` | |
| Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux). | |
| ``` | |
| cd llama.cpp && LLAMA_CURL=1 make | |
| ``` | |
| Step 3: Run inference through the main binary. | |
| ``` | |
| ./llama-cli --hf-repo Triangle104/Qwen3-Esper3-Reasoning-CODER-Instruct-12B-Brainstorm20x-Q5_K_M-GGUF --hf-file qwen3-esper3-reasoning-coder-instruct-12b-brainstorm20x-q5_k_m.gguf -p "The meaning to life and the universe is" | |
| ``` | |
| or | |
| ``` | |
| ./llama-server --hf-repo Triangle104/Qwen3-Esper3-Reasoning-CODER-Instruct-12B-Brainstorm20x-Q5_K_M-GGUF --hf-file qwen3-esper3-reasoning-coder-instruct-12b-brainstorm20x-q5_k_m.gguf -c 2048 | |
| ``` | |