- Rewnozom/agent-zero-v1-a-01
  Text Generation • 4B • Updated • 1
- TheBloke/MythoMax-L2-13B-GGUF
  13B • Updated • 57.7k • 196
- DavidAU/Llama-3.2-8X3B-MOE-Dark-Champion-Instruct-uncensored-abliterated-18.4B-GGUF
  Text Generation • 18B • Updated • 49.9k • 383
- QuantFactory/DarkIdol-Llama-3.1-8B-Instruct-1.2-Uncensored-GGUF
  Text Generation • 8B • Updated • 12.9k • 118
Collections
Collections including paper arxiv:2501.12948
- DeepSite v3
  🐳 Generate any application by Vibe Coding • 15.5k
- deepseek-ai/DeepSeek-R1-0528
  Text Generation • 685B • Updated • 568k • 2.38k
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
  Paper • 2501.12948 • Published • 420
- open-r1/Mixture-of-Thoughts
  Viewer • Updated • 699k • 5.53k • 284

- ibm-granite/granite-3.2-8b-instruct
  Text Generation • 8B • Updated • 4.36k • 87
- deepseek-ai/DeepSeek-V3-0324
  Text Generation • 685B • Updated • 315k • 3.07k
- Qwen/Qwen2.5-Omni-7B
  Any-to-Any • 11B • Updated • 245k • 1.81k
- nvidia/Llama-Nemotron-Post-Training-Dataset
  Viewer • Updated • 3.91M • 3.79k • 592

- deepseek-ai/DeepSeek-V3-0324
  Text Generation • 685B • Updated • 315k • 3.07k
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
  Paper • 2501.12948 • Published • 420
- deepseek-ai/DeepSeek-R1
  Text Generation • 685B • Updated • 462k • 12.8k
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
  Paper • 2402.03300 • Published • 129

- DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
  Paper • 2401.02954 • Published • 48
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
  Paper • 2401.06066 • Published • 56
- DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence
  Paper • 2401.14196 • Published • 66
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
  Paper • 2402.03300 • Published • 129

- Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model
  Paper • 2503.24290 • Published • 62
- I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders
  Paper • 2503.18878 • Published • 119
- START: Self-taught Reasoner with Tools
  Paper • 2503.04625 • Published • 113
- DAPO: An Open-Source LLM Reinforcement Learning System at Scale
  Paper • 2503.14476 • Published • 141