Marcus Williams PRO
mgkwill
·
AI & ML interests
None yet
Organizations
Reasoning-01
-
Skywork Open Reasoner 1 Technical Report
Paper • 2505.22312 • Published • 54 -
Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities
Paper • 2505.21191 • Published • 3 -
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Paper • 2505.03335 • Published • 186 -
Qwen3 Technical Report
Paper • 2505.09388 • Published • 315
OpenSci
chat-models-candidates
-
nikravan/glm-4vq
Document Question Answering • 7B • Updated • 36 • 36 -
deepseek-ai/deepseek-coder-33b-instruct
Text Generation • 33B • Updated • 22k • 546 -
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence
Paper • 2401.14196 • Published • 66 -
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
Paper • 1910.01108 • Published • 21
Read later
OpenSci
Reasoning-01
-
Skywork Open Reasoner 1 Technical Report
Paper • 2505.22312 • Published • 54 -
Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities
Paper • 2505.21191 • Published • 3 -
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Paper • 2505.03335 • Published • 186 -
Qwen3 Technical Report
Paper • 2505.09388 • Published • 315
chat-models-candidates
-
nikravan/glm-4vq
Document Question Answering • 7B • Updated • 36 • 36 -
deepseek-ai/deepseek-coder-33b-instruct
Text Generation • 33B • Updated • 22k • 546 -
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence
Paper • 2401.14196 • Published • 66 -
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
Paper • 1910.01108 • Published • 21