The Big Benchmarks Collection Collection Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) β’ 13 items β’ Updated Nov 18, 2024 β’ 253
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper β’ 2506.01939 β’ Published Jun 2 β’ 185
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. β’ 53 items β’ Updated about 24 hours ago β’ 235
view article Article π¦Έπ»#14: What Is MCP, and Why Is Everyone β Suddenly!β Talking About It? By Kseniase β’ Mar 17 β’ 340
view article Article From Files to Chunks: Improving Hugging Face Storage Efficiency Nov 20, 2024 β’ 66
view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr β’ Feb 7 β’ 245
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Paper β’ 2501.09686 β’ Published Jan 16 β’ 41
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 β’ 217
view article Article Efficient LLM Pretraining: Packed Sequences and Masked Attention By sirluk β’ Oct 7, 2024 β’ 58