meta-llama/Llama-3.2-1B-Instruct Text Generation • 1B • Updated Oct 24, 2024 • 3.87M • • 1.14k
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes Paper • 2306.13649 • Published Jun 23, 2023 • 25