Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe Paper • 2604.13016 • Published Apr 14 • 109 • 6
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe Paper • 2604.13016 • Published Apr 14 • 109 • 6
JustRL: Scaling a 1.5B LLM with a Simple RL Recipe Paper • 2512.16649 • Published Dec 18, 2025 • 29 • 3