Memory-Efficient Looped Transformer: Decoupling Compute from Memory in Looped Language Models Paper • 2605.07721 • Published 14 days ago • 29
Running on CPU Upgrade Featured 3.18k The Smol Training Playbook 📚 3.18k The secrets to building world-class LLMs
Restarting Agents 49 Leaderboard: Physical Reasoning from Video 🏃 49 Submit model evaluations and view leaderboard results