Lost in Backpropagation: The LM Head is a Gradient Bottleneck Paper โข 2603.10145 โข Published Mar 10 โข 13
Running on CPU Upgrade 245 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens ๐ 245 Explore synthetic data benchmarks via an interactive bookshelf
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day โข 640 items โข Updated 11 days ago โข 98
view article Article ๐ช Introduction to Matryoshka Embedding Models +1 tomaarsen, Xenova, osanseviero โข Feb 23, 2024 โข 210
Running Agents Featured 855 Qwen3 Demo ๐ 855 Chat with an AI assistant that thinks before answering