Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning Paper • 2606.10968 • Published 3 days ago • 41
Open LLM Leaderboard • 14k • Track, rank and evaluate open LLMs and chatbots
Can You Run It? LLM version • 1.05k • Check if your GPU can run a chosen LLM model
[lecture artifacts] aligning open language models Collection artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin • 63 items • Updated Apr 17, 2024 • 58
The Big Benchmarks Collection Collection Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 266
Open LLM Leaderboard best models ❤️‍🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: • 50 items • Updated Mar 13 • 690
Chain-of-Thought Hub: A Continuous Effort to Measure Large Language Models' Reasoning Performance Paper • 2305.17306 • Published May 26, 2023 • 2