Running 107 Unlocking On-Policy Distillation for Any Model Family π 107 Visualize on-policy distillation for any model family
Running Agents 7 Dataset Length Profiler π 7 Estimate optimal max_length for SFT training with token analysis
Running 3.86k The Ultra-Scale Playbook π 3.86k The ultimate guide to training LLM on large GPU Clusters
Running Agents 88 Large Reasoning Models Leaderboard π³ 88 A leaderboard to rank large reasoning models
Running 600 Scaling test-time compute π 600 Boost LLM answers with flexible testβtime search strategies
Running Agents 430 Reward Bench Leaderboard π 430 Explore and compare model scores on RewardBench benchmarks
Sleeping Agents 103 Huggingface Leaderboard π 103 Show Hugging Face models, datasets & spaces leaderboards
Running on CPU Upgrade 14k Open LLM Leaderboard π 14k Track, rank and evaluate open LLMs and chatbots