Running Featured 58 Distilling 100B+ Models 40x Faster with TRL 📝 58 TRL distillation for 100B+ teachers, 40x faster
Jackrong/Qwen3.5-4B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 5B • Updated 11 days ago • 14.9k • 29
Jackrong/Qwen3.5-0.8B-Claude-4.6-Opus-Reasoning-Distilled Text Generation • 0.9B • Updated Mar 6 • 1.86k • 10
Jackrong/MLX-Qwen3.5-4B-Claude-4.6-Opus-Reasoning-Distilled-v2-4bit Text Generation • 0.7B • Updated 29 days ago • 1.17k • 5
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF Image-Text-to-Text • 27B • Updated 11 days ago • 407k • 568