🔹 True Single-GPU Extreme Speed ⚡️ No need to rely on traditional workarounds like KV-cache, quantization, sparse/linear attention, or TinyVAE. Helios hits an end-to-end 19.5 FPS on a single H100!
Training is also highly accessible: 80GB of VRAM can fit four 14B models.
🔹 Solving Long-Video "Drift" from the Core 🎥 Tired of visual drift and repetitive loops? We ditched traditional hacks (like error banks, self-forcing, or keyframe sampling).
Instead, our innovative training strategy simulates & eliminates drift directly, keeping minute-long videos incredibly coherent with stunning quality. ✨
🔹 3 Model Variants for Full Coverage 🛠️ With a unified architecture natively supporting T2V, I2V, and V2V, we are open-sourcing 3 flavors:
1️⃣ Base: Single-stage denoising for extremely high fidelity.
2️⃣ Mid: Pyramid denoising + CFG-Zero for the perfect balance of quality & throughput.
3️⃣ Distilled: Adversarial Distillation (DMD) for ultra-fast, few-step generation.
🔹 Day-0 Ecosystem Ready 🌍 We wanted deployment to be a breeze from the second we launched. Helios drops with comprehensive Day-0 hardware and framework support:
My USB charger has a Blackwell GPU and 128GB RAM. What. A. Time. To. Be. Alive.
People in Sofia: “It’s freezing.” Me: sitting next to 3kW of space AI heaters on my desk 👀
1x GLM-5, 2x MiniMax-M2.5, 1x Qwen3 Coder Next; all on a single Aibrix/K8s cluster
I just pushed Claude Code Agent Swarm with 20 coding agents on my desktop GPU workstation.
With local AI, I don’t have the /fast CC switch, but I have /absurdlyfast:
- 100’499 tokens/second read (yeah, 100k, not a typo) | 811 tok/sec generation
- KV cache: 707’200 tokens
- Hardware: 5+ year old GPUs, 4x A6K gen1
It’s not the car. It’s the driver.
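For a rough feel of what those numbers mean per agent, here is a back-of-the-envelope sketch. It assumes the throughput and KV-cache figures are aggregate totals shared evenly across the 20 agents mentioned above; the post does not say that, and the helper name `per_agent_share` is made up for illustration.

```python
# Figures quoted in the post. Treating them as totals shared by all
# agents is an ASSUMPTION, not something the post states.
AGENTS = 20
READ_TOK_PER_SEC = 100_499      # prompt/read throughput
GEN_TOK_PER_SEC = 811           # generation throughput
KV_CACHE_TOKENS = 707_200       # total KV cache capacity


def per_agent_share(total: float, agents: int = AGENTS) -> float:
    """Evenly split an aggregate figure across concurrent agents."""
    if agents <= 0:
        raise ValueError("agents must be positive")
    return total / agents


if __name__ == "__main__":
    print(f"read tok/s per agent: {per_agent_share(READ_TOK_PER_SEC):.1f}")
    print(f"gen tok/s per agent:  {per_agent_share(GEN_TOK_PER_SEC):.2f}")
    print(f"KV tokens per agent:  {per_agent_share(KV_CACHE_TOKENS):.0f}")
```

Under that even-split assumption, each agent gets about 35k tokens of KV cache and roughly 40 tok/sec of generation. Real schedulers do not split evenly, so treat this as an order-of-magnitude check only.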
Qwen3 Coder Next AWQ with cache at BF16. Scores 82.1% in C# on 29-years-in-dev codebase vs Opus 4.5 at only 57.5%. When your codebase predates Stack Overflow, you don't need the biggest model; you need the one that actually remembers Windows 95.
My current bottleneck is my 27" monitor. Can't fit all 20 Theos on screen without squinting.