LongLive

GitHub Repo · Pretty sure · future-dated news
https://github.com/NVlabs/LongLive

NVIDIA's serious attempt at real-time long-form video generation—actually ships inference code, quantization work, and production infra instead of just talking about it.

Slop 5% · Signal 30% · Science 65%

LongLive 2.0 is genuine infrastructure work: NVFP4 quantization (W4A4), sequence parallelism for both training and inference, streaming VAE, async decoding. The paper is real (accepted at ICLR 2026), models are downloadable, and the code examples show actual inference pipelines, not marketing fluff. The 45.7 FPS claim with NVFP4-2Step on 5B params is measurable. BUT: this is primarily an engineering/systems contribution, not novel research (attention sink and KV-recache were already LongLive 1.0 concepts). The ...

1799 stars · Python · 2026-05-21 · 243 days old
