personaplex

GitHub Repo · Pretty sure · Moshi weights matter here
https://github.com/NVIDIA/personaplex

NVIDIA ships a real-time, two-way speech AI that talks back without waiting for its turn: rare, tangible progress on the hardest part of voice UX.

Slop 15% · Signal 25% · Science 60%

This is a legit finetuning and systems contribution: full-duplex speech (speaking while listening), persona and voice control, and sub-100 ms latency aren't trivial. The paper exists (arXiv 2602.06053), the weights are published, and the repo has actual inference code, not a wrapper around a closed API. Downsides: the weights are finetuned from Moshi (borrowed credibility), the training distribution is narrow (synthetic conversations plus the Fisher corpus), and the 'generalization' section is basically "we encourage you to find wei...

7909 stars · Python · 2026-03-02 · 92 days old
