kyutai-labs/moshi · gitaskhub

kyutai-labs/moshi↗

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Stars · 10,507

Language · Python

License · Apache-2.0

Ask anything about this repo to start.

Full explanation on explaingit →