cpaua

Voxtral: Open-Weights TTS Alternative to ElevenLabs by Mistral

An open-weights alternative to ElevenLabs has arrived.

Voxtral is a speech synthesis (text-to-speech) model from Mistral:

- only 4 billion parameters
- 70 ms latency for voice agents
- voice cloning from 3 seconds of audio
- 9 languages + cross-lingual transfer
- 68.4% win rate against ElevenLabs Flash v2.5

The open weights are available on Hugging Face: huggingface.co/mistralai/Voxtral-4B-TTS-2603
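For anyone who wants to pull the weights locally, here is a minimal sketch. It assumes the `huggingface_hub` Python package is installed and that the repository can be downloaded without extra access steps, which the post does not confirm. The helper names `repo_url` and `download_weights` are made up for illustration and are not part of any Mistral tooling.

```python
from typing import Optional

# Repository ID taken from the link in the post.
REPO_ID = "mistralai/Voxtral-4B-TTS-2603"


def repo_url(repo_id: str) -> str:
    """Build the Hugging Face page URL for a repository ID."""
    return f"https://huggingface.co/{repo_id}"


def download_weights(repo_id: str = REPO_ID, local_dir: Optional[str] = None) -> str:
    """Download every file in the repository and return the local folder path.

    Assumption: huggingface_hub is installed (pip install huggingface_hub).
    The import is kept inside the function so the rest of the module
    works without the package.
    """
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=repo_id, local_dir=local_dir)


if __name__ == "__main__":
    print("Model page:", repo_url(REPO_ID))
    # Uncomment to fetch the files (a 4B-parameter model is a multi-gigabyte download):
    # print("Saved to:", download_weights())
```

This only fetches the files. How to load the model and run synthesis depends on the instructions in the model card, which the post does not cover.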

