voicebox
by jamiepine
About
Voicebox is a local-first, open-source AI voice studio that offers a private alternative to ElevenLabs and WisprFlow. It provides high-fidelity voice cloning, speech generation in 23 languages across 7 engines (including Qwen3-TTS and Kokoro), and global dictation via speech-to-text (STT). Its native MCP server lets AI agents such as Claude and Cursor speak in custom-cloned voices, while all voice data and models stay on the user's local machine.
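As a rough illustration of how an agent might ask an MCP server to speak, the sketch below builds a `tools/call` request in the JSON-RPC 2.0 shape that MCP uses. The tool name `speak` and the `text` and `voice` argument keys are assumptions made for this example, not Voicebox's documented interface; the tool names to use are whatever the server reports in its own tool list.

```python
import json


def build_tool_call(request_id, tool_name, arguments):
    """Build an MCP `tools/call` request as a JSON-RPC 2.0 message."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# Hypothetical tool name and argument keys, chosen only for illustration.
request = build_tool_call(
    1,
    "speak",
    {"text": "Build finished.", "voice": "my-cloned-voice"},
)

# An MCP client would send this message over its transport (stdio or HTTP).
print(json.dumps(request, indent=2))
```

In practice an MCP-aware client such as Claude or Cursor constructs and sends these messages itself, so the user normally only registers the server with the client.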
Features
- Multi-engine zero-shot voice cloning
- Built-in MCP server for AI agents
- Support for 23 languages and 7 TTS engines
- Global dictation with hotkey support
- 100% local inference and privacy
Supported Platforms
desktop