voicebox

by jamiepine
🔓 Open Source · TypeScript · 🌍 Global · Free

About

Voicebox is a local-first, open-source AI voice studio by jamiepine that offers a private alternative to ElevenLabs and WisprFlow. It provides high-fidelity voice cloning, speech generation in 23 languages through 7 engines (including Qwen3-TTS and Kokoro), and global dictation via speech-to-text (STT). Its native MCP server lets AI agents such as Claude and Cursor speak in custom-cloned voices, while all voice data and models stay on the user's local machine.

Features

  • Multi-engine zero-shot voice cloning
  • Built-in MCP server for AI agents
  • Support for 23 languages and 7 TTS engines
  • Global dictation with hotkey support
  • 100% local inference and privacy

Supported Platforms

Desktop