Voximplant has new realtime speech generation for voice AI from Inworld, our latest Voice AI text-to-speech (TTS) partner. Together, we combine state-of-the-art TTS with carrier-grade connectivity so you can build voice agents that sound like your brand, not a generic robot.
With support for Microsoft Azure Text-to-Speech, you can choose from more than 425 voice options covering 40 languages and 60 dialects from 6 distinct speech synthesis engine providers.
We recently added IBM Watson™ Text-to-Speech to our list of speech synthesis engine options, expanding the number of voices you can dynamically synthesize as part of phone and web calls.
In this digest, we bring you the latest updates to Voximplant Kit. We have added support for outbound WhatsApp messages, mobile chats, and ElevenLabs neural voices, along with new automated campaign settings.
Learn how a Voice AI Orchestration Platform connects LLMs, STT/TTS, turn‑taking, and telephony (PSTN, SIP, WebRTC) to build reliable real‑time voice agents. See benefits, architecture, and how Voximplant helps.
Check out the latest Voximplant Kit updates: we developed chat analytics, improved call history, added new tools for supervisors, expanded scenario capabilities, and updated the softphone. Below is a brief overview of the essential enhancements.
New integrations for Voice AI have arrived: Google's Gemini 2.0 Flash model, featuring seamless voice-to-voice conversation capabilities, and ElevenLabs low-latency streaming speech synthesis are now available for Voximplant developers.
Voximplant now includes a native Deepgram module that connects any Voximplant call to Deepgram’s Voice Agent API for real-time, speech‑to‑speech conversations. You can stream audio from phone numbers, SIP trunks, WhatsApp, or WebRTC into Deepgram’s unified agent environment, which combines STT, LLM reasoning, and TTS, and play responses via Voximplant’s serverless runtime with minimal latency.