Tag: ASR

Blog
>
Tag: asr

OpenAI Client update: gpt-realtime GA alignment

OpenAI has recently announced GA version of their Realtime API that Voximplant now fully supports

Introducing OpenAI Realtime API Client

OpenAI has launched its beta Realtime API, revolutionizing voice assistants with speech-to-speech interactions, ultra-low latency, and realistic voices. Voximplant’s integration makes it easy to connect calls to OpenAI's models, enabling seamless, human-like conversations with minimal setup.

ASR speech recognition STT

What is Automatic Speech Recognition?

How many times a day do you talk to a computer? We’re not referring to the exasperated exclamation you direct at your laptop when it overheats and crashes. We want you to think about the moments you speak to a device and it actually listens.

TTS ASR voximplant kit

Speech synthesis and voice recognition are now twice as fast in Voximplant Kit

Voximplant Kit will soon have a new and improved IVR block with faster and more accurate speech synthesis and recognition. Learn more about the improvements.

ASR speech recognition speech-to-text

Enhanced speech recognition model is now available

62% Word Error Rate (WER) improvement for US English

ASR speech-to-text

Hot Summer Speech-to-Text Updates

Following Google’s release of new Speech API, we are happy to announce improved quality of call records transcription.

ASR ivr speech recognition

High quality Speech Recognition is now available

We are happy to announce the high quality speech recognition for both audio call records transcription and real-time recognition scenarios.

TTS text-to-speech voxengine ASR

High quality Text-to-Speech functionality is now available for all VoxImplant developers

Introducing the Text-to-Speech functionality integrated into VoxEngine.

TTS text-to-speech voice ai realtime

Inworld Text-to-Speech now available in Voximplant

Voximplant has new realtime speech generation for voice AI from Inworld, our latest Voice AI text-to-speech (TTS) partner. Together, we combine state-of-the-art TTS with carrier-grade connectivity so you can build voice agents that sound like your brand, not a generic robot.

New WebSocket privacy feature for compliance-oriented environments

Voximplant has added a WebSocket privacy option that redacts message payloads from logs across all WebSocket-based services – Voice AI connectors and external speech system – and speech control modules

Secrets for secure credential storage

Voximplant has added Secrets, a dedicated credential store for API keys, tokens, and other sensitive values that VoxEngine scenarios need at runtime

voximplant kit news

Voximplant Kit updates. January 2025

New Features in Voximplant Kit: Update overview. We are constantly working to improve our product to make it easier to use and more effective for you. In this update, we have added several useful features. Here’s what’s new:

Extend Cartesia Line Agents to SIP, WhatsApp, and Global Phone Networks

Voximplant now includes a native Cartesia Line / Agents connector that connects any Voximplant call to a Cartesia Line voice agent for real-time, speech-to-speech conversations—over PSTN, SIP, WebRTC, or WhatsApp Business Calling—without building custom media gateways or WebSocket streaming infrastructure.

Grok Voice Agent API now available in Voximplant

Voximplant now includes a native Grok module that connects any Voximplant call to xAI’s Grok Voice Agent API for real-time, speech-to-speech conversations. With a single VoxEngine scenario, you can interact via audio with Grok over phone numbers, SIP trunks and infrastructure, WhatsApp Business, or WebRTC into Grok — all without building custom media gateways or WebSocket streaming infrastructure.

voice ai agent skills

Your AI coding agent can now build on Voximplant

Voximplant AI Agent Skills let your coding agent build and ship voice applications without switching tools

Cartesia Realtime TTS now available in Voximplant

Voximplant now includes a native Cartesia module for streaming, low-latency text-to-speech (TTS). You can use a single VoxEngine API to synthesize speech in real time, connect it to any call (PSTN, SIP, WebRTC, WhatsApp) and control playback from a Large Language Model (LLM) or other source, all inside VoxEngine.

voximplant kit podcast voximplant-kit-cc-news product management voximplant-kit-automation-news web sdk webrtc video kit-updates call center ios sdk sip voximplant pstn api

Tag: ASR

OpenAI Client update: gpt-realtime GA alignment

Introducing OpenAI Realtime API Client

What is Automatic Speech Recognition?

Speech synthesis and voice recognition are now twice as fast in Voximplant Kit

Enhanced speech recognition model is now available

Hot Summer Speech-to-Text Updates

High quality Speech Recognition is now available

High quality Text-to-Speech functionality is now available for all VoxImplant developers

Sign Up for a free Voximplant developer account or talk to our experts

Inworld Text-to-Speech now available in Voximplant

New WebSocket privacy feature for compliance-oriented environments

Secrets for secure credential storage

Voximplant Kit updates. January 2025

Extend Cartesia Line Agents to SIP, WhatsApp, and Global Phone Networks

Grok Voice Agent API now available in Voximplant

Your AI coding agent can now build on Voximplant

Cartesia Realtime TTS now available in Voximplant

Sign Up for a free Voximplant developer account or talk to our experts

Tag: ASR

Sign Up for a free Voximplant developer account or talk to our experts

Sign Up for a free Voximplant developer account or talk to our experts

Contact Us