
Voximplant now supports IPv6 | Usage tips
Voximplant has implemented the support of IPv6 for p2p calls. Since there are some usage nuances, we wrote the article on it.

Voximplant has implemented the support of IPv6 for p2p calls. Since there are some usage nuances, we wrote the article on it.

Make sure to use "catch" method on promises!

HD Text-to-Speech from Google powered by WaveNet is now available to all our developers.

Starting on April 25 we change our callPSTN API: “caller id” is no longer optional and must be specified.

To make our platform better we are now removing the limit, so URL length can be longer than 255 characters.

Voximplant now can save audio without compression.

Both our cloud JavaScript engine and Web JavaScript editor now support modern JavaScript syntax.

The latest version of our Web SDK is fully compatible with the new Safari 11 audio/video calls support.

A brand-new messaging API was added across Web, Android and iOS SDK.

Following Google’s release of new Speech API, we are happy to announce improved quality of call records transcription.

We are happy to announce that video calls that use H.264 video codec can now be recorded. Recorded video calls that use H.264 will be stored as mp4 files (calls with video in VP8 format are stored as webm files).

Now Unity developers can use the SDK to embed real-time voice and video communication into VR/AR apps and games in minutes, we will take care of complexity and infrastructure.

Today Ultravox announced they are directly integrating Voximplant into their platform to provide SIP capabilities. The integration builds on Voximplant’s deep telephony and Voice AI tooling

Voximplant now includes a native Cartesia Line / Agents connector that connects any Voximplant call to a Cartesia Line voice agent for real-time, speech-to-speech conversations—over PSTN, SIP, WebRTC, or WhatsApp Business Calling—without building custom media gateways or WebSocket streaming infrastructure.

New integrations for Voice AI have arrived: Google's Gemini 2.0 Flash model, featuring seamless voice-to-voice conversation capabilities and ElevenLabs low-latency streaming speech synthesis are now available for Voximplant developers

Boost your food tech app in 2024! Learn 12 in-app content tricks from a study of 5000+ stories. Personalize, gamify, and use cross-channel messaging for user retention.

Voximplant now includes a native Deepgram module that connects any Voximplant call to Deepgram’s Voice Agent API for real-time, speech‑to‑speech conversations. You can stream audio from phone numbers, SIP trunks, WhatsApp, or WebRTC into Deepgram’s unified agent environment—combining STT, LLM reasoning, and TTS—and play responses via Voximplant’s serverless runtime with minimal latency.

OpenAI has recently announced GA version of their Realtime API that Voximplant now fully supports

Voximplant now includes a native Grok module that connects any Voximplant call to xAI’s Grok Voice Agent API for real-time, speech-to-speech conversations. With a single VoxEngine scenario, you can interact via audio with Grok over phone numbers, SIP trunks and infrastructure, WhatsApp Business, or WebRTC into Grok — all without building custom media gateways or WebSocket streaming infrastructure.

Learn how a Voice AI Orchestration Platform connects LLMs, STT/TTS, turn‑taking, and telephony (PSTN, SIP, WebRTC) to build reliable real‑time voice agents. See benefits, architecture, and how Voximplant helps.