
What is Voice SDK?
Voximplant’s Voice SDK helps you provide a high-quality voice calling experience for your staff and customers.

Voximplant’s Voice SDK helps you provide a high-quality voice calling experience for your staff and customers.

Now Unity developers can use the SDK to embed real-time voice and video communication into VR/AR apps and games in minutes, we will take care of complexity and infrastructure.

If a call is made in non-P2P mode then its media stream goes via our media servers and we can record it if required.

Yep! app for making friendships all around the world is now using Voximplant!

Calls are right inside a chat session for the sake of instant voice connection between sales managers ans customers.

Voximplant now includes a native Cartesia Line / Agents connector that connects any Voximplant call to a Cartesia Line voice agent for real-time, speech-to-speech conversations—over PSTN, SIP, WebRTC, or WhatsApp Business Calling—without building custom media gateways or WebSocket streaming infrastructure.

Connect any Voximplant call to ElevenLabs Conversational AI agents

Voximplant now includes a native Grok module that connects any Voximplant call to xAI’s Grok Voice Agent API for real-time, speech-to-speech conversations. With a single VoxEngine scenario, you can interact via audio with Grok over phone numbers, SIP trunks and infrastructure, WhatsApp Business, or WebRTC into Grok — all without building custom media gateways or WebSocket streaming infrastructure.

New Features in Voximplant Kit: Update overview. We are constantly working to improve our product to make it easier to use and more effective for you. In this update, we have added several useful features. Here’s what’s new:

In this digest, we will bring you the latest updates to Voximplant Kit. We have added support for outbound WhatsApp messages, Mobile chats, support for ElevenLabs neural voices, and new automated campaign settings.

Voximplant now includes a native Deepgram module that connects any Voximplant call to Deepgram’s Voice Agent API for real-time, speech‑to‑speech conversations. You can stream audio from phone numbers, SIP trunks, WhatsApp, or WebRTC into Deepgram’s unified agent environment—combining STT, LLM reasoning, and TTS—and play responses via Voximplant’s serverless runtime with minimal latency.

New integrations for Voice AI have arrived: Google's Gemini 2.0 Flash model, featuring seamless voice-to-voice conversation capabilities and ElevenLabs low-latency streaming speech synthesis are now available for Voximplant developers

Check out the latest useful Voximplant Kit updates — we developed chat analytics, improved call history, added new tools for supervisors, expanded scenario capabilities, and updated the softphone. Below is a brief overview of the essential enhancements.