Private voice AI infrastructure

aywa
runtime

Deploy production AI phone agents on your own VPS, cloud, or Kubernetes cluster. Keep your providers, data flow, call capacity, and runtime updates under your control. Calls, transcripts, recordings, and provider secrets stay inside the infrastructure you deploy.

1+ instances by plan
BYO LLM, STT, TTS, SIP
HA Postgres, Redis, S3
Local audio, transcripts, secrets
OpenAI Deepgram ElevenLabs Cartesia ClickHouse
prod-eu-01
$ aywa deploy --target vps-01
runtime registered
license active
voice pipeline ready
clickhouse sink connected
sip edge reachable

Runtime integrations

Adapters already wired into the runtime.

The provider grid reflects what the runtime can actually route today: model adapters, OpenAI-compatible LLM routes, streaming STT, telephony-ready TTS, carrier/SIP/WebRTC, and the production data plane.

LLM and reasoning

Model layer

OpenAI Azure OpenAI Anthropic Google Gemini Groq Mistral Together AI OpenRouter DeepInfra Perplexity Cerebras xAI Anyscale Inflection Custom LLM
Speech in

Streaming STT

Deepgram nova-3 AssemblyAI Gladia Speechmatics Soniox
Speech out

Telephony TTS

ElevenLabs Cartesia OpenAI TTS Deepgram Aura LMNT Azure Speech PlayHT RimeAI
Carrier edge

Telephony and media

BYO SIP FreeSWITCH Twilio Telnyx Vonage LiveKit Daily
State and analytics

Runtime data plane

Postgres Redis / Valkey ClickHouse S3 R2 MinIO Supabase
Ops and security

Production controls

API keys Webhook signing Outbox retries Tool allowlists Call leases Readiness gates Support bundles Signed releases

Voice experience engine

Optimized for the turn, not married to one provider.

The runtime is built around the real-time phone experience: deciding when the user has finished speaking, streaming the model early, keeping TTS playable on telephony, and measuring latency across every stage.

TM Turn management

Semantic turn classifier, duplicate final suppression, line-check filtering, assistant-active deferral, and fast opening/post-opening paths.

VQ Voice quality

Telephony audio normalization, loudness leveling, soft limiter, speed adjustment, TTS continuity, and a single-output call mixer.

LT Latency controls

STT preconnect, streaming LLM first-delta tracking, segmented TTS playout, provider retries, fallback voices, and runtime SLO metrics.

WH Tools and webhooks

Function tools, native call controls, signed server messages, webhook outbox retries, redacted logs, and outbound request allowlists.

Why private runtime

For teams outgrowing hosted voice-agent constraints.

Hosted platforms are excellent for prototypes and managed launch speed. Aywa Runtime is built for teams that are ready to own the execution layer: cost model, capacity planning, provider routing, privacy boundary, and production debugging.

CM Cost model clarity

License the runtime separately and keep STT, LLM, TTS, telephony, storage, and analytics billing with the providers you choose.

CP Capacity under your control

Scale on your VPS, cloud, or Kubernetes design instead of treating every concurrency decision as a hosted-plan constraint.

DB Explicit data boundary

Audio, transcripts, recordings, tool payloads, provider secrets, and runtime logs stay in the infrastructure you operate.

DX Production debugging

Inspect calls from inside the runtime: provider timing, endpointing, tool attempts, webhook retries, and redacted support bundles.

Architecture

Control plane outside. Runtime inside your infrastructure.

The SaaS manages licensing, releases, instance health, and migration workflows. The call path, provider secrets, recordings, and telephony edge stay in the customer deployment.

Aywa SaaS Control plane
LS
License server signed activation, grace period
RG
Private registry signed images, stable channel
MC
Migration center imports, mappings, dry-runs
Customer VPC / VPS / Kubernetes Private runtime plane
Voice ingress SIP trunk Twilio / Telnyx / Vonage WebRTC bridge
aywa runtime Real-time turn engine

STT, LLM streaming, TTS, barge-in, tools, webhooks, retries, and call state stay in process.

Turn manager Tool runner Webhook outbox Fallbacks
BYO providers OpenAI / Anthropic / Gemini Deepgram / Gladia / AssemblyAI ElevenLabs / Cartesia / OpenAI TTS
Postgres assistants, calls, config
Redis leases, realtime state
ClickHouse latency and quality metrics
S3 / R2 / MinIO recordings and artifacts
No audio or transcripts in Aywa SaaS Provider secrets remain local Support bundles redact by default

Data residency and GDPR posture

Your calls stay in your infrastructure.

The call path, audio, transcripts, recordings, provider credentials, and webhook payloads are processed by the runtime you deploy. Aywa Control Plane only needs licensing, release, health, and migration metadata.

EU Data stays where you deploy

Run the runtime, database, object storage, analytics, and SIP edge in your chosen region or VPS.

AV No call audio through Aywa SaaS

The control plane does not sit in the media path for phone calls, recordings, or transcripts.

SK Provider secrets remain local

OpenAI, Deepgram, ElevenLabs, SIP, storage, and webhook secrets belong to the customer deployment.

RT Retention is controlled by you

Use your own Postgres, ClickHouse, and S3-compatible storage policies for logs and artifacts.

Deployment

One command for trials. Hardened paths for production.

Start on a VPS, then graduate to multi-node deployments with Postgres documents, Redis leases, S3-compatible artifacts, ClickHouse analytics, and controlled release channels.

install.aywaruntime.com
curl -fsSL https://install.aywaruntime.com | \
  sudo AYWA_TOKEN="$AYWA_INSTALL_TOKEN" bash

aywa status
aywa update --channel stable
aywa support-bundle --redact

Migration center

Bring existing assistant configurations into Aywa.

Preview imports, map source IDs to runtime IDs, reconnect redacted credentials, and convert imported resources into Aywa's own runtime schema.

01 Connect source

Use a short-lived source API key for a preview or import request.

02 Preview mapping

Inspect assistants, tools, credentials, files, numbers, webhooks, and ID remaps before writing.

03 Run on your runtime

Reconnect secrets, attach providers, and place calls through the private deployment.

Private runtime pricing

Start monthly. Commit annually when it is production.

License the runtime, run it where you want, and keep provider billing between you and your providers. No Aywa platform markup is added to call minutes.

Trial
$0 / 14 days

No credit card required for qualified technical evaluations.

One runtime instance to validate deployment, import flow, provider credentials, and call quality.

  • 1 private runtime instance
  • Installer and Docker Compose
  • Trial license token
Start trial
Starter
$49 / mo

Billed monthly for teams validating private runtime operations.

For dev, staging, and small production deployments that need a private runtime.

  • 1 runtime instance
  • Stable release channel
  • Stable updates
Choose Starter
Pro
$99 / mo

Billed monthly when you want production without annual approval friction.

Two private runtime instances for production teams that want predictable infra pricing.

  • 2 runtime instances
  • Private registry access
  • Email support and migration tooling
Choose Pro
Agency
$249 / mo

Billed monthly for agencies standardizing private deployments.

For teams managing multiple client deployments with repeatable templates and support needs.

  • 5 runtime instances
  • Reusable deployment templates
  • Priority support
Choose Agency
Enterprise
Custom

Monthly or annual terms available for larger deployments.

For regulated teams that need onboarding, SLA, audit, private builds, or custom hardening.

  • 10+ runtime instances
  • SLA and onboarding
  • Private release policies
Contact sales