VOICE MODEL
GPT-4o Realtime
Native real-time audio API. Understands speech directly without intermediate STT, enabling natural voice-to-voice flow.
TRANSPORT
WebRTC + Twilio
WebRTC handles peer-to-peer audio streaming. Twilio bridges PSTN callers to the WebRTC infrastructure.
TEXT-TO-SPEECH
ElevenLabs
Neural TTS with custom voice cloning. Streaming output to keep end-to-end latency under 800ms.
MEMORY
Redis
Session-scoped conversation memory. Persists context between turns so the agent never loses track of the call.
CRM
HubSpot API
Bidirectional sync — reads prior contact history before the call, writes outcomes, scores, and meetings after.
INFRASTRUCTURE
AWS Lambda
Serverless backend for webhook handling and async CRM writes. Auto-scales under production traffic spikes.