> ai voice agent

The AI voice agent that lives in your Slack.

Your AI SDR makes the calls. Slack runs the workflow. Mavrick's voice agent handles outbound qualification, inbound speed-to-lead, and post-call CRM logging on a real shipping stack: Telnyx, LiveKit, OpenAI, Cartesia.

Add Mavrick to Slack → first 10 missions free

Speed-to-lead, solved →

Stack: Telnyx · LiveKit · OpenAI Whisper · GPT-5 · Cartesia (Brandon)

> the category

What an AI voice agent actually is.

An AI voice agent is software that conducts spoken phone conversations using a stack of streaming speech-to-text, large-language-model reasoning, and text-to-speech. Unlike the old IVR (“press 1 for sales”) generation, modern voice agents handle open-ended natural language, can be interrupted mid-sentence, and produce voice output that's increasingly indistinguishable from a human speaker.

The 2026 reference stack converges on a small number of components: OpenAI's Whisper for streaming STT, a frontier LLM (GPT-5 or Claude) for reasoning, a voice-clone provider like Cartesia or ElevenLabs for TTS, and a SIP carrier (Telnyx, Twilio) for the actual phone call. Mavrick runs exactly that stack — and exposes the workflow through Slack rather than through a standalone dashboard.

The category is splitting into two camps. Voice-AI platforms (ElevenLabs, Bland, Vapi, Retell, Synthflow) sell you the voice infrastructure — you build the agent on top. AI coworkers like Mavrick ship the agent assembled, with the Slack-native routing, CRM handoff, compliance gates, and cleared-hot approval architecture pre-wired. Same underlying technology; different unit of value.

> what Mavrick's voice agent does

Six concrete capabilities, shipping.

AI SDR for outbound qualification

Work a target list end-of-day: Mavrick dials each prospect, runs your discovery script, drops a templated voicemail if there is no answer, and books meetings on the assigned reps' calendars when leads qualify. The grind that burns out human SDRs, handled while you sleep.

Inbound speed-to-lead response

Form fill on your site → Slack alert in <1s → Mavrick dials within 60s → qualification → meeting booked → handoff card posted. The 14-year-old speed-to-lead problem, solved structurally because AI is the only thing that can hit <60s consistently. Full deep dive at /for/speed-to-lead.

Appointment reminders + confirmations

Voice reminders 24h and 1h before the meeting, with an optional reschedule. Reduces no-show rates by 20-40% vs SMS-only reminders. Each confirmation or reschedule is posted back to Slack.

Post-call CRM logging

Every Mavrick-handled call writes a structured record into your CRM (HubSpot, Salesforce, Pipedrive, Close, Attio): call summary, qualification answers, disposition, next-step recommendation. Eliminates the 'I'll log it after lunch' drift that decays CRM data quality.

Customer-success check-ins

Quarterly NPS-style check-ins with high-value accounts. Mavrick handles the call, captures sentiment and structured feedback, and escalates to your CS team if anything is flagged. Cheaper than a CSM at-bat for tier-2 accounts; better data than a survey.

Voice-triggered Slack workflows

A customer phones your support number and asks for an account update; Mavrick handles the call and, if the request is beyond the agent's authorized scope, opens a structured Slack thread for your support team. Voice becomes an input channel to your team's Slack workflow.

> the stack, layer by layer

Real components. Named.

Most "AI voice agent" pages describe a black-box stack with no component names. We'll name ours. If anything below stops being accurate, the page is the source of truth — we update it on every stack change.

Carrier: Telnyx
$1.50/mo per DID + $0.005/min US outbound, ~70% cheaper than Twilio at scale. Shared DID pool with area-code matching for local-presence dialing. On-demand provisioning of new area codes available.

Voice rooms + agent runtime: LiveKit Cloud
Hosts the call audio + agent dispatch + SIP participant handoff. Async-aware Python runtime. Cloud-hosted with global PoPs.

Speech-to-text (STT): OpenAI Whisper
Streaming transcription with sub-200ms latency on most inbound segments. Multilingual support available; default scripts and quality bars are English-only as of mid-2026.

Reasoning: GPT-5 (OpenAI)
Workspace-specific discovery script + qualification criteria + disposition logic baked into the system prompt. Interruption handling, partial-response capability, and context retention across multi-turn dialogue.

Text-to-speech (TTS): Cartesia
Default voice 'Brandon' (voice_id 5cad89c9-d88a-4832-89fb-55f2f16d13d3), the A/B-tested winner across 5 alternatives on lead-completion rate. Custom voice clones available on Pilot+ tiers.

Compliance gates: Fail-closed regulatory layer
National DNC + state DNC + workspace DNC + consent verification all gate fail-closed before any dial. State quiet-hours table seeded for 50 states + DC + 5 territories.

Concurrency safety: Postgres-backed slot reservation
Per-workspace concurrency cap (default 5 simultaneous calls). No single workspace can starve carrier capacity. Raised on Enterprise tier with dedicated DID pools.

Observability: Per-call forensics + audit log
Every call writes a structured record: call_id, lead_id, workspace_id, duration, disposition, and a swallowed-exception trace if any failure occurred. 12-month rolling retention, queryable from /settings/audit.
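
To make the per-workspace concurrency cap concrete, here is a minimal sketch of a slot-reservation check. It is an illustration under stated assumptions, not Mavrick's implementation: the table name `call_slots` and the functions `reserve_slot` and `release_slot` are hypothetical, and an in-memory SQLite database stands in for Postgres so the example runs on its own.

```python
import sqlite3

DEFAULT_CAP = 5  # default simultaneous calls per workspace, per the page


def make_db() -> sqlite3.Connection:
    # SQLite is only a stand-in for Postgres so the sketch is self-contained.
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE call_slots ("
        "call_id TEXT PRIMARY KEY, workspace_id TEXT NOT NULL)"
    )
    return conn


def reserve_slot(conn, workspace_id: str, call_id: str, cap: int = DEFAULT_CAP) -> bool:
    """Insert a slot row only if the workspace is still under its cap."""
    with conn:  # commits on success, rolls back on error
        cur = conn.execute(
            "INSERT INTO call_slots (call_id, workspace_id) "
            "SELECT ?, ? WHERE "
            "(SELECT COUNT(*) FROM call_slots WHERE workspace_id = ?) < ?",
            (call_id, workspace_id, workspace_id, cap),
        )
    return cur.rowcount == 1  # 0 rows inserted means the cap was reached


def release_slot(conn, call_id: str) -> None:
    """Free the slot when the call ends."""
    with conn:
        conn.execute("DELETE FROM call_slots WHERE call_id = ?", (call_id,))


conn = make_db()
attempts = [reserve_slot(conn, "ws_demo", f"call_{i}") for i in range(6)]
print(attempts)  # the sixth attempt is refused while five calls are live
```

On a real Postgres deployment, two concurrent transactions could both pass the count check under the default isolation level, so a per-workspace lock (for example `pg_advisory_xact_lock`) or SERIALIZABLE isolation would likely be needed around the reservation.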

> vs voice-AI platforms

Coworker vs platform.

Voice-AI platforms give you maximum flexibility and require an engineering team. Mavrick is the assembled coworker. The right pick depends on your team shape.

Tool	Positioning	Pricing
Mavrick (this page)	Assembled AI coworker: voice + Slack + CRM handoff + compliance + cleared-hot approval, shipped	Free (Recruit); $50/mo (Pilot)
ElevenLabs Conversational AI	Voice-AI platform; you build the agent + integrations + compliance on top	Usage-based, $0.08-$0.30/min
Bland AI	Voice-AI platform with phone-call API; developer-facing	$0.09/min + monthly base
Vapi	Voice-AI infrastructure for builders; conversational pipelines	$0.05-$0.30/min
Retell AI	Voice-AI for SDR + support use cases; developer-focused	Custom + per-minute
Synthflow	Voice AI for SMB sales + support; no-code agent builder	$29-$450/mo tiers

Pricing observed mid-2026 on each vendor's public site. Per-minute pricing on the platforms is usage-based, which compounds quickly at scale; Mavrick's credit model abstracts that into per-mission units that include the carrier pass-through.

> faq

Voice-agent questions answered.

What is an AI voice agent?

An AI voice agent is software that conducts spoken phone conversations using a stack of speech recognition (STT), large language model reasoning, and text-to-speech (TTS). Unlike old-school IVR (interactive voice response), modern AI voice agents handle open-ended natural language, can be interrupted mid-sentence, and produce voice output that's increasingly indistinguishable from a human speaker. The 2026 reference stack is Whisper for STT, a frontier LLM (GPT-5 or Claude) for reasoning, and a voice-clone provider like Cartesia or ElevenLabs for TTS, with a SIP carrier (Telnyx, Twilio) for the actual phone call.

How does Mavrick's AI voice agent work end-to-end?

Inbound flow: form submit on your site → sub-second Slack alert → Mavrick's voice agent dials the lead via Telnyx within 60 seconds → LiveKit room hosts the call → Whisper streams transcription → GPT-5 runs your approved discovery script → Cartesia 'Brandon' voice speaks responses → call ends with meeting booked or disposition logged → structured handoff card posted to your Slack sales channel with transcript, qualification notes, and CRM-record deep link. Outbound flow: similar except the agent dials from a target list instead of from form submits.

What's the actual voice stack — name the components?

Carrier: Telnyx (~$1.50/mo per DID, $0.005/min US outbound; ~70% cheaper than Twilio at scale). Voice rooms + agent runtime: LiveKit Cloud. Speech-to-text: OpenAI Whisper (streaming, ~200ms latency). Reasoning: GPT-5 with your workspace-specific discovery script in the system prompt. Text-to-speech: Cartesia (default voice 'Brandon', voice_id 5cad89c9-d88a-4832-89fb-55f2f16d13d3, A/B tested winner on lead-completion rate). Compliance: National DNC + state DNC + workspace DNC + consent verification, all fail-closed before any dial; per-state quiet-hours table seeded for 50 states + DC + 5 territories.

Is the voice quality actually good or does it sound like a robot?

Cartesia is one of the leading voice-clone providers as of 2026. The default 'Brandon' voice is a neutral US-male tone trained on professional voice talent; on blind A/B against four alternatives it won on lead-completion rate. Most ears can't distinguish Cartesia output from a human speaker in a brief sales call. Custom voice clones (your own founder's voice, an existing sales rep, your brand's spokesperson) are available on Pilot+ tiers — you record 3-5 minutes of clean audio, Cartesia trains the clone, you A/B test it against the default before rollout.

What if the lead asks 'are you a real person?'

Mavrick's voice agent identifies as AI when asked. The 'always pretend to be human' anti-pattern is explicitly banned by Mavrick's Constitution at /constitution. If the lead pushes back on AI, Mavrick offers to schedule a human callback, sends an SMS with a calendar link to the assigned rep, and posts the disposition to the Slack channel so your team can follow up personally. Honesty about being AI is a feature, not a bug — modern buyers respond better to transparent AI than to deception.

How does cleared-hot approval work for a voice agent?

Cleared-hot is a one-time script approval, not a per-call approval. You approve the discovery script, the qualification criteria, the disposition logic, and the handoff rules once — typically during workspace setup. From then on, the voice agent executes consistently against the approved playbook on every call. The approval gate kicks in only when the agent wants to mutate something downstream — book a meeting on a specific rep's calendar, update a CRM stage, send a follow-up SMS. Those are one-click confirmations in Slack. The voice call itself runs against your pre-approved script without per-call approval friction.
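
The split between one-time script approval and per-action confirmation can be sketched as follows. This is a hedged illustration only: the `Action` names, the `ApprovalGate` class, and its return strings are hypothetical and not taken from Mavrick's codebase.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Any, Dict, List


class Action(Enum):
    SPEAK_SCRIPT_LINE = "speak_script_line"  # covered by the one-time script approval
    BOOK_MEETING = "book_meeting"
    UPDATE_CRM_STAGE = "update_crm_stage"
    SEND_FOLLOWUP_SMS = "send_followup_sms"


# Downstream mutations that wait for a one-click confirmation in Slack.
NEEDS_CONFIRMATION = {
    Action.BOOK_MEETING,
    Action.UPDATE_CRM_STAGE,
    Action.SEND_FOLLOWUP_SMS,
}


@dataclass
class ApprovalGate:
    script_approved: bool = False  # flipped once, during workspace setup
    pending: List[Dict[str, Any]] = field(default_factory=list)
    executed: List[Dict[str, Any]] = field(default_factory=list)

    def request(self, action: Action, payload: Dict[str, Any]) -> str:
        if not self.script_approved:
            return "blocked"  # nothing runs before the playbook is approved
        item = {"action": action, "payload": payload}
        if action in NEEDS_CONFIRMATION:
            self.pending.append(item)  # held until a human confirms
            return "pending"
        self.executed.append(item)  # in-call behavior needs no per-call approval
        return "executed"

    def confirm(self, index: int = 0) -> Dict[str, Any]:
        """Simulates the one-click confirmation for a pending mutation."""
        item = self.pending.pop(index)
        self.executed.append(item)
        return item


gate = ApprovalGate(script_approved=True)
print(gate.request(Action.SPEAK_SCRIPT_LINE, {"line": "intro"}))
print(gate.request(Action.BOOK_MEETING, {"rep": "assigned_rep"}))
```

The design point is that the conversation itself never waits on a human, while anything that changes a calendar, a CRM record, or a lead's inbox does.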

Does Mavrick handle outbound dialing, or just inbound speed-to-lead?

Both. Outbound: you provide a target list (HubSpot view, Salesforce report, CSV upload, or a Mavrick-generated list from Apollo/Hunter integrations), set the dialing window, and Mavrick works the list — dialing, qualifying, dropping voicemails per a recorded template if no answer, respecting state quiet-hours, posting disposition per call. Inbound: form submit triggers an immediate dial within 60 seconds (see /for/speed-to-lead for the speed-to-lead-specific deep dive). Both use the same voice stack with workspace-configurable scripts per direction.

What about TCPA, state DNC, and quiet-hours compliance?

Mavrick checks four fail-closed gates before any dial: (1) National DNC list lookup, (2) state-specific DNC list for the lead's normalized state, (3) workspace-specific DNC list (your manually-suppressed numbers), (4) consent verification (opt-in record check against your CRM or webhook). If any gate fails, the dial is blocked, a forensic record is written, and the disposition surfaces in Slack so you can review the suppressed call. Quiet-hours: 50 states + DC + 5 territories seeded with local quiet-hours windows; calls outside the lead's local 8am-9pm window are deferred. You remain the controller of record under TCPA / state telemarketing law, so review the legality of your own lists.
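
A minimal sketch of what fail-closed gating means in practice: a dial is allowed only when every gate explicitly returns True, and an error inside a gate counts as a block. The names here (`check_gates`, `DialDecision`, `within_calling_window`, the sample numbers) are hypothetical, the lookups are plain in-memory sets rather than real DNC registries, and treating 9pm as the first deferred minute is an assumption.

```python
from dataclasses import dataclass
from datetime import time
from typing import Callable, Dict, Optional

# A gate returns True only when the number is clear to dial.
Gate = Callable[[str], bool]


@dataclass
class DialDecision:
    allowed: bool
    blocked_by: Optional[str] = None
    reason: Optional[str] = None


def check_gates(phone: str, gates: Dict[str, Gate]) -> DialDecision:
    """Run gates in order; anything other than an explicit True blocks the dial."""
    for name, gate in gates.items():
        try:
            passed = gate(phone)
        except Exception as exc:  # lookup outage, timeout, bad data: fail closed
            return DialDecision(False, name, f"gate error: {exc}")
        if passed is not True:
            return DialDecision(False, name, "gate did not pass")
    return DialDecision(True)


def within_calling_window(
    lead_local_time: time,
    start: time = time(8, 0),
    end: time = time(21, 0),
) -> bool:
    """True inside the lead's local 8am-9pm window (end treated as exclusive)."""
    return start <= lead_local_time < end


# Hypothetical in-memory data standing in for real lookups.
national_dnc = {"+15550100"}
consented = {"+15550111"}

gates: Dict[str, Gate] = {
    "national_dnc": lambda p: p not in national_dnc,
    "state_dnc": lambda p: True,
    "workspace_dnc": lambda p: True,
    "consent": lambda p: p in consented,
}

print(check_gates("+15550111", gates))  # passes all four gates
print(check_gates("+15550100", gates))  # blocked at the first gate
```

A blocked decision carries the gate name and reason, which is the kind of information a forensic record or Slack disposition would need.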

How is this different from ElevenLabs Conversational AI, Bland AI, Vapi, Retell AI?

Those are voice-AI platforms — you build the agent on top. You wire the LLM, write the script, integrate with your CRM, build your own Slack handoff layer, manage your own DNC compliance, set up your own concurrency caps. Mavrick is the AI coworker — voice is one capability, but it ships pre-wired with Slack-native routing, CRM handoff, compliance gates, state quiet-hours, cleared-hot approval architecture, and per-workspace concurrency caps. If you have an engineering team and a year of runway, building on a voice-AI platform gives you maximum control. If you're a marketing or sales team that wants the voice agent operating tomorrow, Mavrick is the assembled product.

What does it cost?

Recruit (free): 10 missions to evaluate — roughly 5-8 outbound dials or ~15-20 inbound qualifications, including the per-minute Telnyx pass-through. Pilot ($50/mo): 20,000 credits ≈ 60-100 missions/mo. Enterprise: custom — dedicated DID provisioning, raised concurrency caps, 99.9% SLA, custom voice clones, signed DPA. See /pricing for the full breakdown. No per-seat fees on any tier. Carrier costs (Telnyx) are passed through at our rate, no markup.
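
As a rough worked example, the Pilot figures above imply a per-mission range. This sketch only divides the numbers quoted in the answer; the function name `per_mission_range` is hypothetical, and actual credit consumption per mission is not specified on this page.

```python
def per_mission_range(credits: int, price_usd: float, missions_low: int, missions_high: int) -> dict:
    """Implied (min, max) credits and dollars per mission from a monthly allowance."""
    return {
        "credits_per_mission": (credits / missions_high, credits / missions_low),
        "usd_per_mission": (price_usd / missions_high, price_usd / missions_low),
    }


# Pilot tier as quoted: $50/mo, 20,000 credits, roughly 60-100 missions.
pilot = per_mission_range(20_000, 50.0, 60, 100)
low_c, high_c = pilot["credits_per_mission"]
low_d, high_d = pilot["usd_per_mission"]
print(f"{low_c:.0f}-{high_c:.0f} credits per mission")
print(f"${low_d:.2f}-${high_d:.2f} per mission")
```

That works out to roughly 200 to 333 credits, or about $0.50 to $0.83 of subscription, per mission.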

> go deeper

The rest of the voice surface.

Speed to Lead: the 14-year-old B2B problem, solved →

For Sales: the AI SDR funnel center →

Best AI voice agents of 2026: honest ranked listicle →

The Constitution: how Mavrick stays honest on a call →

Pricing: Recruit, Pilot, Enterprise →

3,200+ integrations via managed connectors →

Hear the voice agent on your own phone.

Start free — 10 missions, no credit card. Install Mavrick. Run a test dial. Listen to Brandon qualify a fake lead.

Add Mavrick to Slack → first 10 missions free