Cartesia TTS
CartesiaTTS is a Patter TTSProvider backed by Cartesia’s /tts/bytes HTTP endpoint. It streams raw PCM_S16LE chunks that drop directly into Patter’s pipeline with no transcoding. The transport is pure aiohttp, so no vendor SDK is required. The default model is sonic-3 (the current GA snapshot, with a ~90 ms TTFB target).
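To make the PCM_S16LE format concrete, here is a minimal sketch of how a raw chunk maps to samples and playback time. It is not Patter's implementation: the helper names `chunk_duration_ms` and `decode_samples` are hypothetical, and mono audio at the default 16000 Hz sample rate is an assumption.

```python
import struct

BYTES_PER_SAMPLE = 2  # PCM_S16LE: signed 16-bit, little-endian


def chunk_duration_ms(chunk: bytes, sample_rate: int = 16000, channels: int = 1) -> float:
    """Playback time of a raw PCM_S16LE chunk (hypothetical helper, mono assumed by default)."""
    frame_bytes = BYTES_PER_SAMPLE * channels
    if len(chunk) % frame_bytes:
        raise ValueError("chunk ends in the middle of a sample frame")
    return len(chunk) * 1000 / (frame_bytes * sample_rate)


def decode_samples(chunk: bytes) -> tuple:
    """Unpack a chunk into signed 16-bit integers; a trailing odd byte is ignored."""
    count = len(chunk) // BYTES_PER_SAMPLE
    return struct.unpack(f"<{count}h", chunk[: count * BYTES_PER_SAMPLE])


if __name__ == "__main__":
    silence = b"\x00" * 640  # 320 mono samples
    print(chunk_duration_ms(silence))
    print(decode_samples(struct.pack("<3h", 0, 1, -1)))
```

Because the stream is headerless, the sample rate is not recoverable from the bytes themselves; the consumer has to know which rate was requested.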
This page covers Cartesia TTS. For Cartesia’s ink-whisper STT, see the Cartesia STT page.
Install
Usage
Use the namespaced import (getpatter.tts.cartesia) or the flat alias (getpatter.CartesiaTTS). Both auto-resolve CARTESIA_API_KEY from the environment when api_key= is omitted.
Models and rates
Cartesia TTS bills per 1,000 characters synthesized. Per-model rates (defaults from getpatter.pricing):
| Model | Rate / 1k chars | Notes |
|---|---|---|
| sonic-3 (default) | $0.030 | GA, ~90 ms TTFB. Drop-in compatible with sonic-2 voice IDs. |
| sonic-2 | $0.030 | Previous flagship. |
| sonic-1 / sonic-english | $0.030 | Legacy English. |
| sonic-multilingual | $0.030 | Multilingual variant. |
Override these defaults with Patter(pricing={"cartesia_tts": {"models": {"sonic-3": {"price": ...}}}}).
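As a rough illustration of per-1,000-character billing, the sketch below estimates the cost of a piece of text from the rates in the table. The function `estimate_cost_usd` and the rate dictionary are hypothetical and are not part of getpatter.pricing. The sketch also assumes that cost scales linearly with character count, with no rounding to whole thousands, which the table does not confirm.

```python
# Rates copied from the table above, in USD per 1,000 characters.
RATES_PER_1K_CHARS = {
    "sonic-3": 0.030,
    "sonic-2": 0.030,
    "sonic-1": 0.030,
    "sonic-english": 0.030,
    "sonic-multilingual": 0.030,
}


def estimate_cost_usd(text: str, model: str = "sonic-3", rates=RATES_PER_1K_CHARS) -> float:
    """Estimate synthesis cost (hypothetical helper; assumes linear pro-rata billing)."""
    if model not in rates:
        raise KeyError(f"no rate known for model {model!r}")
    return len(text) / 1000 * rates[model]


if __name__ == "__main__":
    reply = "Thanks for calling. How can I help you today?"
    print(f"{len(reply)} chars -> ${estimate_cost_usd(reply):.6f}")
```

Passing a different `rates` mapping is one way to see how a negotiated price would change the estimate.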
Languages
The default is language="en". Sonic-3 supports 30+ languages: pass any Cartesia-supported BCP-47 code ("es", "fr", "de", "it", "pt", "ja", "zh", …). Voice cloning works across all of them, so the same voice ID will speak whichever language you target.
Telephony optimization
Use the carrier-aware factories so audio reaches Patter’s telephony adapter at the carrier’s native sample rate, skipping a resample step.
Options
| Option | Default | Notes |
|---|---|---|
| api_key | None | Reads from CARTESIA_API_KEY when omitted. |
| model | "sonic-3" | Any Cartesia TTS model id. |
| voice | "f786b574-..." (Katie) | Cartesia voice id. |
| language | "en" | BCP-47 code. |
| sample_rate | 16000 | 8000, 16000, 22050, 24000, 44100 Hz. |
| speed | None | "fastest" ... "slowest" or float in [0.6, 2.0]. |
| emotion | None | Cartesia emotion preset. |
| volume | None | Float in [0.5, 2.0] (sonic-3 only). |
| base_url | Cartesia API | Override for proxying. |
| api_version | "2025-04-16" | Cartesia API version pin. |
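The constraints in the table can be checked before a request is ever made. The following is a minimal sketch of such a pre-flight check, not Patter's own validation: `CartesiaOptionsSketch` and `validate_options` are hypothetical names, only a subset of the options is modeled, and raising ValueError on a bad value is an assumption about how one might handle it.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional, Union

ALLOWED_SAMPLE_RATES = (8000, 16000, 22050, 24000, 44100)


@dataclass
class CartesiaOptionsSketch:
    """Hypothetical container mirroring part of the options table."""
    api_key: Optional[str] = None
    model: str = "sonic-3"
    language: str = "en"
    sample_rate: int = 16000
    speed: Union[str, float, None] = None
    volume: Optional[float] = None


def validate_options(opts: CartesiaOptionsSketch, env: Optional[Mapping[str, str]] = None) -> str:
    """Check the documented ranges and return the resolved API key."""
    env = os.environ if env is None else env
    # An explicit api_key wins; otherwise fall back to CARTESIA_API_KEY.
    api_key = opts.api_key or env.get("CARTESIA_API_KEY")
    if not api_key:
        raise ValueError("no api_key given and CARTESIA_API_KEY is not set")
    if opts.sample_rate not in ALLOWED_SAMPLE_RATES:
        raise ValueError(f"unsupported sample_rate: {opts.sample_rate}")
    # Named speed presets are passed through unchecked, because the table
    # elides the full list; only numeric speeds are range-checked.
    if isinstance(opts.speed, (int, float)) and not 0.6 <= opts.speed <= 2.0:
        raise ValueError("numeric speed must be within [0.6, 2.0]")
    if opts.volume is not None:
        if opts.model != "sonic-3":
            raise ValueError("volume is documented for sonic-3 only")
        if not 0.5 <= opts.volume <= 2.0:
            raise ValueError("volume must be within [0.5, 2.0]")
    return api_key


if __name__ == "__main__":
    opts = CartesiaOptionsSketch(sample_rate=8000, speed=1.2, volume=1.0)
    print(validate_options(opts, env={"CARTESIA_API_KEY": "example-key"}))
```

Passing `env` explicitly keeps the check easy to exercise in tests without touching the real process environment.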

