Soniox STT
SonioxSTT streams PCM audio to the Soniox real-time transcribe WebSocket (wss://stt-rt.soniox.com/transcribe-websocket). Pure aiohttp transport, no vendor SDK required. The adapter accumulates is_final tokens into segments and flushes them when an <end> / <fin> endpoint token is received.
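The accumulate-and-flush behaviour described above can be sketched in a few lines. This is an illustration only, not the adapter's actual code: the token shape (dicts with `text` and `is_final` keys) and the `SegmentAccumulator` name are assumptions made for the example.

```python
from typing import Iterable, List

# Endpoint markers named in the description above.
ENDPOINT_TOKENS = {"<end>", "<fin>"}


class SegmentAccumulator:
    """Hypothetical helper: collects final tokens and flushes on an endpoint token."""

    def __init__(self) -> None:
        self._parts: List[str] = []

    def feed(self, tokens: Iterable[dict]) -> List[str]:
        """Consume one batch of tokens and return any segments completed by it."""
        segments: List[str] = []
        for token in tokens:
            text = token.get("text", "")
            if text in ENDPOINT_TOKENS:
                # Endpoint marker: emit what has been accumulated so far.
                segment = "".join(self._parts).strip()
                self._parts.clear()
                if segment:
                    segments.append(segment)
            elif token.get("is_final"):
                # Only final tokens are kept; non-final ones may still change.
                self._parts.append(text)
        return segments
```

Feeding batches as they arrive yields an empty list until an endpoint token closes the segment, at which point the joined text of all final tokens is returned.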
Install
Usage
Use the namespaced import (getpatter.stt.soniox) or the flat alias (getpatter.SonioxSTT). Both auto-resolve SONIOX_API_KEY from the environment when api_key= is omitted.
Models and rates
Soniox bills per minute of streamed audio. Default rate from getpatter.pricing:
| Model | Rate / min |
|---|---|
| stt-rt-v4 (default) | $0.002 |
| stt-rt-v3 | $0.002 |
| stt-rt-v2 | $0.002 |
$0.12/hr = $0.002/min. Override per-call via Patter(pricing={"soniox": {"price": ...}}).
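The conversion from the hourly to the per-minute rate, and a rough cost estimate for a call, can be worked through as below. The helper name `estimate_stt_cost` is hypothetical, and the sketch assumes simple linear billing with no rounding to whole minutes, which may not match how usage is actually metered.

```python
HOURLY_RATE_USD = 0.12                     # hourly rate quoted above
RATE_PER_MIN_USD = HOURLY_RATE_USD / 60    # 0.12 / 60 = 0.002 USD per minute


def estimate_stt_cost(audio_seconds: float, rate_per_min: float = RATE_PER_MIN_USD) -> float:
    """Hypothetical helper: USD cost for a given duration of streamed audio."""
    return (audio_seconds / 60.0) * rate_per_min


# A five-minute call: 5 min * 0.002 USD/min = 0.01 USD
five_minute_cost = estimate_stt_cost(300)
```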
Languages
Soniox real-time uses language hints rather than a single language code: pass a list of BCP-47 codes that the engine considers when scoring tokens.
enable_language_identification=True (default) attaches a language code to each token. Real-time STT v4 supports 60+ languages; see the Soniox language matrix.
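Because each token carries its own language code when identification is enabled, a caller can work out which language dominates a segment. The sketch below assumes tokens are dicts with `text` and `language` keys; those field names and the `dominant_language` helper are assumptions for illustration, not part of the adapter.

```python
from collections import Counter
from typing import Iterable, Optional


def dominant_language(tokens: Iterable[dict]) -> Optional[str]:
    """Hypothetical helper: most frequent per-token language code, or None."""
    counts = Counter(
        token["language"] for token in tokens if token.get("language")
    )
    if not counts:
        return None  # no token carried a language code
    # most_common(1) returns a one-element list of (code, count) pairs.
    return counts.most_common(1)[0][0]


# Example tokens for a mostly English segment with one Spanish word.
sample_tokens = [
    {"text": "Hello", "language": "en"},
    {"text": " amigo", "language": "es"},
    {"text": " there", "language": "en"},
]
```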
Telephony optimization
For Twilio (mulaw 8 kHz upstream; Patter converts to PCM s16le before STT):
Options
| Option | Default | Notes |
|---|---|---|
| api_key | None | Reads from SONIOX_API_KEY when omitted. |
| model | "stt-rt-v4" | "stt-rt-v4", "stt-rt-v3", "stt-rt-v2". |
| language_hints | None | BCP-47 code list. |
| language_hints_strict | False | Restrict to the supplied hints only. |
| sample_rate | 16000 | 8000, 16000, 24000 Hz. |
| enable_speaker_diarization | False | Server-side speaker IDs. |
| enable_language_identification | True | Per-token language codes. |
| max_endpoint_delay_ms | 500 | Silence (ms) before endpoint, range [500, 3000]. |
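The defaults and allowed values in the table can be checked before an adapter is constructed. The following is a minimal sketch: `build_soniox_options` is a hypothetical helper, not part of getpatter, and whether the adapter itself performs this validation is not stated here.

```python
import os
from typing import Mapping, Optional, Sequence

ALLOWED_MODELS = ("stt-rt-v4", "stt-rt-v3", "stt-rt-v2")
ALLOWED_SAMPLE_RATES = (8000, 16000, 24000)


def build_soniox_options(
    api_key: Optional[str] = None,
    model: str = "stt-rt-v4",
    language_hints: Optional[Sequence[str]] = None,
    language_hints_strict: bool = False,
    sample_rate: int = 16000,
    enable_speaker_diarization: bool = False,
    enable_language_identification: bool = True,
    max_endpoint_delay_ms: int = 500,
    env: Optional[Mapping[str, str]] = None,
) -> dict:
    """Hypothetical helper: validate options against the table and return them."""
    env = os.environ if env is None else env
    # Fall back to SONIOX_API_KEY when api_key is omitted, as the table describes.
    key = api_key or env.get("SONIOX_API_KEY")
    if not key:
        raise ValueError("api_key is missing and SONIOX_API_KEY is not set")
    if model not in ALLOWED_MODELS:
        raise ValueError(f"unsupported model: {model!r}")
    if sample_rate not in ALLOWED_SAMPLE_RATES:
        raise ValueError(f"unsupported sample_rate: {sample_rate}")
    if not 500 <= max_endpoint_delay_ms <= 3000:
        raise ValueError("max_endpoint_delay_ms must be within [500, 3000]")
    return {
        "api_key": key,
        "model": model,
        "language_hints": list(language_hints) if language_hints else None,
        "language_hints_strict": language_hints_strict,
        "sample_rate": sample_rate,
        "enable_speaker_diarization": enable_speaker_diarization,
        "enable_language_identification": enable_language_identification,
        "max_endpoint_delay_ms": max_endpoint_delay_ms,
    }
```

Assuming the constructor keyword names match the table, the returned dict could be unpacked into the adapter, for example SonioxSTT(**options).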

