Class: Telnyx::Resources::TextToSpeech

Inherits:

Object

Object
Telnyx::Resources::TextToSpeech


Defined in: lib/telnyx/resources/text_to_speech.rb

Overview

Text to speech streaming command operations

Instance Method Summary

#generate_speech(aws: nil, azure: nil, disable_cache: nil, elevenlabs: nil, language: nil, minimax: nil, output_type: nil, provider: nil, resemble: nil, rime: nil, telnyx: nil, text: nil, text_type: nil, voice: nil, voice_settings: nil, xai: nil, request_options: {}) ⇒ Telnyx::Models::TextToSpeechGenerateSpeechResponse

Some parameter documentation has been truncated; see Models::TextToSpeechGenerateSpeechParams for more details.
#initialize(client:) ⇒ TextToSpeech constructor private

A new instance of TextToSpeech.
#list_voices(api_key: nil, provider: nil, request_options: {}) ⇒ Telnyx::Models::TextToSpeechListVoicesResponse

Retrieve a list of available voices from one or all TTS providers.
#retrieve_speech(audio_format: nil, disable_cache: nil, model_id: nil, provider: nil, socket_id: nil, voice: nil, voice_id: nil, request_options: {}) ⇒ nil

Some parameter documentation has been truncated; see Models::TextToSpeechRetrieveSpeechParams for more details.

Constructor Details

#initialize(client:) ⇒ `TextToSpeech`

This method is part of a private API. You should avoid using this method if possible, as it may be removed or be changed in the future.

Returns a new instance of TextToSpeech.

Parameters:

client (Telnyx::Client)



# File 'lib/telnyx/resources/text_to_speech.rb', line 171

def initialize(client:)
  @client = client
end

Instance Method Details

#generate_speech(aws: nil, azure: nil, disable_cache: nil, elevenlabs: nil, language: nil, minimax: nil, output_type: nil, provider: nil, resemble: nil, rime: nil, telnyx: nil, text: nil, text_type: nil, voice: nil, voice_settings: nil, xai: nil, request_options: {}) ⇒ `Telnyx::Models::TextToSpeechGenerateSpeechResponse`

Some parameter documentation has been truncated; see Models::TextToSpeechGenerateSpeechParams for more details.

Generate synthesized speech audio from text input. Returns audio in the requested format (binary audio stream, base64-encoded JSON, or an audio URL for later retrieval).

Authentication is provided via the standard `Authorization: Bearer <API_KEY>` header.

The `voice` parameter provides a convenient shorthand to specify provider, model, and voice in a single string (e.g. `telnyx.NaturalHD.Alloy` or `Telnyx.Ultra.<voice_id>`). Alternatively, specify `provider` explicitly along with provider-specific parameters.

Supported providers: `aws`, `telnyx`, `azure`, `elevenlabs`, `minimax`, `rime`, `resemble`, `xai`.

The Telnyx `Ultra` model supports 44 languages with emotion control, speed adjustment, and volume control. Use the `telnyx` provider-specific parameters to configure these features.
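
The `voice` shorthand described above can be assembled from its parts. The following is a minimal sketch, not part of the SDK: `voice_shorthand` is a hypothetical helper, and it assumes the segments are joined with dots in the order provider, model, voice, as the examples suggest. Omitting the model follows the two-part form hinted at in the (truncated) `voice` parameter description, which is also an assumption.

```ruby
# Hypothetical helper, not part of the Telnyx SDK: joins provider, optional
# model, and voice ID into the dot-separated string passed as `voice`.
def voice_shorthand(provider:, voice_id:, model_id: nil)
  parts = [provider, model_id, voice_id].compact.map(&:to_s)
  raise ArgumentError, "segments must not be empty" if parts.any?(&:empty?)

  parts.join(".")
end

# prints "telnyx.NaturalHD.Alloy"
puts voice_shorthand(provider: "telnyx", model_id: "NaturalHD", voice_id: "Alloy")
```

Building the string in one place keeps the segment order consistent wherever the shorthand is used.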

Parameters:

aws (::Telnyx::Models::TextToSpeechGenerateSpeechParams::Aws) —

AWS Polly provider-specific parameters.
azure (::Telnyx::Models::TextToSpeechGenerateSpeechParams::Azure) —

Azure Cognitive Services provider-specific parameters.
disable_cache (Boolean) —

When `true`, bypass the audio cache and generate fresh audio.
elevenlabs (::Telnyx::Models::TextToSpeechGenerateSpeechParams::Elevenlabs) —

ElevenLabs provider-specific parameters.
language (String) —

Language code (e.g. `en-US`). Usage varies by provider.
minimax (::Telnyx::Models::TextToSpeechGenerateSpeechParams::Minimax) —

Minimax provider-specific parameters.
output_type (Symbol, ::Telnyx::Models::TextToSpeechGenerateSpeechParams::OutputType) —

Determines the response format. `binary_output` returns raw audio bytes, `base64
provider (Symbol, ::Telnyx::Models::TextToSpeechGenerateSpeechParams::Provider) —

TTS provider. Required unless `voice` is provided.
resemble (::Telnyx::Models::TextToSpeechGenerateSpeechParams::Resemble) —

Resemble AI provider-specific parameters.
rime (::Telnyx::Models::TextToSpeechGenerateSpeechParams::Rime) —

Rime provider-specific parameters.
telnyx (::Telnyx::Models::TextToSpeechGenerateSpeechParams::Telnyx) —

Telnyx provider-specific parameters. Use `voice_speed` and `temperature` for `Na
text (String) —

The text to convert to speech.
text_type (Symbol, ::Telnyx::Models::TextToSpeechGenerateSpeechParams::TextType) —

Text type. Use `ssml` for SSML-formatted input (supported by AWS and Azure).
voice (String) —

Voice identifier in the format `provider.model_id.voice_id` or `provider.voice_i
voice_settings (Hash{Symbol=>Object}) —

Provider-specific voice settings. Contents vary by provider — see provider-speci
xai (::Telnyx::Models::TextToSpeechGenerateSpeechParams::Xai) —

xAI provider-specific parameters.
request_options (Telnyx::RequestOptions, Hash{Symbol=>Object}, nil)

Returns:

(Telnyx::Models::TextToSpeechGenerateSpeechResponse)
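
A call to `generate_speech` might be wrapped as in the sketch below. `synthesize` is a hypothetical helper, and `tts` stands for any object that responds to `generate_speech` with the keyword arguments listed above. The client construction shown in the trailing comment (`Telnyx::Client.new(api_key:)` and a `text_to_speech` accessor) is an assumption that this page does not confirm.

```ruby
# Hypothetical wrapper: `tts` is any object that responds to #generate_speech
# with the documented keyword arguments (e.g. the TextToSpeech resource).
def synthesize(tts, text:, voice: nil, provider: nil, **extra)
  # The parameter docs state that `provider` is required unless `voice` is given.
  if voice.nil? && provider.nil?
    raise ArgumentError, "pass either voice: or provider:"
  end

  # Drop nil values so only the arguments actually supplied are sent.
  args = { text: text, voice: voice, provider: provider }.compact
  tts.generate_speech(**args, **extra)
end

# Assumed wiring (not confirmed by this page):
#   client = Telnyx::Client.new(api_key: ENV["TELNYX_API_KEY"])
#   synthesize(client.text_to_speech, text: "Hello", voice: "telnyx.NaturalHD.Alloy")
```

Checking the `voice`/`provider` rule on the client side surfaces the mistake before a request is made; whether the API rejects such a request itself is not stated here.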

#list_voices(api_key: nil, provider: nil, request_options: {}) ⇒ `Telnyx::Models::TextToSpeechListVoicesResponse`

Retrieve a list of available voices from one or all TTS providers. When `provider` is specified, returns voices for that provider only. Otherwise, returns voices from all providers.

Some providers (ElevenLabs, Resemble) require an API key to list voices.

Parameters:

api_key (String) —

API key for providers that require one to list voices (e.g. ElevenLabs).
provider (Symbol, Telnyx::Models::TextToSpeechListVoicesParams::Provider) —

Filter voices by provider. If omitted, voices from all providers are returned.
request_options (Telnyx::RequestOptions, Hash{Symbol=>Object}, nil)

Returns:

(Telnyx::Models::TextToSpeechListVoicesResponse)
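
The API-key requirement can be checked before calling `list_voices`. The sketch below assumes the provider symbols `:elevenlabs` and `:resemble` (taken from the provider lists on this page) correspond to the two providers named above. `voices_for` and `KEY_REQUIRED_PROVIDERS` are hypothetical names, and raising locally is a design choice of this example, not documented SDK behavior.

```ruby
# Providers named above as requiring an API key to list voices.
KEY_REQUIRED_PROVIDERS = %i[elevenlabs resemble].freeze

# Hypothetical wrapper: `tts` responds to #list_voices(api_key:, provider:).
def voices_for(tts, provider: nil, api_key: nil)
  if provider && KEY_REQUIRED_PROVIDERS.include?(provider.to_sym) && api_key.nil?
    raise ArgumentError, "#{provider} needs api_key: to list voices"
  end

  # Omit nil arguments; with no provider, voices from all providers are returned.
  args = { provider: provider, api_key: api_key }.compact
  tts.list_voices(**args)
end
```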

#retrieve_speech(audio_format: nil, disable_cache: nil, model_id: nil, provider: nil, socket_id: nil, voice: nil, voice_id: nil, request_options: {}) ⇒ `nil`

Some parameter documentation has been truncated; see Models::TextToSpeechRetrieveSpeechParams for more details.

Open a WebSocket connection to stream text and receive synthesized audio in real time. Authentication is provided via the standard `Authorization: Bearer <API_KEY>` header. Send JSON frames with text to synthesize; receive JSON frames containing base64-encoded audio chunks.

Supported providers: `aws`, `telnyx`, `azure`, `murfai`, `minimax`, `rime`, `resemble`, `elevenlabs`, `xai`.

**Connection flow:**

1. Open WebSocket with query parameters specifying provider, voice, and model.
2. Send an initial handshake message `" "` (single space) with optional `voice_settings` to initialize the session.
3. Send text messages as `"Hello world"`.
4. Receive audio chunks as JSON frames with base64-encoded audio.
5. A final frame with `isFinal: true` indicates the end of audio for the current text.

To interrupt and restart synthesis mid-stream, send `true`; the current worker is stopped and a new one is started.

Note: The Telnyx `Ultra` model is not available over WebSocket. Use the HTTP POST `/text-to-speech/speech` endpoint instead.
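
The receiving side of the connection flow can be sketched as follows. This is an illustration only: `collect_audio` is a hypothetical helper, and the `"audio"` key is an assumption, since only `isFinal` is named in the text above. The frames are taken from any enumerable of JSON strings, so the example does not depend on a particular WebSocket library and does not cover the handshake or text frames.

```ruby
require "json"

# Hypothetical helper: decodes base64 audio from JSON frames and stops at the
# first frame with `isFinal: true`. The "audio" key name is an assumption.
def collect_audio(frames, audio_key: "audio")
  audio = String.new(encoding: Encoding::BINARY)
  frames.each do |raw|
    frame = JSON.parse(raw)
    chunk = frame[audio_key]
    # String#unpack1("m") decodes base64 without extra dependencies.
    audio << chunk.unpack1("m") if chunk
    break if frame["isFinal"] == true
  end
  audio
end
```

Stopping at `isFinal` keeps audio for the next text message out of the current buffer.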

Parameters:

audio_format (Symbol, Telnyx::Models::TextToSpeechRetrieveSpeechParams::AudioFormat) —

Audio output format override. Supported for Telnyx models. `pcm` and `wav` are a
disable_cache (Boolean) —

When `true`, bypass the audio cache and generate fresh audio.
model_id (String) —

Model identifier for the chosen provider. Examples: `Natural`, `NaturalHD`, `Ult
provider (Symbol, Telnyx::Models::TextToSpeechRetrieveSpeechParams::Provider) —

TTS provider. Defaults to `telnyx` if not specified. Ignored when `voice` is pro
socket_id (String) —

Client-provided socket identifier for tracking. If not provided, one is generate
voice (String) —

Voice identifier in the format `provider.model_id.voice_id` or `provider.voice_i
voice_id (String) —

Voice identifier for the chosen provider.
request_options (Telnyx::RequestOptions, Hash{Symbol=>Object}, nil)

Returns:

(nil)
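
Because the Telnyx `Ultra` model is not available over WebSocket, a caller may want to reject it before calling `retrieve_speech`. The sketch below uses hypothetical helpers (`ultra_requested?`, `open_stream`) and assumes the model appears either in `model_id` or as the middle segment of a three-part `voice` string. The case-insensitive match is also an assumption, and the provider segment is not checked.

```ruby
# Hypothetical check: true when the Ultra model is named in `model_id` or as
# the middle segment of a three-part `voice` shorthand.
def ultra_requested?(voice: nil, model_id: nil)
  return true if model_id.to_s.casecmp?("ultra")

  segments = voice.to_s.split(".")
  segments.length == 3 && segments[1].casecmp?("ultra")
end

# Hypothetical wrapper: `tts` responds to #retrieve_speech with the keyword
# arguments documented above.
def open_stream(tts, voice: nil, model_id: nil, **extra)
  if ultra_requested?(voice: voice, model_id: model_id)
    raise ArgumentError, "Ultra is not available over WebSocket; use generate_speech"
  end

  args = { voice: voice, model_id: model_id }.compact
  tts.retrieve_speech(**args, **extra)
end
```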
