Class: Telnyx::Resources::SpeechToText

Inherits:

Object

Object
Telnyx::Resources::SpeechToText

show all

Defined in:: lib/telnyx/resources/speech_to_text.rb

Instance Method Summary collapse

#initialize(client:) ⇒ SpeechToText constructor private

A new instance of SpeechToText.
#list_providers(provider: nil, service_type: nil, request_options: {}) ⇒ Telnyx::Models::SpeechToTextListProvidersResponse

Some parameter documentations has been truncated, see Models::SpeechToTextListProvidersParams for more details.
#retrieve_transcription(input_format:, transcription_engine:, endpointing: nil, interim_results: nil, keyterm: nil, keywords: nil, language: nil, model: nil, redact: nil, request_options: {}) ⇒ nil

Some parameter documentations has been truncated, see Models::SpeechToTextRetrieveTranscriptionParams for more details.

Constructor Details

#initialize(client:) ⇒ `SpeechToText`

This method is part of a private API. You should avoid using this method if possible, as it may be removed or be changed in the future.

Returns a new instance of SpeechToText.

Parameters:

client (Telnyx::Client)



113
114
115

# File 'lib/telnyx/resources/speech_to_text.rb', line 113

def initialize(client:)
  @client = client
end

Instance Method Details

#list_providers(provider: nil, service_type: nil, request_options: {}) ⇒ `Telnyx::Models::SpeechToTextListProvidersResponse`

Some parameter documentations has been truncated, see Models::SpeechToTextListProvidersParams for more details.

Retrieve the canonical list of supported speech-to-text providers, models, accepted language codes, and the service types each model supports.

Service types:

‘streaming` — standalone WebSocket transcription via `/speech-to-text/transcription`.
‘file_based` — file-based transcription via `/ai/audio/transcriptions`.
‘in_call` — live call transcription via Call Control `transcription_start`.
‘ai_assistant` — STT configured on a Call Control AI Assistant via voice-assistant `TranscriptionConfig` (covers both live-streaming and non-streaming/batch models).

Use this endpoint to discover which (provider, model) combinations are available for the surface you need, and which language codes each accepts. ‘auto` in a `languages` array indicates the provider performs language detection.

Parameters:

provider (Symbol, Telnyx::Models::SpeechToTextListProvidersParams::Provider) —

Filter to entries for a specific STT provider. The enum mirrors the providers ad
service_type (Symbol, Telnyx::Models::SttServiceType) —

Filter to entries that support the given service type. For backward compatibilit
request_options (Telnyx::RequestOptions, Hash{Symbol=>Object}, nil)

Returns:

(Telnyx::Models::SpeechToTextListProvidersResponse)

#retrieve_transcription(input_format:, transcription_engine:, endpointing: nil, interim_results: nil, keyterm: nil, keywords: nil, language: nil, model: nil, redact: nil, request_options: {}) ⇒ `nil`

Some parameter documentations has been truncated, see Models::SpeechToTextRetrieveTranscriptionParams for more details.

Open a WebSocket connection to stream audio and receive transcriptions in real-time. Authentication is provided via the standard ‘Authorization: Bearer <API_KEY>` header.

Supported engines: ‘Azure`, `Deepgram`, `Google`, `Telnyx`, `xAI`, `Speechmatics`, `Soniox`.

**Connection flow:**

Open WebSocket with query parameters specifying engine, input format, and language.
Send binary audio frames (mp3/wav format).
Receive JSON transcript frames with ‘transcript`, `is_final`, and `confidence` fields.
Close connection when done.

Parameters:

input_format (Symbol, Telnyx::Models::SpeechToTextRetrieveTranscriptionParams::InputFormat) —

The format of input audio stream.
transcription_engine (Symbol, Telnyx::Models::SpeechToTextRetrieveTranscriptionParams::TranscriptionEngine) —

The transcription engine to use for processing the audio stream.
endpointing (Integer) —

Silence duration (in milliseconds) that triggers end-of-speech detection. When s
interim_results (Boolean) —

Whether to receive interim transcription results.
keyterm (String) —

A key term to boost in the transcription. The engine will be more likely to reco
keywords (String) —

Comma-separated list of keywords to boost in the transcription. The engine will
language (String) —

The language spoken in the audio stream.
model (Symbol, Telnyx::Models::SpeechToTextRetrieveTranscriptionParams::Model) —

The specific model to use within the selected transcription engine.
redact (String) —

Enable redaction of sensitive information (e.g., PCI data, SSN) from transcripti
request_options (Telnyx::RequestOptions, Hash{Symbol=>Object}, nil)

Returns:

(nil)

Class: Telnyx::Resources::SpeechToText

Instance Method Summary collapse

Constructor Details

#initialize(client:) ⇒ SpeechToText

Instance Method Details

#list_providers(provider: nil, service_type: nil, request_options: {}) ⇒ Telnyx::Models::SpeechToTextListProvidersResponse

#retrieve_transcription(input_format:, transcription_engine:, endpointing: nil, interim_results: nil, keyterm: nil, keywords: nil, language: nil, model: nil, redact: nil, request_options: {}) ⇒ nil

#initialize(client:) ⇒ `SpeechToText`

#list_providers(provider: nil, service_type: nil, request_options: {}) ⇒ `Telnyx::Models::SpeechToTextListProvidersResponse`

#retrieve_transcription(input_format:, transcription_engine:, endpointing: nil, interim_results: nil, keyterm: nil, keywords: nil, language: nil, model: nil, redact: nil, request_options: {}) ⇒ `nil`