Class: Google::Apis::AiplatformV1::GoogleCloudAiplatformV1SpeculativeDecodingSpec

Inherits:

Object

Object
Google::Apis::AiplatformV1::GoogleCloudAiplatformV1SpeculativeDecodingSpec

show all

Includes:: Core::Hashable, Core::JsonObjectSupport

Defined in:: lib/google/apis/aiplatform_v1/classes.rb,
lib/google/apis/aiplatform_v1/representations.rb,
lib/google/apis/aiplatform_v1/representations.rb

Overview

Configuration for Speculative Decoding.

Instance Attribute Summary collapse

#draft_model_speculation ⇒ Google::Apis::AiplatformV1::GoogleCloudAiplatformV1SpeculativeDecodingSpecDraftModelSpeculation
Draft model speculation works by using the smaller model to generate candidate tokens for speculative decoding.
#ngram_speculation ⇒ Google::Apis::AiplatformV1::GoogleCloudAiplatformV1SpeculativeDecodingSpecNgramSpeculation
N-Gram speculation works by trying to find matching tokens in the previous prompt sequence and use those as speculation for generating new tokens.
#speculative_token_count ⇒ Fixnum
The number of speculative tokens to generate at each step.

Instance Method Summary collapse

#initialize(**args) ⇒ GoogleCloudAiplatformV1SpeculativeDecodingSpec constructor
A new instance of GoogleCloudAiplatformV1SpeculativeDecodingSpec.
#update!(**args) ⇒ Object
Update properties of this object.

Constructor Details

#initialize(**args) ⇒ `GoogleCloudAiplatformV1SpeculativeDecodingSpec`

Returns a new instance of GoogleCloudAiplatformV1SpeculativeDecodingSpec.



34938
34939
34940

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 34938

def initialize(**args)
   update!(**args)
end

Instance Attribute Details

#draft_model_speculation ⇒ `Google::Apis::AiplatformV1::GoogleCloudAiplatformV1SpeculativeDecodingSpecDraftModelSpeculation`

Draft model speculation works by using the smaller model to generate candidate tokens for speculative decoding. Corresponds to the JSON property draftModelSpeculation

Returns:

(Google::Apis::AiplatformV1::GoogleCloudAiplatformV1SpeculativeDecodingSpecDraftModelSpeculation)



34925
34926
34927

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 34925

def draft_model_speculation
  @draft_model_speculation
end

#ngram_speculation ⇒ `Google::Apis::AiplatformV1::GoogleCloudAiplatformV1SpeculativeDecodingSpecNgramSpeculation`

N-Gram speculation works by trying to find matching tokens in the previous prompt sequence and use those as speculation for generating new tokens. Corresponds to the JSON property ngramSpeculation

Returns:

(Google::Apis::AiplatformV1::GoogleCloudAiplatformV1SpeculativeDecodingSpecNgramSpeculation)



34931
34932
34933

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 34931

def ngram_speculation
  @ngram_speculation
end

#speculative_token_count ⇒ `Fixnum`

The number of speculative tokens to generate at each step. Corresponds to the JSON property speculativeTokenCount

Returns:

(Fixnum)



34936
34937
34938

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 34936

def speculative_token_count
  @speculative_token_count
end

Instance Method Details

#update!(**args) ⇒ `Object`

Update properties of this object

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 34943

def update!(**args)
  @draft_model_speculation = args[:draft_model_speculation] if args.key?(:draft_model_speculation)
  @ngram_speculation = args[:ngram_speculation] if args.key?(:ngram_speculation)
  @speculative_token_count = args[:speculative_token_count] if args.key?(:speculative_token_count)
end

Class: Google::Apis::AiplatformV1::GoogleCloudAiplatformV1SpeculativeDecodingSpec

Overview

Instance Attribute Summary collapse

Instance Method Summary collapse

Constructor Details

#initialize(**args) ⇒ GoogleCloudAiplatformV1SpeculativeDecodingSpec

Instance Attribute Details

#draft_model_speculation ⇒ Google::Apis::AiplatformV1::GoogleCloudAiplatformV1SpeculativeDecodingSpecDraftModelSpeculation

#ngram_speculation ⇒ Google::Apis::AiplatformV1::GoogleCloudAiplatformV1SpeculativeDecodingSpecNgramSpeculation

#speculative_token_count ⇒ Fixnum

Instance Method Details

#update!(**args) ⇒ Object

#initialize(**args) ⇒ `GoogleCloudAiplatformV1SpeculativeDecodingSpec`

#draft_model_speculation ⇒ `Google::Apis::AiplatformV1::GoogleCloudAiplatformV1SpeculativeDecodingSpecDraftModelSpeculation`

#ngram_speculation ⇒ `Google::Apis::AiplatformV1::GoogleCloudAiplatformV1SpeculativeDecodingSpecNgramSpeculation`

#speculative_token_count ⇒ `Fixnum`

#update!(**args) ⇒ `Object`