Class: Google::Apis::AiplatformV1beta1::GoogleCloudAiplatformV1beta1ReinforcementTuningSpec

Inherits:
Object
  • Object
show all
Includes:
Core::Hashable, Core::JsonObjectSupport
Defined in:
lib/google/apis/aiplatform_v1beta1/classes.rb,
lib/google/apis/aiplatform_v1beta1/representations.rb,
lib/google/apis/aiplatform_v1beta1/representations.rb

Overview

Tuning spec for Reinforcement Tuning.

Instance Attribute Summary collapse

Instance Method Summary collapse

Constructor Details

#initialize(**args) ⇒ GoogleCloudAiplatformV1beta1ReinforcementTuningSpec

Returns a new instance of GoogleCloudAiplatformV1beta1ReinforcementTuningSpec.



47354
47355
47356
# File 'lib/google/apis/aiplatform_v1beta1/classes.rb', line 47354

def initialize(**args)
   update!(**args)
end

Instance Attribute Details

#composite_reward_configGoogle::Apis::AiplatformV1beta1::GoogleCloudAiplatformV1beta1CompositeReinforcementTuningRewardConfig

Composite reward function configuration for reinforcement tuning. Corresponds to the JSON property compositeRewardConfig



47326
47327
47328
# File 'lib/google/apis/aiplatform_v1beta1/classes.rb', line 47326

def composite_reward_config
  @composite_reward_config
end

#hyper_parametersGoogle::Apis::AiplatformV1beta1::GoogleCloudAiplatformV1beta1ReinforcementTuningHyperParameters

Hyperparameters for Reinforcement Tuning. Corresponds to the JSON property hyperParameters



47331
47332
47333
# File 'lib/google/apis/aiplatform_v1beta1/classes.rb', line 47331

def hyper_parameters
  @hyper_parameters
end

#single_reward_configGoogle::Apis::AiplatformV1beta1::GoogleCloudAiplatformV1beta1SingleReinforcementTuningRewardConfig

SingleReinforcementTuningRewardConfig defines a single reward function configuration for RL tuning. Each reward calculation/evaluation consists of two stages: 1. Stage 1: Parses the part of information important from sample response via regex extract, or simply takes the sample response unmodified. 2. Stage 2: Calls the configured reward scorer to compute the reward. Corresponds to the JSON property singleRewardConfig



47340
47341
47342
# File 'lib/google/apis/aiplatform_v1beta1/classes.rb', line 47340

def single_reward_config
  @single_reward_config
end

#training_dataset_uriString

Cloud Storage path to the file containing training dataset for tuning. The dataset must be formatted as a JSONL file. Corresponds to the JSON property trainingDatasetUri

Returns:

  • (String)


47346
47347
47348
# File 'lib/google/apis/aiplatform_v1beta1/classes.rb', line 47346

def training_dataset_uri
  @training_dataset_uri
end

#validation_dataset_uriString

Cloud Storage path to the file containing validation dataset for tuning. The dataset must be formatted as a JSONL file. Corresponds to the JSON property validationDatasetUri

Returns:

  • (String)


47352
47353
47354
# File 'lib/google/apis/aiplatform_v1beta1/classes.rb', line 47352

def validation_dataset_uri
  @validation_dataset_uri
end

Instance Method Details

#update!(**args) ⇒ Object

Update properties of this object



47359
47360
47361
47362
47363
47364
47365
# File 'lib/google/apis/aiplatform_v1beta1/classes.rb', line 47359

def update!(**args)
  @composite_reward_config = args[:composite_reward_config] if args.key?(:composite_reward_config)
  @hyper_parameters = args[:hyper_parameters] if args.key?(:hyper_parameters)
  @single_reward_config = args[:single_reward_config] if args.key?(:single_reward_config)
  @training_dataset_uri = args[:training_dataset_uri] if args.key?(:training_dataset_uri)
  @validation_dataset_uri = args[:validation_dataset_uri] if args.key?(:validation_dataset_uri)
end