Class: Google::Apis::AiplatformV1beta1::GoogleCloudAiplatformV1beta1ReinforcementTuningSpec
- Inherits:
-
Object
- Object
- Google::Apis::AiplatformV1beta1::GoogleCloudAiplatformV1beta1ReinforcementTuningSpec
- Includes:
- Core::Hashable, Core::JsonObjectSupport
- Defined in:
- lib/google/apis/aiplatform_v1beta1/classes.rb,
lib/google/apis/aiplatform_v1beta1/representations.rb,
lib/google/apis/aiplatform_v1beta1/representations.rb
Overview
Tuning spec for Reinforcement Tuning.
Instance Attribute Summary collapse
-
#composite_reward_config ⇒ Google::Apis::AiplatformV1beta1::GoogleCloudAiplatformV1beta1CompositeReinforcementTuningRewardConfig
Composite reward function configuration for reinforcement tuning.
-
#hyper_parameters ⇒ Google::Apis::AiplatformV1beta1::GoogleCloudAiplatformV1beta1ReinforcementTuningHyperParameters
Hyperparameters for Reinforcement Tuning.
-
#single_reward_config ⇒ Google::Apis::AiplatformV1beta1::GoogleCloudAiplatformV1beta1SingleReinforcementTuningRewardConfig
SingleReinforcementTuningRewardConfig defines a single reward function configuration for RL tuning.
-
#training_dataset_uri ⇒ String
Cloud Storage path to the file containing training dataset for tuning.
-
#validation_dataset_uri ⇒ String
Cloud Storage path to the file containing validation dataset for tuning.
Instance Method Summary collapse
-
#initialize(**args) ⇒ GoogleCloudAiplatformV1beta1ReinforcementTuningSpec
constructor
A new instance of GoogleCloudAiplatformV1beta1ReinforcementTuningSpec.
-
#update!(**args) ⇒ Object
Update properties of this object.
Constructor Details
#initialize(**args) ⇒ GoogleCloudAiplatformV1beta1ReinforcementTuningSpec
Returns a new instance of GoogleCloudAiplatformV1beta1ReinforcementTuningSpec.
47354 47355 47356 |
# File 'lib/google/apis/aiplatform_v1beta1/classes.rb', line 47354 def initialize(**args) update!(**args) end |
Instance Attribute Details
#composite_reward_config ⇒ Google::Apis::AiplatformV1beta1::GoogleCloudAiplatformV1beta1CompositeReinforcementTuningRewardConfig
Composite reward function configuration for reinforcement tuning.
Corresponds to the JSON property compositeRewardConfig
47326 47327 47328 |
# File 'lib/google/apis/aiplatform_v1beta1/classes.rb', line 47326 def composite_reward_config @composite_reward_config end |
#hyper_parameters ⇒ Google::Apis::AiplatformV1beta1::GoogleCloudAiplatformV1beta1ReinforcementTuningHyperParameters
Hyperparameters for Reinforcement Tuning.
Corresponds to the JSON property hyperParameters
47331 47332 47333 |
# File 'lib/google/apis/aiplatform_v1beta1/classes.rb', line 47331 def hyper_parameters @hyper_parameters end |
#single_reward_config ⇒ Google::Apis::AiplatformV1beta1::GoogleCloudAiplatformV1beta1SingleReinforcementTuningRewardConfig
SingleReinforcementTuningRewardConfig defines a single reward function
configuration for RL tuning. Each reward calculation/evaluation consists of
two stages: 1. Stage 1: Parses the part of information important from sample
response via regex extract, or simply takes the sample response unmodified. 2.
Stage 2: Calls the configured reward scorer to compute the reward.
Corresponds to the JSON property singleRewardConfig
47340 47341 47342 |
# File 'lib/google/apis/aiplatform_v1beta1/classes.rb', line 47340 def single_reward_config @single_reward_config end |
#training_dataset_uri ⇒ String
Cloud Storage path to the file containing training dataset for tuning. The
dataset must be formatted as a JSONL file.
Corresponds to the JSON property trainingDatasetUri
47346 47347 47348 |
# File 'lib/google/apis/aiplatform_v1beta1/classes.rb', line 47346 def training_dataset_uri @training_dataset_uri end |
#validation_dataset_uri ⇒ String
Cloud Storage path to the file containing validation dataset for tuning. The
dataset must be formatted as a JSONL file.
Corresponds to the JSON property validationDatasetUri
47352 47353 47354 |
# File 'lib/google/apis/aiplatform_v1beta1/classes.rb', line 47352 def validation_dataset_uri @validation_dataset_uri end |
Instance Method Details
#update!(**args) ⇒ Object
Update properties of this object
47359 47360 47361 47362 47363 47364 47365 |
# File 'lib/google/apis/aiplatform_v1beta1/classes.rb', line 47359 def update!(**args) @composite_reward_config = args[:composite_reward_config] if args.key?(:composite_reward_config) @hyper_parameters = args[:hyper_parameters] if args.key?(:hyper_parameters) @single_reward_config = args[:single_reward_config] if args.key?(:single_reward_config) @training_dataset_uri = args[:training_dataset_uri] if args.key?(:training_dataset_uri) @validation_dataset_uri = args[:validation_dataset_uri] if args.key?(:validation_dataset_uri) end |