Module: OpenAI::Models::FineTuning::DpoHyperparameters::Beta

Extended by:
Internal::Type::Union
Defined in:
lib/openai/models/fine_tuning/dpo_hyperparameters.rb

Overview

The beta value for the DPO method. A higher beta value will increase the weight of the penalty between the policy and reference model.

See Also:

Method Summary

Methods included from Internal::Type::Union

==, ===, coerce, dump, hash, inspect, to_sorbet_type, variants

Methods included from Internal::Util::SorbetRuntimeSupport

#const_missing, #define_sorbet_constant!, #sorbet_constant_defined?, #to_sorbet_type, to_sorbet_type

Methods included from Internal::Type::Converter

#coerce, coerce, #dump, dump, #inspect, inspect, type_info