Class: Aws::SageMaker::Types::ScalingPolicyMetric

Inherits:
Struct
  • Object
show all
Includes:
Aws::Structure
Defined in:
lib/aws-sdk-sagemaker/types.rb

Overview

The metric for a scaling policy.

Constant Summary collapse

SENSITIVE =
[]

Instance Attribute Summary collapse

Instance Attribute Details

#invocations_per_instanceInteger

The number of invocations sent to a model, normalized by ‘InstanceCount` in each ProductionVariant. `1/numberOfInstances` is sent as the value on each request, where `numberOfInstances` is the number of active instances for the ProductionVariant behind the endpoint at the time of the request.

Returns:

  • (Integer)


40156
40157
40158
40159
40160
40161
# File 'lib/aws-sdk-sagemaker/types.rb', line 40156

class ScalingPolicyMetric < Struct.new(
  :invocations_per_instance,
  :model_latency)
  SENSITIVE = []
  include Aws::Structure
end

#model_latencyInteger

The interval of time taken by a model to respond as viewed from SageMaker. This interval includes the local communication times taken to send the request and to fetch the response from the container of a model and the time taken to complete the inference in the container.

Returns:

  • (Integer)


40156
40157
40158
40159
40160
40161
# File 'lib/aws-sdk-sagemaker/types.rb', line 40156

class ScalingPolicyMetric < Struct.new(
  :invocations_per_instance,
  :model_latency)
  SENSITIVE = []
  include Aws::Structure
end