Class: Aws::BedrockAgent::Types::ParsingConfiguration

Inherits:
Struct
  • Object
show all
Includes:
Structure
Defined in:
lib/aws-sdk-bedrockagent/types.rb

Overview

Settings for parsing document contents. By default, the service converts the contents of each document into text before splitting it into chunks. To improve processing of PDF files with tables and images, you can configure the data source to convert the pages of text into images and use a model to describe the contents of each page.

To use a model to parse PDF documents, set the parsing strategy to ‘BEDROCK_FOUNDATION_MODEL` and specify the model to use by ARN. You can also override the default parsing prompt with instructions for how to interpret images and tables in your documents. The following models are supported.

  • Anthropic Claude 3 Sonnet - ‘anthropic.claude-3-sonnet-20240229-v1:0`

  • Anthropic Claude 3 Haiku - ‘anthropic.claude-3-haiku-20240307-v1:0`

You can get the ARN of a model with the action. Standard model usage charges apply for the foundation model parsing strategy.

Constant Summary collapse

SENSITIVE =
[]

Instance Attribute Summary collapse

Instance Attribute Details

#bedrock_foundation_model_configurationTypes::BedrockFoundationModelConfiguration

Settings for a foundation model used to parse documents for a data source.



5550
5551
5552
5553
5554
5555
# File 'lib/aws-sdk-bedrockagent/types.rb', line 5550

class ParsingConfiguration < Struct.new(
  :bedrock_foundation_model_configuration,
  :parsing_strategy)
  SENSITIVE = []
  include Aws::Structure
end

#parsing_strategyString

The parsing strategy for the data source.

Returns:

  • (String)


5550
5551
5552
5553
5554
5555
# File 'lib/aws-sdk-bedrockagent/types.rb', line 5550

class ParsingConfiguration < Struct.new(
  :bedrock_foundation_model_configuration,
  :parsing_strategy)
  SENSITIVE = []
  include Aws::Structure
end