Class: Aws::Comprehend::Types::DatasetInputDataConfig

Inherits:
Struct
  • Object
show all
Includes:
Structure
Defined in:
lib/aws-sdk-comprehend/types.rb

Overview

Specifies the format and location of the input data for the dataset.

Constant Summary collapse

SENSITIVE =
[]

Instance Attribute Summary collapse

Instance Attribute Details

#augmented_manifestsArray<Types::DatasetAugmentedManifestsListItem>

A list of augmented manifest files that provide training data for your custom model. An augmented manifest file is a labeled dataset that is produced by Amazon SageMaker Ground Truth.



1716
1717
1718
1719
1720
1721
1722
1723
# File 'lib/aws-sdk-comprehend/types.rb', line 1716

class DatasetInputDataConfig < Struct.new(
  :augmented_manifests,
  :data_format,
  :document_classifier_input_data_config,
  :entity_recognizer_input_data_config)
  SENSITIVE = []
  include Aws::Structure
end

#data_formatString

‘COMPREHEND_CSV`: The data format is a two-column CSV file, where the first column contains labels and the second column contains documents.

‘AUGMENTED_MANIFEST`: The data format

Returns:

  • (String)


1716
1717
1718
1719
1720
1721
1722
1723
# File 'lib/aws-sdk-comprehend/types.rb', line 1716

class DatasetInputDataConfig < Struct.new(
  :augmented_manifests,
  :data_format,
  :document_classifier_input_data_config,
  :entity_recognizer_input_data_config)
  SENSITIVE = []
  include Aws::Structure
end

#document_classifier_input_data_configTypes::DatasetDocumentClassifierInputDataConfig

The input properties for training a document classifier model.

For more information on how the input file is formatted, see

Preparing training data][1

in the Comprehend Developer Guide.

[1]: docs.aws.amazon.com/comprehend/latest/dg/prep-classifier-data.html



1716
1717
1718
1719
1720
1721
1722
1723
# File 'lib/aws-sdk-comprehend/types.rb', line 1716

class DatasetInputDataConfig < Struct.new(
  :augmented_manifests,
  :data_format,
  :document_classifier_input_data_config,
  :entity_recognizer_input_data_config)
  SENSITIVE = []
  include Aws::Structure
end

#entity_recognizer_input_data_configTypes::DatasetEntityRecognizerInputDataConfig

The input properties for training an entity recognizer model.



1716
1717
1718
1719
1720
1721
1722
1723
# File 'lib/aws-sdk-comprehend/types.rb', line 1716

class DatasetInputDataConfig < Struct.new(
  :augmented_manifests,
  :data_format,
  :document_classifier_input_data_config,
  :entity_recognizer_input_data_config)
  SENSITIVE = []
  include Aws::Structure
end