Class: Aws::Comprehend::Types::DatasetInputDataConfig
- Inherits:
 - 
      Struct
      
        
- Object
 - Struct
 - Aws::Comprehend::Types::DatasetInputDataConfig
 
 
- Includes:
 - Structure
 
- Defined in:
 - lib/aws-sdk-comprehend/types.rb
 
Overview
Specifies the format and location of the input data for the dataset.
Constant Summary collapse
- SENSITIVE =
 []
Instance Attribute Summary collapse
- 
  
    
      #augmented_manifests  ⇒ Array<Types::DatasetAugmentedManifestsListItem> 
    
    
  
  
  
  
    
    
  
  
  
  
  
  
    
A list of augmented manifest files that provide training data for your custom model.
 - 
  
    
      #data_format  ⇒ String 
    
    
  
  
  
  
    
    
  
  
  
  
  
  
    
‘COMPREHEND_CSV`: The data format is a two-column CSV file, where the first column contains labels and the second column contains documents.
 - 
  
    
      #document_classifier_input_data_config  ⇒ Types::DatasetDocumentClassifierInputDataConfig 
    
    
  
  
  
  
    
    
  
  
  
  
  
  
    
The input properties for training a document classifier model.
 - 
  
    
      #entity_recognizer_input_data_config  ⇒ Types::DatasetEntityRecognizerInputDataConfig 
    
    
  
  
  
  
    
    
  
  
  
  
  
  
    
The input properties for training an entity recognizer model.
 
Instance Attribute Details
#augmented_manifests ⇒ Array<Types::DatasetAugmentedManifestsListItem>
A list of augmented manifest files that provide training data for your custom model. An augmented manifest file is a labeled dataset that is produced by Amazon SageMaker Ground Truth.
      1737 1738 1739 1740 1741 1742 1743 1744  | 
    
      # File 'lib/aws-sdk-comprehend/types.rb', line 1737 class DatasetInputDataConfig < Struct.new( :augmented_manifests, :data_format, :document_classifier_input_data_config, :entity_recognizer_input_data_config) SENSITIVE = [] include Aws::Structure end  | 
  
#data_format ⇒ String
‘COMPREHEND_CSV`: The data format is a two-column CSV file, where the first column contains labels and the second column contains documents.
‘AUGMENTED_MANIFEST`: The data format
      1737 1738 1739 1740 1741 1742 1743 1744  | 
    
      # File 'lib/aws-sdk-comprehend/types.rb', line 1737 class DatasetInputDataConfig < Struct.new( :augmented_manifests, :data_format, :document_classifier_input_data_config, :entity_recognizer_input_data_config) SENSITIVE = [] include Aws::Structure end  | 
  
#document_classifier_input_data_config ⇒ Types::DatasetDocumentClassifierInputDataConfig
The input properties for training a document classifier model.
For more information on how the input file is formatted, see
- Preparing training data][1
 - 
in the Comprehend Developer Guide.
 
[1]: docs.aws.amazon.com/comprehend/latest/dg/prep-classifier-data.html
      1737 1738 1739 1740 1741 1742 1743 1744  | 
    
      # File 'lib/aws-sdk-comprehend/types.rb', line 1737 class DatasetInputDataConfig < Struct.new( :augmented_manifests, :data_format, :document_classifier_input_data_config, :entity_recognizer_input_data_config) SENSITIVE = [] include Aws::Structure end  | 
  
#entity_recognizer_input_data_config ⇒ Types::DatasetEntityRecognizerInputDataConfig
The input properties for training an entity recognizer model.
      1737 1738 1739 1740 1741 1742 1743 1744  | 
    
      # File 'lib/aws-sdk-comprehend/types.rb', line 1737 class DatasetInputDataConfig < Struct.new( :augmented_manifests, :data_format, :document_classifier_input_data_config, :entity_recognizer_input_data_config) SENSITIVE = [] include Aws::Structure end  |