Interface StartMlDataProcessingJobRequest.Builder

  • Method Details

    • id

      A unique identifier for the new job. The default is an autogenerated UUID.

      Parameters:
      id - A unique identifier for the new job. The default is an autogenerated UUID.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • previousDataProcessingJobId

      StartMlDataProcessingJobRequest.Builder previousDataProcessingJobId(String previousDataProcessingJobId)

      The job ID of a completed data processing job run on an earlier version of the data.

      Parameters:
      previousDataProcessingJobId - The job ID of a completed data processing job run on an earlier version of the data.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • inputDataS3Location

      StartMlDataProcessingJobRequest.Builder inputDataS3Location(String inputDataS3Location)

      The URI of the Amazon S3 location where you want SageMaker to download the data needed to run the data processing job.

      Parameters:
      inputDataS3Location - The URI of the Amazon S3 location where you want SageMaker to download the data needed to run the data processing job.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • processedDataS3Location

      StartMlDataProcessingJobRequest.Builder processedDataS3Location(String processedDataS3Location)

      The URI of the Amazon S3 location where you want SageMaker to save the results of a data processing job.

      Parameters:
      processedDataS3Location - The URI of the Amazon S3 location where you want SageMaker to save the results of a data processing job.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • sagemakerIamRoleArn

      StartMlDataProcessingJobRequest.Builder sagemakerIamRoleArn(String sagemakerIamRoleArn)

      The ARN of an IAM role for SageMaker execution. This must be listed in your DB cluster parameter group or an error will occur.

      Parameters:
      sagemakerIamRoleArn - The ARN of an IAM role for SageMaker execution. This must be listed in your DB cluster parameter group or an error will occur.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • neptuneIamRoleArn

      StartMlDataProcessingJobRequest.Builder neptuneIamRoleArn(String neptuneIamRoleArn)

      The Amazon Resource Name (ARN) of an IAM role that SageMaker can assume to perform tasks on your behalf. This must be listed in your DB cluster parameter group or an error will occur.

      Parameters:
      neptuneIamRoleArn - The Amazon Resource Name (ARN) of an IAM role that SageMaker can assume to perform tasks on your behalf. This must be listed in your DB cluster parameter group or an error will occur.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • processingInstanceType

      StartMlDataProcessingJobRequest.Builder processingInstanceType(String processingInstanceType)

      The type of ML instance used during data processing. Its memory should be large enough to hold the processed dataset. The default is the smallest ml.r5 type whose memory is ten times larger than the size of the exported graph data on disk.

      Parameters:
      processingInstanceType - The type of ML instance used during data processing. Its memory should be large enough to hold the processed dataset. The default is the smallest ml.r5 type whose memory is ten times larger than the size of the exported graph data on disk.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • processingInstanceVolumeSizeInGB

      StartMlDataProcessingJobRequest.Builder processingInstanceVolumeSizeInGB(Integer processingInstanceVolumeSizeInGB)

      The disk volume size of the processing instance. Both input data and processed data are stored on disk, so the volume size must be large enough to hold both data sets. The default is 0. If not specified or 0, Neptune ML chooses the volume size automatically based on the data size.

      Parameters:
      processingInstanceVolumeSizeInGB - The disk volume size of the processing instance. Both input data and processed data are stored on disk, so the volume size must be large enough to hold both data sets. The default is 0. If not specified or 0, Neptune ML chooses the volume size automatically based on the data size.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • processingTimeOutInSeconds

      StartMlDataProcessingJobRequest.Builder processingTimeOutInSeconds(Integer processingTimeOutInSeconds)

      Timeout in seconds for the data processing job. The default is 86,400 (1 day).

      Parameters:
      processingTimeOutInSeconds - Timeout in seconds for the data processing job. The default is 86,400 (1 day).
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • modelType

      One of the two model types that Neptune ML currently supports: heterogeneous graph models ( heterogeneous), and knowledge graph (kge). The default is none. If not specified, Neptune ML chooses the model type automatically based on the data.

      Parameters:
      modelType - One of the two model types that Neptune ML currently supports: heterogeneous graph models ( heterogeneous), and knowledge graph (kge). The default is none. If not specified, Neptune ML chooses the model type automatically based on the data.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • configFileName

      StartMlDataProcessingJobRequest.Builder configFileName(String configFileName)

      A data specification file that describes how to load the exported graph data for training. The file is automatically generated by the Neptune export toolkit. The default is training-data-configuration.json.

      Parameters:
      configFileName - A data specification file that describes how to load the exported graph data for training. The file is automatically generated by the Neptune export toolkit. The default is training-data-configuration.json.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • subnets

      The IDs of the subnets in the Neptune VPC. The default is None.

      Parameters:
      subnets - The IDs of the subnets in the Neptune VPC. The default is None.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • subnets

      The IDs of the subnets in the Neptune VPC. The default is None.

      Parameters:
      subnets - The IDs of the subnets in the Neptune VPC. The default is None.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • securityGroupIds

      StartMlDataProcessingJobRequest.Builder securityGroupIds(Collection<String> securityGroupIds)

      The VPC security group IDs. The default is None.

      Parameters:
      securityGroupIds - The VPC security group IDs. The default is None.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • securityGroupIds

      StartMlDataProcessingJobRequest.Builder securityGroupIds(String... securityGroupIds)

      The VPC security group IDs. The default is None.

      Parameters:
      securityGroupIds - The VPC security group IDs. The default is None.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • volumeEncryptionKMSKey

      StartMlDataProcessingJobRequest.Builder volumeEncryptionKMSKey(String volumeEncryptionKMSKey)

      The Amazon Key Management Service (Amazon KMS) key that SageMaker uses to encrypt data on the storage volume attached to the ML compute instances that run the training job. The default is None.

      Parameters:
      volumeEncryptionKMSKey - The Amazon Key Management Service (Amazon KMS) key that SageMaker uses to encrypt data on the storage volume attached to the ML compute instances that run the training job. The default is None.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • s3OutputEncryptionKMSKey

      StartMlDataProcessingJobRequest.Builder s3OutputEncryptionKMSKey(String s3OutputEncryptionKMSKey)

      The Amazon Key Management Service (Amazon KMS) key that SageMaker uses to encrypt the output of the processing job. The default is none.

      Parameters:
      s3OutputEncryptionKMSKey - The Amazon Key Management Service (Amazon KMS) key that SageMaker uses to encrypt the output of the processing job. The default is none.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • overrideConfiguration

      StartMlDataProcessingJobRequest.Builder overrideConfiguration(AwsRequestOverrideConfiguration overrideConfiguration)
      Description copied from interface: AwsRequest.Builder
      Add an optional request override configuration.
      Specified by:
      overrideConfiguration in interface AwsRequest.Builder
      Parameters:
      overrideConfiguration - The override configuration.
      Returns:
      This object for method chaining.
    • overrideConfiguration

      Description copied from interface: AwsRequest.Builder
      Add an optional request override configuration.
      Specified by:
      overrideConfiguration in interface AwsRequest.Builder
      Parameters:
      builderConsumer - A Consumer to which an empty AwsRequestOverrideConfiguration.Builder will be given.
      Returns:
      This object for method chaining.