Interface DataProcessing.Builder
- All Superinterfaces:
- Buildable, CopyableBuilder<DataProcessing.Builder,DataProcessing>, SdkBuilder<DataProcessing.Builder,DataProcessing>, SdkPojo
- Enclosing class:
- DataProcessing
Method Summary
- inputFilter(String inputFilter): A JSONPath expression used to select a portion of the input data to pass to the algorithm.
- joinSource(String joinSource): Specifies the source of the data to join with the transformed data.
- joinSource(JoinSource joinSource): Specifies the source of the data to join with the transformed data.
- outputFilter(String outputFilter): A JSONPath expression used to select a portion of the joined dataset to save in the output file for a batch transform job.
Methods inherited from interface software.amazon.awssdk.utils.builder.CopyableBuilder
- copy
Methods inherited from interface software.amazon.awssdk.utils.builder.SdkBuilder
- applyMutation, build
Methods inherited from interface software.amazon.awssdk.core.SdkPojo
- equalsBySdkFields, sdkFields

Method Details
inputFilter
A JSONPath expression used to select a portion of the input data to pass to the algorithm. Use the InputFilter parameter to exclude fields, such as an ID column, from the input. If you want SageMaker to pass the entire input dataset to the algorithm, accept the default value $.
Examples: "$", "$[1:]", "$.features"
- Parameters:
- inputFilter - A JSONPath expression used to select a portion of the input data to pass to the algorithm. Use the InputFilter parameter to exclude fields, such as an ID column, from the input. If you want SageMaker to pass the entire input dataset to the algorithm, accept the default value $. Examples: "$", "$[1:]", "$.features"
- Returns:
- Returns a reference to this object so that method calls can be chained together.
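
To make the "$[1:]" example concrete, here is a minimal sketch of what that slice selects when one CSV record is viewed as an array of fields. It only illustrates the slice semantics; it is not SageMaker's JSONPath engine. The class name, the helper `sliceFrom`, and the sample record are hypothetical, and the naive comma split assumes fields without quoted commas.

```java
import java.util.Arrays;
import java.util.List;

// Illustration only: mimics what the slice "$[from:]" selects from one CSV row.
class InputFilterSketch {

    // Splits the row into fields and returns every field from index `from` on.
    // A start index past the end yields an empty list.
    static List<String> sliceFrom(String csvRow, int from) {
        List<String> fields = Arrays.asList(csvRow.split(",", -1));
        int start = Math.min(from, fields.size());
        return fields.subList(start, fields.size());
    }

    public static void main(String[] args) {
        // Hypothetical record: an ID column followed by three features.
        String row = "id-001,0.5,1.2,3.4";
        // "$[1:]" drops the ID column and keeps the features.
        System.out.println(String.join(",", sliceFrom(row, 1))); // prints 0.5,1.2,3.4
    }
}
```

With the default "$", the whole record would be passed through unchanged, which corresponds to `sliceFrom(row, 0)` in this sketch.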
 
- 
outputFilter
A JSONPath expression used to select a portion of the joined dataset to save in the output file for a batch transform job. If you want SageMaker to store the entire input dataset in the output file, leave the default value, $. If you specify indexes that aren't within the dimension size of the joined dataset, you get an error.
Examples: "$", "$[0,5:]", "$['id','SageMakerOutput']"
- Parameters:
- outputFilter - A JSONPath expression used to select a portion of the joined dataset to save in the output file for a batch transform job. If you want SageMaker to store the entire input dataset in the output file, leave the default value, $. If you specify indexes that aren't within the dimension size of the joined dataset, you get an error. Examples: "$", "$[0,5:]", "$['id','SageMakerOutput']"
- Returns:
- Returns a reference to this object so that method calls can be chained together.
 
- 
joinSource
Specifies the source of the data to join with the transformed data. The valid values are None and Input. The default value is None, which specifies not to join the input with the transformed data. If you want the batch transform job to join the original input data with the transformed data, set JoinSource to Input. You can specify OutputFilter as an additional filter to select a portion of the joined dataset and store it in the output file.
For JSON or JSONLines objects, such as a JSON array, SageMaker adds the transformed data to the input JSON object in an attribute called SageMakerOutput. The joined result for JSON must be a key-value pair object. If the input is not a key-value pair object, SageMaker creates a new JSON file. In the new JSON file, the input data is stored under the SageMakerInput key and the results are stored in SageMakerOutput.
For CSV data, SageMaker takes each row as a JSON array and joins the transformed data with the input by appending each transformed row to the end of the input. The joined data has the original input data followed by the transformed data, and the output is a CSV file.
For information on how joining is applied, see Workflow for Associating Inferences with Input Records.
- Parameters:
- joinSource - Specifies the source of the data to join with the transformed data. The valid values are None and Input. The default value is None, which specifies not to join the input with the transformed data. If you want the batch transform job to join the original input data with the transformed data, set JoinSource to Input. You can specify OutputFilter as an additional filter to select a portion of the joined dataset and store it in the output file. For JSON or JSONLines objects, such as a JSON array, SageMaker adds the transformed data to the input JSON object in an attribute called SageMakerOutput. The joined result for JSON must be a key-value pair object. If the input is not a key-value pair object, SageMaker creates a new JSON file. In the new JSON file, the input data is stored under the SageMakerInput key and the results are stored in SageMakerOutput. For CSV data, SageMaker takes each row as a JSON array and joins the transformed data with the input by appending each transformed row to the end of the input. The joined data has the original input data followed by the transformed data, and the output is a CSV file. For information on how joining is applied, see Workflow for Associating Inferences with Input Records.
- Returns:
- Returns a reference to this object so that method calls can be chained together.
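
The CSV behavior described above can be sketched in plain Java: the transformed row is appended to the input row, and an output filter shaped like "$[0,5:]" then keeps index 0 plus everything from index 5 on. This is an illustration under assumptions, not SageMaker's implementation. The class, the `join` and `select` helpers, and the sample values are hypothetical, and the bounds check is only a guess at how the documented out-of-range error might surface.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Illustration only: CSV join followed by an index-plus-slice selection.
class CsvJoinSketch {

    // Appends the transformed fields after the input fields.
    static List<String> join(String inputRow, String transformedRow) {
        List<String> joined = new ArrayList<>(Arrays.asList(inputRow.split(",", -1)));
        joined.addAll(Arrays.asList(transformedRow.split(",", -1)));
        return joined;
    }

    // Mimics a selection shaped like "$[first,from:]": one index plus a slice to the end.
    // Throws when either index falls outside the joined row (assumed error behavior).
    static List<String> select(List<String> row, int first, int from) {
        if (first < 0 || first >= row.size() || from < 0 || from >= row.size()) {
            throw new IndexOutOfBoundsException("index outside joined row of size " + row.size());
        }
        List<String> out = new ArrayList<>();
        out.add(row.get(first));
        out.addAll(row.subList(from, row.size()));
        return out;
    }

    public static void main(String[] args) {
        // Hypothetical input: an ID plus four features; the model returns one score.
        List<String> joined = join("id-001,0.1,0.2,0.3,0.4", "0.87");
        System.out.println(String.join(",", joined));               // prints id-001,0.1,0.2,0.3,0.4,0.87
        System.out.println(String.join(",", select(joined, 0, 5))); // prints id-001,0.87
    }
}
```

In this sample the input has five columns, so the appended score lands at index 5 and the selection keeps only the ID and the score.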
 
- 
joinSource
Specifies the source of the data to join with the transformed data. The valid values are None and Input. The default value is None, which specifies not to join the input with the transformed data. If you want the batch transform job to join the original input data with the transformed data, set JoinSource to Input. You can specify OutputFilter as an additional filter to select a portion of the joined dataset and store it in the output file.
For JSON or JSONLines objects, such as a JSON array, SageMaker adds the transformed data to the input JSON object in an attribute called SageMakerOutput. The joined result for JSON must be a key-value pair object. If the input is not a key-value pair object, SageMaker creates a new JSON file. In the new JSON file, the input data is stored under the SageMakerInput key and the results are stored in SageMakerOutput.
For CSV data, SageMaker takes each row as a JSON array and joins the transformed data with the input by appending each transformed row to the end of the input. The joined data has the original input data followed by the transformed data, and the output is a CSV file.
For information on how joining is applied, see Workflow for Associating Inferences with Input Records.
- Parameters:
- joinSource - Specifies the source of the data to join with the transformed data. The valid values are None and Input. The default value is None, which specifies not to join the input with the transformed data. If you want the batch transform job to join the original input data with the transformed data, set JoinSource to Input. You can specify OutputFilter as an additional filter to select a portion of the joined dataset and store it in the output file. For JSON or JSONLines objects, such as a JSON array, SageMaker adds the transformed data to the input JSON object in an attribute called SageMakerOutput. The joined result for JSON must be a key-value pair object. If the input is not a key-value pair object, SageMaker creates a new JSON file. In the new JSON file, the input data is stored under the SageMakerInput key and the results are stored in SageMakerOutput. For CSV data, SageMaker takes each row as a JSON array and joins the transformed data with the input by appending each transformed row to the end of the input. The joined data has the original input data followed by the transformed data, and the output is a CSV file. For information on how joining is applied, see Workflow for Associating Inferences with Input Records.
- Returns:
- Returns a reference to this object so that method calls can be chained together.
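
The JSON rule above has two branches, which the following sketch models with plain Java maps: a key-value input keeps its attributes and gains SageMakerOutput, while any other input is wrapped under SageMakerInput. This is a simplified model of the documented behavior, not SageMaker code. The class name, the `join` helper, and the sample values are hypothetical.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Illustration only: models the documented JSON join with java.util collections.
class JsonJoinSketch {

    @SuppressWarnings("unchecked")
    static Map<String, Object> join(Object input, Object transformed) {
        Map<String, Object> joined = new LinkedHashMap<>();
        if (input instanceof Map) {
            // Key-value input: keep its attributes and add SageMakerOutput.
            joined.putAll((Map<String, Object>) input);
        } else {
            // Any other input, such as an array: store it under SageMakerInput.
            joined.put("SageMakerInput", input);
        }
        joined.put("SageMakerOutput", transformed);
        return joined;
    }

    public static void main(String[] args) {
        // Hypothetical key-value record.
        Map<String, Object> record = new LinkedHashMap<>();
        record.put("id", 7);
        record.put("features", List.of(0.1, 0.2));
        System.out.println(join(record, 0.87));

        // Hypothetical array input, which is not a key-value object.
        System.out.println(join(List.of(0.1, 0.2), 0.87));
    }
}
```

An output filter such as "$['id','SageMakerOutput']" would then pick those two attributes from the first joined object.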