Interface DataIntegrationFlowDatasetOptions.Builder
- All Superinterfaces:
Buildable, CopyableBuilder<DataIntegrationFlowDatasetOptions.Builder,DataIntegrationFlowDatasetOptions>, SdkBuilder<DataIntegrationFlowDatasetOptions.Builder,DataIntegrationFlowDatasetOptions>, SdkPojo
- Enclosing class:
DataIntegrationFlowDatasetOptions
-
Method Summary
Method and Description
- dedupeRecords(Boolean dedupeRecords): The option to perform deduplication on data records sharing the same primary key values.
- dedupeStrategy(Consumer<DataIntegrationFlowDedupeStrategy.Builder> dedupeStrategy): The deduplication strategy used to deduplicate data records of the target dataset that share the same primary key values.
- dedupeStrategy(DataIntegrationFlowDedupeStrategy dedupeStrategy): The deduplication strategy used to deduplicate data records of the target dataset that share the same primary key values.
- loadType(String loadType): The target dataset's data load type.
- loadType(DataIntegrationFlowLoadType loadType): The target dataset's data load type.
Methods inherited from interface software.amazon.awssdk.utils.builder.CopyableBuilder
copy
Methods inherited from interface software.amazon.awssdk.utils.builder.SdkBuilder
applyMutation, build
Methods inherited from interface software.amazon.awssdk.core.SdkPojo
equalsBySdkFields, sdkFieldNameToField, sdkFields
-
Method Details
-
loadType
DataIntegrationFlowDatasetOptions.Builder loadType(String loadType)
The target dataset's data load type. This only affects how source S3 files are selected in the S3-to-dataset flow.
- REPLACE - The target dataset is replaced with the new file added under the source S3 prefix.
- INCREMENTAL - The target dataset is updated with the up-to-date content under the source S3 prefix, incorporating any file additions or removals there.
- Parameters:
loadType
- The target dataset's data load type, REPLACE or INCREMENTAL, with the semantics described above. This only affects how source S3 files are selected in the S3-to-dataset flow.
- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
loadType
DataIntegrationFlowDatasetOptions.Builder loadType(DataIntegrationFlowLoadType loadType)
The target dataset's data load type. This only affects how source S3 files are selected in the S3-to-dataset flow.
- REPLACE - The target dataset is replaced with the new file added under the source S3 prefix.
- INCREMENTAL - The target dataset is updated with the up-to-date content under the source S3 prefix, incorporating any file additions or removals there.
- Parameters:
loadType
- The target dataset's data load type, REPLACE or INCREMENTAL, with the semantics described above. This only affects how source S3 files are selected in the S3-to-dataset flow.
- Returns:
- Returns a reference to this object so that method calls can be chained together.
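The difference between the two load types can be sketched with a small stand-alone model. This is only one reading of the descriptions above, not the service's implementation: the `LoadType` enum, the `LoadTypeSketch` class, and the `selectSourceFiles` method are hypothetical stand-ins and are not part of the SDK.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical stand-in for DataIntegrationFlowLoadType (not the SDK enum).
enum LoadType { REPLACE, INCREMENTAL }

final class LoadTypeSketch {
    // Assumed model of source file selection:
    //  - REPLACE: only the newly added file feeds the target dataset.
    //  - INCREMENTAL: everything currently under the prefix feeds the target dataset,
    //    so additions and removals under the prefix are both reflected.
    static List<String> selectSourceFiles(LoadType loadType,
                                          List<String> filesUnderPrefix,
                                          String newlyAddedFile) {
        switch (loadType) {
            case REPLACE:
                return Collections.singletonList(newlyAddedFile);
            case INCREMENTAL:
                return new ArrayList<>(filesUnderPrefix);
            default:
                throw new IllegalArgumentException("Unknown load type: " + loadType);
        }
    }
}
```

In this model, removing a file from the prefix changes the INCREMENTAL selection on the next run, while REPLACE ignores everything except the latest file.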
-
dedupeRecords
DataIntegrationFlowDatasetOptions.Builder dedupeRecords(Boolean dedupeRecords)
The option to perform deduplication on data records sharing the same primary key values. If disabled, transformed data with duplicate primary key values is ingested into the dataset; for datasets within the asc namespace, such duplicates cause ingestion to fail. If enabled without dedupeStrategy, deduplication retains a random data record among those sharing the same primary key values. If enabled with dedupeStrategy, deduplication follows that strategy.
Note that the target dataset may have partitioning configured. When deduplication is enabled, it deduplicates only against primary keys and retains one record out of each set of duplicates, regardless of partition status.
- Parameters:
dedupeRecords
- Whether to deduplicate data records sharing the same primary key values. The behavior when disabled, enabled without dedupeStrategy, and enabled with dedupeStrategy is described above.
- Returns:
- Returns a reference to this object so that method calls can be chained together.
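As a rough illustration of deduplication without a strategy, the sketch below keeps one record per primary key and ignores the partition. It is a model under stated assumptions, not the service's logic: `DedupeSketch`, `Row`, and `dedupeByPrimaryKey` are hypothetical names, and the sketch keeps the first record seen for determinism, whereas the service is described as retaining a random one.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

final class DedupeSketch {
    // Hypothetical record shape: a primary key, a partition, and a payload.
    static final class Row {
        final String primaryKey;
        final String partition;
        final String value;

        Row(String primaryKey, String partition, String value) {
            this.primaryKey = primaryKey;
            this.partition = partition;
            this.value = value;
        }
    }

    // Keeps exactly one row per primary key. The partition is deliberately not part
    // of the map key, so duplicates in different partitions still collapse to one row.
    // Assumption: the first row seen wins here; the documented behavior is "random".
    static List<Row> dedupeByPrimaryKey(List<Row> rows) {
        Map<String, Row> kept = new LinkedHashMap<>();
        for (Row row : rows) {
            kept.putIfAbsent(row.primaryKey, row);
        }
        return new ArrayList<>(kept.values());
    }
}
```

Because the choice of survivor is not guaranteed without a strategy, callers that care which record wins should configure dedupeStrategy rather than rely on any ordering.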
-
dedupeStrategy
DataIntegrationFlowDatasetOptions.Builder dedupeStrategy(DataIntegrationFlowDedupeStrategy dedupeStrategy)
The deduplication strategy used to deduplicate data records of the target dataset that share the same primary key values. This strategy applies only to target datasets that have primary keys and have the dedupeRecords option enabled. If the transformed data still contains duplicates after the dedupeStrategy evaluation, a random data record is chosen to be retained.
- Parameters:
dedupeStrategy
- The deduplication strategy to apply to records sharing the same primary key values, subject to the conditions described above.
- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
dedupeStrategy
default DataIntegrationFlowDatasetOptions.Builder dedupeStrategy(Consumer<DataIntegrationFlowDedupeStrategy.Builder> dedupeStrategy)
The deduplication strategy used to deduplicate data records of the target dataset that share the same primary key values. This strategy applies only to target datasets that have primary keys and have the dedupeRecords option enabled. If the transformed data still contains duplicates after the dedupeStrategy evaluation, a random data record is chosen to be retained.
This is a convenience method that creates an instance of the DataIntegrationFlowDedupeStrategy.Builder, avoiding the need to create one manually via DataIntegrationFlowDedupeStrategy.builder().
When the Consumer completes, SdkBuilder.build() is called immediately and its result is passed to dedupeStrategy(DataIntegrationFlowDedupeStrategy).
- Parameters:
dedupeStrategy
- a consumer that will call methods on DataIntegrationFlowDedupeStrategy.Builder
- Returns:
- Returns a reference to this object so that method calls can be chained together.
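The consumer-based convenience overload follows a common builder pattern, which can be sketched with stand-in types. The classes below (`StrategySketch`, `OptionsBuilderSketch`) and the `type` property are hypothetical and only mirror the shape of the SDK classes; they are not the generated SDK code.

```java
import java.util.function.Consumer;

// Hypothetical stand-in for DataIntegrationFlowDedupeStrategy.
final class StrategySketch {
    final String type;

    private StrategySketch(String type) {
        this.type = type;
    }

    static Builder builder() {
        return new Builder();
    }

    static final class Builder {
        private String type;

        Builder type(String type) {
            this.type = type;
            return this;
        }

        StrategySketch build() {
            return new StrategySketch(type);
        }
    }
}

// Hypothetical stand-in for DataIntegrationFlowDatasetOptions.Builder.
final class OptionsBuilderSketch {
    StrategySketch strategy;

    // Overload that takes a fully built strategy object.
    OptionsBuilderSketch dedupeStrategy(StrategySketch strategy) {
        this.strategy = strategy;
        return this;
    }

    // Convenience overload: create the nested builder, let the consumer configure it,
    // build it immediately, and delegate to the object-taking overload.
    OptionsBuilderSketch dedupeStrategy(Consumer<StrategySketch.Builder> configure) {
        StrategySketch.Builder nested = StrategySketch.builder();
        configure.accept(nested);
        return dedupeStrategy(nested.build());
    }
}
```

With these stand-ins, `new OptionsBuilderSketch().dedupeStrategy(b -> b.type("EXAMPLE"))` produces the same state as building the strategy first and passing it in, which is the equivalence the convenience method is meant to provide.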