Class KinesisStreamingSourceOptions
- All Implemented Interfaces:
- Serializable, SdkPojo, ToCopyableBuilder<KinesisStreamingSourceOptions.Builder, KinesisStreamingSourceOptions>
Additional options for the Amazon Kinesis streaming data source.
Method Summary

| Modifier and Type | Method | Description |
| --- | --- | --- |
| final Boolean | addIdleTimeBetweenReads() | Adds a time delay between two consecutive getRecords operations. |
| final String | addRecordTimestamp() | When this option is set to 'true', the data output will contain an additional column named "__src_timestamp" that indicates the time when the corresponding record was received by the stream. |
| final Boolean | avoidEmptyBatches() | Avoids creating an empty microbatch job by checking for unread data in the Kinesis data stream before the batch is started. |
| static KinesisStreamingSourceOptions.Builder | builder() | |
| final String | classification() | An optional classification. |
| final String | delimiter() | Specifies the delimiter character. |
| final Long | describeShardInterval() | The minimum time interval between two ListShards API calls for your script to consider resharding. |
| final String | emitConsumerLagMetrics() | When this option is set to 'true', for each batch, it will emit to CloudWatch the metrics for the duration between the oldest record received by the stream and the time it arrives in Glue. |
| final String | endpointUrl() | The URL of the Kinesis endpoint. |
| final boolean | equals(Object obj) | |
| final boolean | equalsBySdkFields(Object obj) | Indicates whether some other object is "equal to" this one by SDK fields. |
| final <T> Optional<T> | getValueForField(String fieldName, Class<T> clazz) | |
| final int | hashCode() | |
| final Long | idleTimeBetweenReadsInMs() | The minimum time delay between two consecutive getRecords operations, specified in ms. |
| final Long | maxFetchRecordsPerShard() | The maximum number of records to fetch per shard in the Kinesis data stream per microbatch. |
| final Long | maxFetchTimeInMs() | The maximum time spent for the job executor to read records for the current batch from the Kinesis data stream, specified in milliseconds (ms). |
| final Long | maxRecordPerRead() | The maximum number of records to fetch from the Kinesis data stream in each getRecords operation. |
| final Long | maxRetryIntervalMs() | The maximum cool-off time period (specified in ms) between two retries of a Kinesis Data Streams API call. |
| final Integer | numRetries() | The maximum number of retries for Kinesis Data Streams API requests. |
| final Long | retryIntervalMs() | The cool-off time period (specified in ms) before retrying the Kinesis Data Streams API call. |
| final String | roleArn() | The Amazon Resource Name (ARN) of the role to assume using AWS Security Token Service (AWS STS). |
| final String | roleSessionName() | An identifier for the session assuming the role using AWS STS. |
| static Class<? extends KinesisStreamingSourceOptions.Builder> | serializableBuilderClass() | |
| final StartingPosition | startingPosition() | The starting position in the Kinesis data stream to read data from. |
| final String | startingPositionAsString() | The starting position in the Kinesis data stream to read data from. |
| final Instant | startingTimestamp() | The timestamp of the record in the Kinesis data stream to start reading data from. |
| final String | streamArn() | The Amazon Resource Name (ARN) of the Kinesis data stream. |
| final String | streamName() | The name of the Kinesis data stream. |
| KinesisStreamingSourceOptions.Builder | toBuilder() | Take this object and create a builder that contains all of the current property values of this object. |
| final String | toString() | Returns a string representation of this object. |

Methods inherited from interface software.amazon.awssdk.utils.builder.ToCopyableBuilder: copy
- 
Method Details

endpointUrl
The URL of the Kinesis endpoint.
- Returns:
- The URL of the Kinesis endpoint.
 
- 
streamName
The name of the Kinesis data stream.
- Returns:
- The name of the Kinesis data stream.
 
- 
classification
An optional classification.
- Returns:
- An optional classification.
 
- 
delimiter
Specifies the delimiter character.
- Returns:
- The delimiter character.
 
- 
startingPosition
The starting position in the Kinesis data stream to read data from. The possible values are "latest", "trim_horizon", "earliest", or a timestamp string in UTC format in the pattern yyyy-mm-ddTHH:MM:SSZ (where Z represents a UTC timezone offset with a +/-, for example "2023-04-04T08:00:00-04:00"). The default value is "latest".
Note: Using a timestamp string in UTC format as the value of "startingPosition" is supported only for Glue version 4.0 or later.
If the service returns an enum value that is not available in the current SDK version, startingPosition will return StartingPosition.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available from startingPositionAsString().
- Returns:
- The starting position in the Kinesis data stream to read data from.
 
- 
startingPositionAsString
The starting position in the Kinesis data stream to read data from. The possible values are "latest", "trim_horizon", "earliest", or a timestamp string in UTC format in the pattern yyyy-mm-ddTHH:MM:SSZ (where Z represents a UTC timezone offset with a +/-, for example "2023-04-04T08:00:00-04:00"). The default value is "latest".
Note: Using a timestamp string in UTC format as the value of "startingPosition" is supported only for Glue version 4.0 or later.
If the service returns an enum value that is not available in the current SDK version, startingPosition will return StartingPosition.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available from startingPositionAsString().
- Returns:
- The starting position in the Kinesis data stream to read data from, as the raw string returned by the service.
 
- 
maxFetchTimeInMs
The maximum time spent for the job executor to read records for the current batch from the Kinesis data stream, specified in milliseconds (ms). Multiple GetRecords API calls may be made within this time. The default value is 1000.
- Returns:
- The maximum time, in milliseconds, that the job executor spends reading records for the current batch.
 
- 
maxFetchRecordsPerShard
The maximum number of records to fetch per shard in the Kinesis data stream per microbatch. The default value is 100000.
Note: The client can exceed this limit if the streaming job has already read extra records from Kinesis (in the same get-records call). If MaxFetchRecordsPerShard needs to be strict, it must be a multiple of MaxRecordPerRead.
- Returns:
- The maximum number of records to fetch per shard in the Kinesis data stream per microbatch.
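
The note on multiples can be checked with simple arithmetic. The sketch below is a simplified model, not Glue's actual reader: it assumes every getRecords call returns a full batch of MaxRecordPerRead records, and the class and method names are hypothetical.

```java
// Simplified model (an assumption, not Glue's implementation): each getRecords
// call returns exactly maxRecordPerRead records, and reading stops once the
// per-shard limit has been reached or passed.
class FetchLimitSketch {

    /** Records read for one shard in one microbatch under the model above. */
    static long recordsRead(long maxFetchRecordsPerShard, long maxRecordPerRead) {
        if (maxRecordPerRead <= 0) {
            throw new IllegalArgumentException("maxRecordPerRead must be positive");
        }
        long total = 0;
        while (total < maxFetchRecordsPerShard) {
            total += maxRecordPerRead; // a whole batch arrives at once
        }
        return total;
    }

    public static void main(String[] args) {
        // Defaults: 100000 is a multiple of 10000, so the limit is met exactly.
        System.out.println(recordsRead(100_000, 10_000)); // 100000
        // 100000 is not a multiple of 30000, so the fourth read overshoots.
        System.out.println(recordsRead(100_000, 30_000)); // 120000
    }
}
```

Under this model the limit is strict only when it divides evenly into whole reads, which is the condition the note describes.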
 
- 
maxRecordPerRead
The maximum number of records to fetch from the Kinesis data stream in each getRecords operation. The default value is 10000.
- Returns:
- The maximum number of records to fetch from the Kinesis data stream in each getRecords operation.
 
- 
addIdleTimeBetweenReads
Adds a time delay between two consecutive getRecords operations. The default value is "False". This option is only configurable for Glue version 2.0 and above.
- Returns:
- Whether a time delay is added between two consecutive getRecords operations.
 
- 
idleTimeBetweenReadsInMs
The minimum time delay between two consecutive getRecords operations, specified in ms. The default value is 1000. This option is only configurable for Glue version 2.0 and above.
- Returns:
- The minimum time delay between two consecutive getRecords operations, specified in ms.
 
- 
describeShardInterval
The minimum time interval between two ListShards API calls for your script to consider resharding. The default value is 1s.
- Returns:
- The minimum time interval between two ListShards API calls for your script to consider resharding.
 
- 
numRetries
The maximum number of retries for Kinesis Data Streams API requests. The default value is 3.
- Returns:
- The maximum number of retries for Kinesis Data Streams API requests.
 
- 
retryIntervalMs
The cool-off time period (specified in ms) before retrying the Kinesis Data Streams API call. The default value is 1000.
- Returns:
- The cool-off time period (specified in ms) before retrying the Kinesis Data Streams API call.
 
- 
maxRetryIntervalMs
The maximum cool-off time period (specified in ms) between two retries of a Kinesis Data Streams API call. The default value is 10000.
- Returns:
- The maximum cool-off time period (specified in ms) between two retries of a Kinesis Data Streams API call.
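
This page does not say how the wait grows between retryIntervalMs and maxRetryIntervalMs. Purely as an illustration of how the three retry settings could interact, the sketch below assumes the delay doubles after each attempt and is capped at the maximum. That doubling rule is an assumption, not documented Glue behavior, and the class and method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative only: assumes the delay doubles after each retry and is capped
// at maxRetryIntervalMs. The real backoff policy is not stated in this page.
class RetryScheduleSketch {

    /** Delay (in ms) waited before each of the numRetries retry attempts. */
    static List<Long> delaysMs(int numRetries, long retryIntervalMs, long maxRetryIntervalMs) {
        List<Long> delays = new ArrayList<>();
        long delay = retryIntervalMs;
        for (int i = 0; i < numRetries; i++) {
            delays.add(Math.min(delay, maxRetryIntervalMs));
            delay = Math.min(delay * 2, maxRetryIntervalMs); // assumed doubling, capped
        }
        return delays;
    }

    public static void main(String[] args) {
        // Documented defaults: numRetries = 3, retryIntervalMs = 1000, maxRetryIntervalMs = 10000.
        System.out.println(delaysMs(3, 1000L, 10000L)); // [1000, 2000, 4000]
        // With more retries the cap takes effect.
        System.out.println(delaysMs(5, 1000L, 10000L)); // [1000, 2000, 4000, 8000, 10000]
    }
}
```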
 
- 
avoidEmptyBatches
Avoids creating an empty microbatch job by checking for unread data in the Kinesis data stream before the batch is started. The default value is "False".
- Returns:
- Whether the job checks for unread data in the Kinesis data stream before starting a batch, so that no empty microbatch job is created.
 
- 
streamArn
The Amazon Resource Name (ARN) of the Kinesis data stream.
- Returns:
- The Amazon Resource Name (ARN) of the Kinesis data stream.
 
- 
roleArn
The Amazon Resource Name (ARN) of the role to assume using AWS Security Token Service (AWS STS). This role must have permissions for describe or read record operations for the Kinesis data stream. You must use this parameter when accessing a data stream in a different account. Used in conjunction with "awsSTSSessionName".
- Returns:
- The Amazon Resource Name (ARN) of the role to assume using AWS Security Token Service (AWS STS).
 
- 
roleSessionName
An identifier for the session assuming the role using AWS STS. You must use this parameter when accessing a data stream in a different account. Used in conjunction with "awsSTSRoleARN".
- Returns:
- An identifier for the session assuming the role using AWS STS.
 
- 
addRecordTimestamp
When this option is set to 'true', the data output will contain an additional column named "__src_timestamp" that indicates the time when the corresponding record was received by the stream. The default value is 'false'. This option is supported in Glue version 4.0 or later.
- Returns:
- 'true' if the data output contains the additional "__src_timestamp" column, 'false' otherwise.
 
- 
emitConsumerLagMetrics
When this option is set to 'true', the job emits a metric to CloudWatch for each batch: the duration between the time the oldest record was received by the stream and the time it arrives in Glue. The metric's name is "glue.driver.streaming.maxConsumerLagInMs". The default value is 'false'. This option is supported in Glue version 4.0 or later.
- Returns:
- 'true' if the consumer lag metric is emitted to CloudWatch for each batch, 'false' otherwise.
 
- 
startingTimestamp
The timestamp of the record in the Kinesis data stream to start reading data from. The value is a timestamp string in UTC format in the pattern yyyy-mm-ddTHH:MM:SSZ (where Z represents a UTC timezone offset with a +/-, for example "2023-04-04T08:00:00+08:00").
- Returns:
- The timestamp of the record in the Kinesis data stream to start reading data from.
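
Because the accessor returns an Instant while the documented pattern carries a UTC offset, it can help to see how the two relate. The following sketch uses only java.time from the JDK. The class name is hypothetical, and it assumes the documented pattern corresponds to the ISO-8601 offset date-time format, as the examples on this page suggest.

```java
import java.time.Instant;
import java.time.OffsetDateTime;

// Converts a timestamp with a +/- UTC offset (as in the examples on this page)
// into the Instant it denotes. Hypothetical helper, JDK classes only.
class StartingTimestampSketch {

    static Instant toInstant(String timestampWithOffset) {
        // OffsetDateTime.parse expects ISO-8601 text such as 2023-04-04T08:00:00+08:00.
        return OffsetDateTime.parse(timestampWithOffset).toInstant();
    }

    public static void main(String[] args) {
        System.out.println(toInstant("2023-04-04T08:00:00+08:00")); // 2023-04-04T00:00:00Z
        System.out.println(toInstant("2023-04-04T08:00:00-04:00")); // 2023-04-04T12:00:00Z
    }
}
```

The same local time of day maps to different instants depending on the offset, so the offset should not be omitted from the string.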
 
- 
toBuilder
Description copied from interface: ToCopyableBuilder
Take this object and create a builder that contains all of the current property values of this object.
- Specified by:
- toBuilder in interface ToCopyableBuilder<KinesisStreamingSourceOptions.Builder, KinesisStreamingSourceOptions>
- Returns:
- a builder for type T
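
The copy-and-modify pattern that toBuilder enables can be sketched without the SDK on the classpath. StreamOptionsSketch below is a hypothetical two-field stand-in, not the real KinesisStreamingSourceOptions class. It only mirrors the builder()/toBuilder()/build() shape described above.

```java
// Hypothetical stand-in for an immutable options class with a copyable builder.
// It is NOT the SDK class; only the builder()/toBuilder()/build() shape is mirrored.
final class StreamOptionsSketch {
    private final String streamName;
    private final Long maxFetchTimeInMs;

    private StreamOptionsSketch(Builder b) {
        this.streamName = b.streamName;
        this.maxFetchTimeInMs = b.maxFetchTimeInMs;
    }

    static Builder builder() {
        return new Builder();
    }

    /** Returns a builder pre-filled with all current property values. */
    Builder toBuilder() {
        return new Builder().streamName(streamName).maxFetchTimeInMs(maxFetchTimeInMs);
    }

    String streamName() { return streamName; }
    Long maxFetchTimeInMs() { return maxFetchTimeInMs; }

    static final class Builder {
        private String streamName;
        private Long maxFetchTimeInMs;

        Builder streamName(String value) { this.streamName = value; return this; }
        Builder maxFetchTimeInMs(Long value) { this.maxFetchTimeInMs = value; return this; }
        StreamOptionsSketch build() { return new StreamOptionsSketch(this); }
    }

    public static void main(String[] args) {
        StreamOptionsSketch original = builder().streamName("orders").maxFetchTimeInMs(1000L).build();
        // Change one property; the original object is left untouched.
        StreamOptionsSketch tuned = original.toBuilder().maxFetchTimeInMs(5000L).build();
        System.out.println(original.maxFetchTimeInMs()); // 1000
        System.out.println(tuned.maxFetchTimeInMs());    // 5000
        System.out.println(tuned.streamName());          // orders
    }
}
```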
 
- 
builder
- 
serializableBuilderClass
- 
hashCode
- 
equals
- 
equalsBySdkFields
Description copied from interface: SdkPojo
Indicates whether some other object is "equal to" this one by SDK fields. An SDK field is a modeled, non-inherited field in an SdkPojo class, and is generated based on a service model.
If an SdkPojo class does not have any inherited fields, equalsBySdkFields and equals are essentially the same.
- Specified by:
- equalsBySdkFields in interface SdkPojo
- Parameters:
- obj - the object to be compared with
- Returns:
- true if the other object is equal to this object by SDK fields, false otherwise.
 
- 
toString
- 
getValueForField
- 
sdkFields
 
-