Class KinesisStreamingSourceOptions
- All Implemented Interfaces:
Serializable,SdkPojo,ToCopyableBuilder<KinesisStreamingSourceOptions.Builder,KinesisStreamingSourceOptions>
Additional options for the Amazon Kinesis streaming data source.
- See Also:
-
Nested Class Summary
Nested Classes -
Method Summary
Modifier and TypeMethodDescriptionfinal BooleanAdds a time delay between two consecutive getRecords operations.final StringWhen this option is set to 'true', the data output will contain an additional column named "__src_timestamp" that indicates the time when the corresponding record received by the stream.final BooleanAvoids creating an empty microbatch job by checking for unread data in the Kinesis data stream before the batch is started.builder()final StringAn optional classification.final StringSpecifies the delimiter character.final LongThe minimum time interval between two ListShards API calls for your script to consider resharding.final StringWhen this option is set to 'true', for each batch, it will emit the metrics for the duration between the oldest record received by the stream and the time it arrives in Glue to CloudWatch.final StringThe URL of the Kinesis endpoint.final booleanfinal booleanequalsBySdkFields(Object obj) Indicates whether some other object is "equal to" this one by SDK fields.final <T> Optional<T> getValueForField(String fieldName, Class<T> clazz) final inthashCode()final LongThe minimum time delay between two consecutive getRecords operations, specified in ms.final LongThe maximum number of records to fetch per shard in the Kinesis data stream per microbatch.final LongThe maximum time spent for the job executor to read records for the current batch from the Kinesis data stream, specified in milliseconds (ms).final LongThe maximum number of records to fetch from the Kinesis data stream in each getRecords operation.final LongThe maximum cool-off time period (specified in ms) between two retries of a Kinesis Data Streams API call.final IntegerThe maximum number of retries for Kinesis Data Streams API requests.final LongThe cool-off time period (specified in ms) before retrying the Kinesis Data Streams API call.final StringroleArn()The Amazon Resource Name (ARN) of the role to assume using AWS Security Token Service (AWS STS).final StringAn identifier for the session assuming the role using AWS STS.static Class<? extends KinesisStreamingSourceOptions.Builder> final StartingPositionThe starting position in the Kinesis data stream to read data from.final StringThe starting position in the Kinesis data stream to read data from.final InstantThe timestamp of the record in the Kinesis data stream to start reading data from.final StringThe Amazon Resource Name (ARN) of the Kinesis data stream.final StringThe name of the Kinesis data stream.Take this object and create a builder that contains all of the current property values of this object.final StringtoString()Returns a string representation of this object.Methods inherited from interface software.amazon.awssdk.utils.builder.ToCopyableBuilder
copy
-
Method Details
-
endpointUrl
The URL of the Kinesis endpoint.
- Returns:
- The URL of the Kinesis endpoint.
-
streamName
The name of the Kinesis data stream.
- Returns:
- The name of the Kinesis data stream.
-
classification
An optional classification.
- Returns:
- An optional classification.
-
delimiter
Specifies the delimiter character.
- Returns:
- Specifies the delimiter character.
-
startingPosition
The starting position in the Kinesis data stream to read data from. The possible values are
"latest","trim_horizon","earliest", or a timestamp string in UTC format in the patternyyyy-mm-ddTHH:MM:SSZ(whereZrepresents a UTC timezone offset with a +/-. For example: "2023-04-04T08:00:00-04:00"). The default value is"latest".Note: Using a value that is a timestamp string in UTC format for "startingPosition" is supported only for Glue version 4.0 or later.
If the service returns an enum value that is not available in the current SDK version,
startingPositionwill returnStartingPosition.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available fromstartingPositionAsString().- Returns:
- The starting position in the Kinesis data stream to read data from. The possible values are
"latest","trim_horizon","earliest", or a timestamp string in UTC format in the patternyyyy-mm-ddTHH:MM:SSZ(whereZrepresents a UTC timezone offset with a +/-. For example: "2023-04-04T08:00:00-04:00"). The default value is"latest".Note: Using a value that is a timestamp string in UTC format for "startingPosition" is supported only for Glue version 4.0 or later.
- See Also:
-
startingPositionAsString
The starting position in the Kinesis data stream to read data from. The possible values are
"latest","trim_horizon","earliest", or a timestamp string in UTC format in the patternyyyy-mm-ddTHH:MM:SSZ(whereZrepresents a UTC timezone offset with a +/-. For example: "2023-04-04T08:00:00-04:00"). The default value is"latest".Note: Using a value that is a timestamp string in UTC format for "startingPosition" is supported only for Glue version 4.0 or later.
If the service returns an enum value that is not available in the current SDK version,
startingPositionwill returnStartingPosition.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available fromstartingPositionAsString().- Returns:
- The starting position in the Kinesis data stream to read data from. The possible values are
"latest","trim_horizon","earliest", or a timestamp string in UTC format in the patternyyyy-mm-ddTHH:MM:SSZ(whereZrepresents a UTC timezone offset with a +/-. For example: "2023-04-04T08:00:00-04:00"). The default value is"latest".Note: Using a value that is a timestamp string in UTC format for "startingPosition" is supported only for Glue version 4.0 or later.
- See Also:
-
maxFetchTimeInMs
The maximum time spent for the job executor to read records for the current batch from the Kinesis data stream, specified in milliseconds (ms). Multiple
GetRecordsAPI calls may be made within this time. The default value is1000.- Returns:
- The maximum time spent for the job executor to read records for the current batch from the Kinesis data
stream, specified in milliseconds (ms). Multiple
GetRecordsAPI calls may be made within this time. The default value is1000.
-
maxFetchRecordsPerShard
The maximum number of records to fetch per shard in the Kinesis data stream per microbatch. Note: The client can exceed this limit if the streaming job has already read extra records from Kinesis (in the same get-records call). If
MaxFetchRecordsPerShardneeds to be strict then it needs to be a multiple ofMaxRecordPerRead. The default value is100000.- Returns:
- The maximum number of records to fetch per shard in the Kinesis data stream per microbatch. Note: The
client can exceed this limit if the streaming job has already read extra records from Kinesis (in the
same get-records call). If
MaxFetchRecordsPerShardneeds to be strict then it needs to be a multiple ofMaxRecordPerRead. The default value is100000.
-
maxRecordPerRead
The maximum number of records to fetch from the Kinesis data stream in each getRecords operation. The default value is
10000.- Returns:
- The maximum number of records to fetch from the Kinesis data stream in each getRecords operation. The
default value is
10000.
-
addIdleTimeBetweenReads
Adds a time delay between two consecutive getRecords operations. The default value is
"False". This option is only configurable for Glue version 2.0 and above.- Returns:
- Adds a time delay between two consecutive getRecords operations. The default value is
"False". This option is only configurable for Glue version 2.0 and above.
-
idleTimeBetweenReadsInMs
The minimum time delay between two consecutive getRecords operations, specified in ms. The default value is
1000. This option is only configurable for Glue version 2.0 and above.- Returns:
- The minimum time delay between two consecutive getRecords operations, specified in ms. The default value
is
1000. This option is only configurable for Glue version 2.0 and above.
-
describeShardInterval
The minimum time interval between two ListShards API calls for your script to consider resharding. The default value is
1s.- Returns:
- The minimum time interval between two ListShards API calls for your script to consider resharding. The
default value is
1s.
-
numRetries
The maximum number of retries for Kinesis Data Streams API requests. The default value is
3.- Returns:
- The maximum number of retries for Kinesis Data Streams API requests. The default value is
3.
-
retryIntervalMs
The cool-off time period (specified in ms) before retrying the Kinesis Data Streams API call. The default value is
1000.- Returns:
- The cool-off time period (specified in ms) before retrying the Kinesis Data Streams API call. The default
value is
1000.
-
maxRetryIntervalMs
The maximum cool-off time period (specified in ms) between two retries of a Kinesis Data Streams API call. The default value is
10000.- Returns:
- The maximum cool-off time period (specified in ms) between two retries of a Kinesis Data Streams API
call. The default value is
10000.
-
avoidEmptyBatches
Avoids creating an empty microbatch job by checking for unread data in the Kinesis data stream before the batch is started. The default value is
"False".- Returns:
- Avoids creating an empty microbatch job by checking for unread data in the Kinesis data stream before the
batch is started. The default value is
"False".
-
streamArn
The Amazon Resource Name (ARN) of the Kinesis data stream.
- Returns:
- The Amazon Resource Name (ARN) of the Kinesis data stream.
-
roleArn
The Amazon Resource Name (ARN) of the role to assume using AWS Security Token Service (AWS STS). This role must have permissions for describe or read record operations for the Kinesis data stream. You must use this parameter when accessing a data stream in a different account. Used in conjunction with
"awsSTSSessionName".- Returns:
- The Amazon Resource Name (ARN) of the role to assume using AWS Security Token Service (AWS STS). This
role must have permissions for describe or read record operations for the Kinesis data stream. You must
use this parameter when accessing a data stream in a different account. Used in conjunction with
"awsSTSSessionName".
-
roleSessionName
An identifier for the session assuming the role using AWS STS. You must use this parameter when accessing a data stream in a different account. Used in conjunction with
"awsSTSRoleARN".- Returns:
- An identifier for the session assuming the role using AWS STS. You must use this parameter when accessing
a data stream in a different account. Used in conjunction with
"awsSTSRoleARN".
-
addRecordTimestamp
When this option is set to 'true', the data output will contain an additional column named "__src_timestamp" that indicates the time when the corresponding record received by the stream. The default value is 'false'. This option is supported in Glue version 4.0 or later.
- Returns:
- When this option is set to 'true', the data output will contain an additional column named "__src_timestamp" that indicates the time when the corresponding record received by the stream. The default value is 'false'. This option is supported in Glue version 4.0 or later.
-
emitConsumerLagMetrics
When this option is set to 'true', for each batch, it will emit the metrics for the duration between the oldest record received by the stream and the time it arrives in Glue to CloudWatch. The metric's name is "glue.driver.streaming.maxConsumerLagInMs". The default value is 'false'. This option is supported in Glue version 4.0 or later.
- Returns:
- When this option is set to 'true', for each batch, it will emit the metrics for the duration between the oldest record received by the stream and the time it arrives in Glue to CloudWatch. The metric's name is "glue.driver.streaming.maxConsumerLagInMs". The default value is 'false'. This option is supported in Glue version 4.0 or later.
-
startingTimestamp
The timestamp of the record in the Kinesis data stream to start reading data from. The possible values are a timestamp string in UTC format of the pattern
yyyy-mm-ddTHH:MM:SSZ(where Z represents a UTC timezone offset with a +/-. For example: "2023-04-04T08:00:00+08:00").- Returns:
- The timestamp of the record in the Kinesis data stream to start reading data from. The possible values
are a timestamp string in UTC format of the pattern
yyyy-mm-ddTHH:MM:SSZ(where Z represents a UTC timezone offset with a +/-. For example: "2023-04-04T08:00:00+08:00").
-
toBuilder
Description copied from interface:ToCopyableBuilderTake this object and create a builder that contains all of the current property values of this object.- Specified by:
toBuilderin interfaceToCopyableBuilder<KinesisStreamingSourceOptions.Builder,KinesisStreamingSourceOptions> - Returns:
- a builder for type T
-
builder
-
serializableBuilderClass
-
hashCode
-
equals
-
equalsBySdkFields
Description copied from interface:SdkPojoIndicates whether some other object is "equal to" this one by SDK fields. An SDK field is a modeled, non-inherited field in anSdkPojoclass, and is generated based on a service model.If an
SdkPojoclass does not have any inherited fields,equalsBySdkFieldsandequalsare essentially the same.- Specified by:
equalsBySdkFieldsin interfaceSdkPojo- Parameters:
obj- the object to be compared with- Returns:
- true if the other object equals to this object by sdk fields, false otherwise.
-
toString
-
getValueForField
-
sdkFields
-