Class KinesisStreamingSourceOptions
- All Implemented Interfaces:
Serializable
,SdkPojo
,ToCopyableBuilder<KinesisStreamingSourceOptions.Builder,
KinesisStreamingSourceOptions>
Additional options for the Amazon Kinesis streaming data source.
- See Also:
-
Nested Class Summary
Nested Classes -
Method Summary
Modifier and TypeMethodDescriptionfinal Boolean
Adds a time delay between two consecutive getRecords operations.final String
When this option is set to 'true', the data output will contain an additional column named "__src_timestamp" that indicates the time when the corresponding record received by the stream.final Boolean
Avoids creating an empty microbatch job by checking for unread data in the Kinesis data stream before the batch is started.builder()
final String
An optional classification.final String
Specifies the delimiter character.final Long
The minimum time interval between two ListShards API calls for your script to consider resharding.final String
When this option is set to 'true', for each batch, it will emit the metrics for the duration between the oldest record received by the stream and the time it arrives in Glue to CloudWatch.final String
The URL of the Kinesis endpoint.final boolean
final boolean
equalsBySdkFields
(Object obj) Indicates whether some other object is "equal to" this one by SDK fields.final <T> Optional
<T> getValueForField
(String fieldName, Class<T> clazz) final int
hashCode()
final Long
The minimum time delay between two consecutive getRecords operations, specified in ms.final Long
The maximum number of records to fetch per shard in the Kinesis data stream.final Long
The maximum time spent in the job executor to fetch a record from the Kinesis data stream per shard, specified in milliseconds (ms).final Long
The maximum number of records to fetch from the Kinesis data stream in each getRecords operation.final Long
The maximum cool-off time period (specified in ms) between two retries of a Kinesis Data Streams API call.final Integer
The maximum number of retries for Kinesis Data Streams API requests.final Long
The cool-off time period (specified in ms) before retrying the Kinesis Data Streams API call.final String
roleArn()
The Amazon Resource Name (ARN) of the role to assume using AWS Security Token Service (AWS STS).final String
An identifier for the session assuming the role using AWS STS.static Class
<? extends KinesisStreamingSourceOptions.Builder> final StartingPosition
The starting position in the Kinesis data stream to read data from.final String
The starting position in the Kinesis data stream to read data from.final Instant
The timestamp of the record in the Kinesis data stream to start reading data from.final String
The Amazon Resource Name (ARN) of the Kinesis data stream.final String
The name of the Kinesis data stream.Take this object and create a builder that contains all of the current property values of this object.final String
toString()
Returns a string representation of this object.Methods inherited from interface software.amazon.awssdk.utils.builder.ToCopyableBuilder
copy
-
Method Details
-
endpointUrl
The URL of the Kinesis endpoint.
- Returns:
- The URL of the Kinesis endpoint.
-
streamName
The name of the Kinesis data stream.
- Returns:
- The name of the Kinesis data stream.
-
classification
An optional classification.
- Returns:
- An optional classification.
-
delimiter
Specifies the delimiter character.
- Returns:
- Specifies the delimiter character.
-
startingPosition
The starting position in the Kinesis data stream to read data from. The possible values are
"latest"
,"trim_horizon"
,"earliest"
, or a timestamp string in UTC format in the patternyyyy-mm-ddTHH:MM:SSZ
(whereZ
represents a UTC timezone offset with a +/-. For example: "2023-04-04T08:00:00-04:00"). The default value is"latest"
.Note: Using a value that is a timestamp string in UTC format for "startingPosition" is supported only for Glue version 4.0 or later.
If the service returns an enum value that is not available in the current SDK version,
startingPosition
will returnStartingPosition.UNKNOWN_TO_SDK_VERSION
. The raw value returned by the service is available fromstartingPositionAsString()
.- Returns:
- The starting position in the Kinesis data stream to read data from. The possible values are
"latest"
,"trim_horizon"
,"earliest"
, or a timestamp string in UTC format in the patternyyyy-mm-ddTHH:MM:SSZ
(whereZ
represents a UTC timezone offset with a +/-. For example: "2023-04-04T08:00:00-04:00"). The default value is"latest"
.Note: Using a value that is a timestamp string in UTC format for "startingPosition" is supported only for Glue version 4.0 or later.
- See Also:
-
startingPositionAsString
The starting position in the Kinesis data stream to read data from. The possible values are
"latest"
,"trim_horizon"
,"earliest"
, or a timestamp string in UTC format in the patternyyyy-mm-ddTHH:MM:SSZ
(whereZ
represents a UTC timezone offset with a +/-. For example: "2023-04-04T08:00:00-04:00"). The default value is"latest"
.Note: Using a value that is a timestamp string in UTC format for "startingPosition" is supported only for Glue version 4.0 or later.
If the service returns an enum value that is not available in the current SDK version,
startingPosition
will returnStartingPosition.UNKNOWN_TO_SDK_VERSION
. The raw value returned by the service is available fromstartingPositionAsString()
.- Returns:
- The starting position in the Kinesis data stream to read data from. The possible values are
"latest"
,"trim_horizon"
,"earliest"
, or a timestamp string in UTC format in the patternyyyy-mm-ddTHH:MM:SSZ
(whereZ
represents a UTC timezone offset with a +/-. For example: "2023-04-04T08:00:00-04:00"). The default value is"latest"
.Note: Using a value that is a timestamp string in UTC format for "startingPosition" is supported only for Glue version 4.0 or later.
- See Also:
-
maxFetchTimeInMs
The maximum time spent in the job executor to fetch a record from the Kinesis data stream per shard, specified in milliseconds (ms). The default value is
1000
.- Returns:
- The maximum time spent in the job executor to fetch a record from the Kinesis data stream per shard,
specified in milliseconds (ms). The default value is
1000
.
-
maxFetchRecordsPerShard
The maximum number of records to fetch per shard in the Kinesis data stream. The default value is
100000
.- Returns:
- The maximum number of records to fetch per shard in the Kinesis data stream. The default value is
100000
.
-
maxRecordPerRead
The maximum number of records to fetch from the Kinesis data stream in each getRecords operation. The default value is
10000
.- Returns:
- The maximum number of records to fetch from the Kinesis data stream in each getRecords operation. The
default value is
10000
.
-
addIdleTimeBetweenReads
Adds a time delay between two consecutive getRecords operations. The default value is
"False"
. This option is only configurable for Glue version 2.0 and above.- Returns:
- Adds a time delay between two consecutive getRecords operations. The default value is
"False"
. This option is only configurable for Glue version 2.0 and above.
-
idleTimeBetweenReadsInMs
The minimum time delay between two consecutive getRecords operations, specified in ms. The default value is
1000
. This option is only configurable for Glue version 2.0 and above.- Returns:
- The minimum time delay between two consecutive getRecords operations, specified in ms. The default value
is
1000
. This option is only configurable for Glue version 2.0 and above.
-
describeShardInterval
The minimum time interval between two ListShards API calls for your script to consider resharding. The default value is
1s
.- Returns:
- The minimum time interval between two ListShards API calls for your script to consider resharding. The
default value is
1s
.
-
numRetries
The maximum number of retries for Kinesis Data Streams API requests. The default value is
3
.- Returns:
- The maximum number of retries for Kinesis Data Streams API requests. The default value is
3
.
-
retryIntervalMs
The cool-off time period (specified in ms) before retrying the Kinesis Data Streams API call. The default value is
1000
.- Returns:
- The cool-off time period (specified in ms) before retrying the Kinesis Data Streams API call. The default
value is
1000
.
-
maxRetryIntervalMs
The maximum cool-off time period (specified in ms) between two retries of a Kinesis Data Streams API call. The default value is
10000
.- Returns:
- The maximum cool-off time period (specified in ms) between two retries of a Kinesis Data Streams API
call. The default value is
10000
.
-
avoidEmptyBatches
Avoids creating an empty microbatch job by checking for unread data in the Kinesis data stream before the batch is started. The default value is
"False"
.- Returns:
- Avoids creating an empty microbatch job by checking for unread data in the Kinesis data stream before the
batch is started. The default value is
"False"
.
-
streamArn
The Amazon Resource Name (ARN) of the Kinesis data stream.
- Returns:
- The Amazon Resource Name (ARN) of the Kinesis data stream.
-
roleArn
The Amazon Resource Name (ARN) of the role to assume using AWS Security Token Service (AWS STS). This role must have permissions for describe or read record operations for the Kinesis data stream. You must use this parameter when accessing a data stream in a different account. Used in conjunction with
"awsSTSSessionName"
.- Returns:
- The Amazon Resource Name (ARN) of the role to assume using AWS Security Token Service (AWS STS). This
role must have permissions for describe or read record operations for the Kinesis data stream. You must
use this parameter when accessing a data stream in a different account. Used in conjunction with
"awsSTSSessionName"
.
-
roleSessionName
An identifier for the session assuming the role using AWS STS. You must use this parameter when accessing a data stream in a different account. Used in conjunction with
"awsSTSRoleARN"
.- Returns:
- An identifier for the session assuming the role using AWS STS. You must use this parameter when accessing
a data stream in a different account. Used in conjunction with
"awsSTSRoleARN"
.
-
addRecordTimestamp
When this option is set to 'true', the data output will contain an additional column named "__src_timestamp" that indicates the time when the corresponding record received by the stream. The default value is 'false'. This option is supported in Glue version 4.0 or later.
- Returns:
- When this option is set to 'true', the data output will contain an additional column named "__src_timestamp" that indicates the time when the corresponding record received by the stream. The default value is 'false'. This option is supported in Glue version 4.0 or later.
-
emitConsumerLagMetrics
When this option is set to 'true', for each batch, it will emit the metrics for the duration between the oldest record received by the stream and the time it arrives in Glue to CloudWatch. The metric's name is "glue.driver.streaming.maxConsumerLagInMs". The default value is 'false'. This option is supported in Glue version 4.0 or later.
- Returns:
- When this option is set to 'true', for each batch, it will emit the metrics for the duration between the oldest record received by the stream and the time it arrives in Glue to CloudWatch. The metric's name is "glue.driver.streaming.maxConsumerLagInMs". The default value is 'false'. This option is supported in Glue version 4.0 or later.
-
startingTimestamp
The timestamp of the record in the Kinesis data stream to start reading data from. The possible values are a timestamp string in UTC format of the pattern
yyyy-mm-ddTHH:MM:SSZ
(where Z represents a UTC timezone offset with a +/-. For example: "2023-04-04T08:00:00+08:00").- Returns:
- The timestamp of the record in the Kinesis data stream to start reading data from. The possible values
are a timestamp string in UTC format of the pattern
yyyy-mm-ddTHH:MM:SSZ
(where Z represents a UTC timezone offset with a +/-. For example: "2023-04-04T08:00:00+08:00").
-
toBuilder
Description copied from interface:ToCopyableBuilder
Take this object and create a builder that contains all of the current property values of this object.- Specified by:
toBuilder
in interfaceToCopyableBuilder<KinesisStreamingSourceOptions.Builder,
KinesisStreamingSourceOptions> - Returns:
- a builder for type T
-
builder
-
serializableBuilderClass
-
hashCode
public final int hashCode() -
equals
-
equalsBySdkFields
Description copied from interface:SdkPojo
Indicates whether some other object is "equal to" this one by SDK fields. An SDK field is a modeled, non-inherited field in anSdkPojo
class, and is generated based on a service model.If an
SdkPojo
class does not have any inherited fields,equalsBySdkFields
andequals
are essentially the same.- Specified by:
equalsBySdkFields
in interfaceSdkPojo
- Parameters:
obj
- the object to be compared with- Returns:
- true if the other object equals to this object by sdk fields, false otherwise.
-
toString
Returns a string representation of this object. This is useful for testing and debugging. Sensitive data will be redacted from this string using a placeholder value. -
getValueForField
-
sdkFields
-