Class Crawler
- All Implemented Interfaces:
Serializable, SdkPojo, ToCopyableBuilder<Crawler.Builder,Crawler>
Specifies a crawler program that examines a data source and uses classifiers to try to determine its schema. If successful, the crawler records metadata concerning the data source in the Glue Data Catalog.
-
Nested Class Summary
Nested Classes -
Method Summary
| Modifier and Type | Method | Description |
|---|---|---|
| static Crawler.Builder | builder() | |
| final List<String> | classifiers() | A list of UTF-8 strings that specify the custom classifiers that are associated with the crawler. |
| final String | configuration() | Crawler configuration information. |
| final Long | crawlElapsedTime() | If the crawler is running, contains the total time elapsed since the last crawl began. |
| final String | crawlerSecurityConfiguration() | The name of the SecurityConfiguration structure to be used by this crawler. |
| final Instant | creationTime() | The time that the crawler was created. |
| final String | databaseName() | The name of the database in which the crawler's output is stored. |
| final String | description() | A description of the crawler. |
| final boolean | equals(Object obj) | |
| final boolean | equalsBySdkFields(Object obj) | Indicates whether some other object is "equal to" this one by SDK fields. |
| final <T> Optional<T> | getValueForField(String fieldName, Class<T> clazz) | |
| final boolean | hasClassifiers() | For responses, this returns true if the service returned a value for the Classifiers property. |
| final int | hashCode() | |
| final LakeFormationConfiguration | lakeFormationConfiguration() | Specifies whether the crawler should use Lake Formation credentials for the crawler instead of the IAM role credentials. |
| final LastCrawlInfo | lastCrawl() | The status of the last crawl, and potentially error information if an error occurred. |
| final Instant | lastUpdated() | The time that the crawler was last updated. |
| final LineageConfiguration | lineageConfiguration() | A configuration that specifies whether data lineage is enabled for the crawler. |
| final String | name() | The name of the crawler. |
| final RecrawlPolicy | recrawlPolicy() | A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run. |
| final String | role() | The Amazon Resource Name (ARN) of an IAM role that's used to access customer resources, such as Amazon Simple Storage Service (Amazon S3) data. |
| final Schedule | schedule() | For scheduled crawlers, the schedule when the crawler runs. |
| final SchemaChangePolicy | schemaChangePolicy() | The policy that specifies update and delete behaviors for the crawler. |
| final List<SdkField<?>> | sdkFields() | |
| static Class<? extends Crawler.Builder> | serializableBuilderClass() | |
| final CrawlerState | state() | Indicates whether the crawler is running, or whether a run is pending. |
| final String | stateAsString() | Indicates whether the crawler is running, or whether a run is pending. |
| final String | tablePrefix() | The prefix added to the names of tables that are created. |
| final CrawlerTargets | targets() | A collection of targets to crawl. |
| Crawler.Builder | toBuilder() | Take this object and create a builder that contains all of the current property values of this object. |
| final String | toString() | Returns a string representation of this object. |
| final Long | version() | The version of the crawler. |

Methods inherited from interface software.amazon.awssdk.utils.builder.ToCopyableBuilder
copy
-
Method Details
-
name
The name of the crawler.
- Returns:
- The name of the crawler.
-
role
The Amazon Resource Name (ARN) of an IAM role that's used to access customer resources, such as Amazon Simple Storage Service (Amazon S3) data.
- Returns:
- The Amazon Resource Name (ARN) of an IAM role that's used to access customer resources, such as Amazon Simple Storage Service (Amazon S3) data.
-
targets
A collection of targets to crawl.
- Returns:
- A collection of targets to crawl.
-
databaseName
The name of the database in which the crawler's output is stored.
- Returns:
- The name of the database in which the crawler's output is stored.
-
description
A description of the crawler.
- Returns:
- A description of the crawler.
-
hasClassifiers
public final boolean hasClassifiers()
For responses, this returns true if the service returned a value for the Classifiers property. This DOES NOT check that the value is non-empty (for which, you should check the isEmpty() method on the property). This is useful because the SDK will never return a null collection or map, but you may need to differentiate between the service returning nothing (or null) and the service returning an empty collection or map. For requests, this returns true if a value for the property was specified in the request builder, and false if a value was not specified.
-
classifiers
A list of UTF-8 strings that specify the custom classifiers that are associated with the crawler.
Attempts to modify the collection returned by this method will result in an UnsupportedOperationException.
This method will never return null. If you would like to know whether the service returned this field (so that you can differentiate between null and empty), you can use the hasClassifiers() method.
- Returns:
- A list of UTF-8 strings that specify the custom classifiers that are associated with the crawler.
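The two behaviors described above (an unmodifiable list that is never null, and a separate presence flag) can be handled as in the following minimal sketch. `ClassifierLists`, `describe`, and `withAdded` are hypothetical helper names, not part of the SDK; the helpers take the values returned by `hasClassifiers()` and `classifiers()` as plain arguments so that the example stays dependency free.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper class for illustration; not part of the SDK.
final class ClassifierLists {

    // Distinguishes "field not returned" from "field returned but empty".
    // hasClassifiers is assumed to be the result of crawler.hasClassifiers(),
    // classifiers the (never null) result of crawler.classifiers().
    static String describe(boolean hasClassifiers, List<String> classifiers) {
        if (!hasClassifiers) {
            return "not returned";
        }
        return classifiers.isEmpty()
                ? "returned empty"
                : classifiers.size() + " classifier(s)";
    }

    // Builds a new mutable list with one extra name. The input is copied
    // rather than modified, because the list returned by classifiers()
    // throws UnsupportedOperationException on modification.
    static List<String> withAdded(List<String> classifiers, String extra) {
        List<String> copy = new ArrayList<>(classifiers);
        copy.add(extra);
        return copy;
    }
}
```

With a real `Crawler` instance named `crawler`, the calls would look like `ClassifierLists.describe(crawler.hasClassifiers(), crawler.classifiers())`.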
-
recrawlPolicy
A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run.
- Returns:
- A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run.
-
schemaChangePolicy
The policy that specifies update and delete behaviors for the crawler.
- Returns:
- The policy that specifies update and delete behaviors for the crawler.
-
lineageConfiguration
A configuration that specifies whether data lineage is enabled for the crawler.
- Returns:
- A configuration that specifies whether data lineage is enabled for the crawler.
-
state
Indicates whether the crawler is running, or whether a run is pending.
If the service returns an enum value that is not available in the current SDK version, state will return CrawlerState.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available from stateAsString().
- Returns:
- Indicates whether the crawler is running, or whether a run is pending.
-
stateAsString
Indicates whether the crawler is running, or whether a run is pending.
If the service returns an enum value that is not available in the current SDK version, state will return CrawlerState.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available from stateAsString().
- Returns:
- Indicates whether the crawler is running, or whether a run is pending.
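The relationship between the typed value and the raw string can be illustrated with a small stand-in. `StateSketch` below is a hypothetical enum that mimics the fallback behavior described for `state()` and `stateAsString()`; its constants (`READY`, `RUNNING`, `STOPPING`) are assumptions made for the example, and the real `CrawlerState` enum ships with the SDK.

```java
// Hypothetical stand-in for CrawlerState, for illustration only.
enum StateSketch {
    READY("READY"),
    RUNNING("RUNNING"),
    STOPPING("STOPPING"),
    UNKNOWN_TO_SDK_VERSION(null);

    private final String value;

    StateSketch(String value) {
        this.value = value;
    }

    // Maps a raw service string to a constant, falling back to
    // UNKNOWN_TO_SDK_VERSION when the string is not recognized.
    static StateSketch fromValue(String raw) {
        if (raw == null) {
            return null;
        }
        for (StateSketch s : values()) {
            if (raw.equals(s.value)) {
                return s;
            }
        }
        return UNKNOWN_TO_SDK_VERSION;
    }
}

// Hypothetical caller that keeps the raw string for unknown values.
final class StateHandling {
    static String label(String raw) {
        StateSketch state = StateSketch.fromValue(raw);
        if (state == null) {
            return "no state";
        }
        switch (state) {
            case READY:
                return "ready";
            case RUNNING:
                return "running";
            case STOPPING:
                return "stopping";
            default:
                // The typed value carries no detail here, so report the raw one.
                return "unrecognized: " + raw;
        }
    }
}
```

The design point is that code switching on the typed value should keep a default branch and log the raw string, so that a state added by the service later does not go unnoticed.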
-
tablePrefix
The prefix added to the names of tables that are created.
- Returns:
- The prefix added to the names of tables that are created.
-
schedule
For scheduled crawlers, the schedule when the crawler runs.
- Returns:
- For scheduled crawlers, the schedule when the crawler runs.
-
crawlElapsedTime
If the crawler is running, contains the total time elapsed since the last crawl began.
- Returns:
- If the crawler is running, contains the total time elapsed since the last crawl began.
-
creationTime
The time that the crawler was created.
- Returns:
- The time that the crawler was created.
-
lastUpdated
The time that the crawler was last updated.
- Returns:
- The time that the crawler was last updated.
-
lastCrawl
The status of the last crawl, and potentially error information if an error occurred.
- Returns:
- The status of the last crawl, and potentially error information if an error occurred.
-
version
The version of the crawler.
- Returns:
- The version of the crawler.
-
configuration
Crawler configuration information. This versioned JSON string allows users to specify aspects of a crawler's behavior. For more information, see Setting crawler configuration options.
- Returns:
- Crawler configuration information. This versioned JSON string allows users to specify aspects of a crawler's behavior. For more information, see Setting crawler configuration options.
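As a rough illustration of what a versioned JSON configuration string may look like, the sketch below holds one example value and reads its version number. The option names in `EXAMPLE` (`CrawlerOutput`, `Partitions`, `AddOrUpdateBehavior`) are assumed from the Glue crawler configuration guide and should be checked against it; `CrawlerConfigSketch` and `version` are hypothetical names. A regular expression is used only to avoid a JSON library dependency; a real JSON parser is the better choice in production code.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration; not part of the SDK.
final class CrawlerConfigSketch {

    // Example configuration value. The option names are assumptions
    // taken from the Glue crawler configuration guide.
    static final String EXAMPLE =
            "{\"Version\":1.0,\"CrawlerOutput\":{\"Partitions\":"
            + "{\"AddOrUpdateBehavior\":\"InheritFromTable\"}}}";

    // Matches a numeric "Version" entry such as "Version": 1.0
    private static final Pattern VERSION =
            Pattern.compile("\"Version\"\\s*:\\s*([0-9]+(?:\\.[0-9]+)?)");

    // Returns the version number as text, or null when none is found.
    static String version(String configurationJson) {
        if (configurationJson == null) {
            return null;
        }
        Matcher m = VERSION.matcher(configurationJson);
        return m.find() ? m.group(1) : null;
    }
}
```

With a real `Crawler` instance named `crawler`, the string passed in would be the result of `crawler.configuration()`.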
-
crawlerSecurityConfiguration
The name of the SecurityConfiguration structure to be used by this crawler.
- Returns:
- The name of the SecurityConfiguration structure to be used by this crawler.
-
lakeFormationConfiguration
Specifies whether the crawler should use Lake Formation credentials for the crawler instead of the IAM role credentials.
- Returns:
- Specifies whether the crawler should use Lake Formation credentials for the crawler instead of the IAM role credentials.
-
toBuilder
Description copied from interface: ToCopyableBuilder
Take this object and create a builder that contains all of the current property values of this object.
- Specified by:
- toBuilder in interface ToCopyableBuilder<Crawler.Builder,Crawler>
- Returns:
- a builder for type T
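The copy-and-modify pattern that toBuilder enables can be sketched with a small stand-in class. `CrawlerSketch` below is hypothetical and carries only two fields; it mimics the immutable-object-plus-builder shape and is not the SDK class.

```java
// Hypothetical stand-in showing the immutable object plus builder shape.
final class CrawlerSketch {
    private final String name;
    private final String description;

    private CrawlerSketch(Builder b) {
        this.name = b.name;
        this.description = b.description;
    }

    String name() {
        return name;
    }

    String description() {
        return description;
    }

    static Builder builder() {
        return new Builder();
    }

    // Returns a builder pre-populated with this object's current values.
    Builder toBuilder() {
        return new Builder().name(name).description(description);
    }

    static final class Builder {
        private String name;
        private String description;

        Builder name(String name) {
            this.name = name;
            return this;
        }

        Builder description(String description) {
            this.description = description;
            return this;
        }

        CrawlerSketch build() {
            return new CrawlerSketch(this);
        }
    }
}
```

With the SDK class, the equivalent chain would be `crawler.toBuilder().description("...").build()`, which yields a new object and leaves the original unchanged.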
-
builder
-
serializableBuilderClass
-
hashCode
-
equals
-
equalsBySdkFields
Description copied from interface: SdkPojo
Indicates whether some other object is "equal to" this one by SDK fields. An SDK field is a modeled, non-inherited field in an SdkPojo class, and is generated based on a service model.
If an SdkPojo class does not have any inherited fields, equalsBySdkFields and equals are essentially the same.
- Specified by:
- equalsBySdkFields in interface SdkPojo
- Parameters:
- obj - the object to be compared with
- Returns:
- true if the other object equals this object by SDK fields, false otherwise.
-
toString
-
getValueForField
-
sdkFields
-