Interface SeedUrlConfiguration.Builder
- All Superinterfaces:
Buildable
,CopyableBuilder<SeedUrlConfiguration.Builder,
,SeedUrlConfiguration> SdkBuilder<SeedUrlConfiguration.Builder,
,SeedUrlConfiguration> SdkPojo
- Enclosing class:
SeedUrlConfiguration
-
Method Summary
Modifier and TypeMethodDescriptionThe list of seed or starting point URLs of the websites you want to crawl.seedUrls
(Collection<String> seedUrls) The list of seed or starting point URLs of the websites you want to crawl.webCrawlerMode
(String webCrawlerMode) You can choose one of the following modes:webCrawlerMode
(WebCrawlerMode webCrawlerMode) You can choose one of the following modes:Methods inherited from interface software.amazon.awssdk.utils.builder.CopyableBuilder
copy
Methods inherited from interface software.amazon.awssdk.utils.builder.SdkBuilder
applyMutation, build
Methods inherited from interface software.amazon.awssdk.core.SdkPojo
equalsBySdkFields, sdkFields
-
Method Details
-
seedUrls
The list of seed or starting point URLs of the websites you want to crawl.
The list can include a maximum of 100 seed URLs.
- Parameters:
seedUrls
- The list of seed or starting point URLs of the websites you want to crawl.The list can include a maximum of 100 seed URLs.
- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
seedUrls
The list of seed or starting point URLs of the websites you want to crawl.
The list can include a maximum of 100 seed URLs.
- Parameters:
seedUrls
- The list of seed or starting point URLs of the websites you want to crawl.The list can include a maximum of 100 seed URLs.
- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
webCrawlerMode
You can choose one of the following modes:
-
HOST_ONLY
—crawl only the website host names. For example, if the seed URL is "abc.example.com", then only URLs with host name "abc.example.com" are crawled. -
SUBDOMAINS
—crawl the website host names with subdomains. For example, if the seed URL is "abc.example.com", then "a.abc.example.com" and "b.abc.example.com" are also crawled. -
EVERYTHING
—crawl the website host names with subdomains and other domains that the web pages link to.
The default mode is set to
HOST_ONLY
.- Parameters:
webCrawlerMode
- You can choose one of the following modes:-
HOST_ONLY
—crawl only the website host names. For example, if the seed URL is "abc.example.com", then only URLs with host name "abc.example.com" are crawled. -
SUBDOMAINS
—crawl the website host names with subdomains. For example, if the seed URL is "abc.example.com", then "a.abc.example.com" and "b.abc.example.com" are also crawled. -
EVERYTHING
—crawl the website host names with subdomains and other domains that the web pages link to.
The default mode is set to
HOST_ONLY
.-
- Returns:
- Returns a reference to this object so that method calls can be chained together.
- See Also:
-
-
webCrawlerMode
You can choose one of the following modes:
-
HOST_ONLY
—crawl only the website host names. For example, if the seed URL is "abc.example.com", then only URLs with host name "abc.example.com" are crawled. -
SUBDOMAINS
—crawl the website host names with subdomains. For example, if the seed URL is "abc.example.com", then "a.abc.example.com" and "b.abc.example.com" are also crawled. -
EVERYTHING
—crawl the website host names with subdomains and other domains that the web pages link to.
The default mode is set to
HOST_ONLY
.- Parameters:
webCrawlerMode
- You can choose one of the following modes:-
HOST_ONLY
—crawl only the website host names. For example, if the seed URL is "abc.example.com", then only URLs with host name "abc.example.com" are crawled. -
SUBDOMAINS
—crawl the website host names with subdomains. For example, if the seed URL is "abc.example.com", then "a.abc.example.com" and "b.abc.example.com" are also crawled. -
EVERYTHING
—crawl the website host names with subdomains and other domains that the web pages link to.
The default mode is set to
HOST_ONLY
.-
- Returns:
- Returns a reference to this object so that method calls can be chained together.
- See Also:
-
-