Class ChunkingConfiguration
- All Implemented Interfaces:
Serializable
,SdkPojo
,ToCopyableBuilder<ChunkingConfiguration.Builder,
ChunkingConfiguration>
Details about how to chunk the documents in the data source. A chunk refers to an excerpt from a data source that is returned when the knowledge base that it belongs to is queried.
- See Also:
-
Nested Class Summary
Nested Classes -
Method Summary
Modifier and TypeMethodDescriptionbuilder()
final ChunkingStrategy
Knowledge base can split your source data into chunks.final String
Knowledge base can split your source data into chunks.final boolean
final boolean
equalsBySdkFields
(Object obj) Indicates whether some other object is "equal to" this one by SDK fields.Configurations for when you choose fixed-size chunking.final <T> Optional
<T> getValueForField
(String fieldName, Class<T> clazz) final int
hashCode()
Settings for hierarchical document chunking for a data source.Settings for semantic document chunking for a data source.static Class
<? extends ChunkingConfiguration.Builder> Take this object and create a builder that contains all of the current property values of this object.final String
toString()
Returns a string representation of this object.Methods inherited from interface software.amazon.awssdk.utils.builder.ToCopyableBuilder
copy
-
Method Details
-
chunkingStrategy
Knowledge base can split your source data into chunks. A chunk refers to an excerpt from a data source that is returned when the knowledge base that it belongs to is queried. You have the following options for chunking your data. If you opt for
NONE
, then you may want to pre-process your files by splitting them up such that each file corresponds to a chunk.-
FIXED_SIZE
– Amazon Bedrock splits your source data into chunks of the approximate size that you set in thefixedSizeChunkingConfiguration
. -
HIERARCHICAL
– Split documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer. -
SEMANTIC
– Split documents into chunks based on groups of similar content derived with natural language processing. -
NONE
– Amazon Bedrock treats each file as one chunk. If you choose this option, you may want to pre-process your documents by splitting them into separate files.
If the service returns an enum value that is not available in the current SDK version,
chunkingStrategy
will returnChunkingStrategy.UNKNOWN_TO_SDK_VERSION
. The raw value returned by the service is available fromchunkingStrategyAsString()
.- Returns:
- Knowledge base can split your source data into chunks. A chunk refers to an excerpt from a data
source that is returned when the knowledge base that it belongs to is queried. You have the following
options for chunking your data. If you opt for
NONE
, then you may want to pre-process your files by splitting them up such that each file corresponds to a chunk.-
FIXED_SIZE
– Amazon Bedrock splits your source data into chunks of the approximate size that you set in thefixedSizeChunkingConfiguration
. -
HIERARCHICAL
– Split documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer. -
SEMANTIC
– Split documents into chunks based on groups of similar content derived with natural language processing. -
NONE
– Amazon Bedrock treats each file as one chunk. If you choose this option, you may want to pre-process your documents by splitting them into separate files.
-
- See Also:
-
-
chunkingStrategyAsString
Knowledge base can split your source data into chunks. A chunk refers to an excerpt from a data source that is returned when the knowledge base that it belongs to is queried. You have the following options for chunking your data. If you opt for
NONE
, then you may want to pre-process your files by splitting them up such that each file corresponds to a chunk.-
FIXED_SIZE
– Amazon Bedrock splits your source data into chunks of the approximate size that you set in thefixedSizeChunkingConfiguration
. -
HIERARCHICAL
– Split documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer. -
SEMANTIC
– Split documents into chunks based on groups of similar content derived with natural language processing. -
NONE
– Amazon Bedrock treats each file as one chunk. If you choose this option, you may want to pre-process your documents by splitting them into separate files.
If the service returns an enum value that is not available in the current SDK version,
chunkingStrategy
will returnChunkingStrategy.UNKNOWN_TO_SDK_VERSION
. The raw value returned by the service is available fromchunkingStrategyAsString()
.- Returns:
- Knowledge base can split your source data into chunks. A chunk refers to an excerpt from a data
source that is returned when the knowledge base that it belongs to is queried. You have the following
options for chunking your data. If you opt for
NONE
, then you may want to pre-process your files by splitting them up such that each file corresponds to a chunk.-
FIXED_SIZE
– Amazon Bedrock splits your source data into chunks of the approximate size that you set in thefixedSizeChunkingConfiguration
. -
HIERARCHICAL
– Split documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer. -
SEMANTIC
– Split documents into chunks based on groups of similar content derived with natural language processing. -
NONE
– Amazon Bedrock treats each file as one chunk. If you choose this option, you may want to pre-process your documents by splitting them into separate files.
-
- See Also:
-
-
fixedSizeChunkingConfiguration
Configurations for when you choose fixed-size chunking. If you set the
chunkingStrategy
asNONE
, exclude this field.- Returns:
- Configurations for when you choose fixed-size chunking. If you set the
chunkingStrategy
asNONE
, exclude this field.
-
hierarchicalChunkingConfiguration
Settings for hierarchical document chunking for a data source. Hierarchical chunking splits documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer.
- Returns:
- Settings for hierarchical document chunking for a data source. Hierarchical chunking splits documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer.
-
semanticChunkingConfiguration
Settings for semantic document chunking for a data source. Semantic chunking splits a document into into smaller documents based on groups of similar content derived from the text with natural language processing.
- Returns:
- Settings for semantic document chunking for a data source. Semantic chunking splits a document into into smaller documents based on groups of similar content derived from the text with natural language processing.
-
toBuilder
Description copied from interface:ToCopyableBuilder
Take this object and create a builder that contains all of the current property values of this object.- Specified by:
toBuilder
in interfaceToCopyableBuilder<ChunkingConfiguration.Builder,
ChunkingConfiguration> - Returns:
- a builder for type T
-
builder
-
serializableBuilderClass
-
hashCode
-
equals
-
equalsBySdkFields
Description copied from interface:SdkPojo
Indicates whether some other object is "equal to" this one by SDK fields. An SDK field is a modeled, non-inherited field in anSdkPojo
class, and is generated based on a service model.If an
SdkPojo
class does not have any inherited fields,equalsBySdkFields
andequals
are essentially the same.- Specified by:
equalsBySdkFields
in interfaceSdkPojo
- Parameters:
obj
- the object to be compared with- Returns:
- true if the other object equals to this object by sdk fields, false otherwise.
-
toString
-
getValueForField
-
sdkFields
-
sdkFieldNameToField
- Specified by:
sdkFieldNameToField
in interfaceSdkPojo
- Returns:
- The mapping between the field name and its corresponding field.
-