Interface ChunkingConfiguration.Builder

  • Method Details

    • chunkingStrategy

      ChunkingConfiguration.Builder chunkingStrategy(String chunkingStrategy)

      Knowledge base can split your source data into chunks. A chunk refers to an excerpt from a data source that is returned when the knowledge base that it belongs to is queried. You have the following options for chunking your data. If you opt for NONE, then you may want to pre-process your files by splitting them up such that each file corresponds to a chunk.

      • FIXED_SIZE – Amazon Bedrock splits your source data into chunks of the approximate size that you set in the fixedSizeChunkingConfiguration.

      • HIERARCHICAL – Split documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer.

      • SEMANTIC – Split documents into chunks based on groups of similar content derived with natural language processing.

      • NONE – Amazon Bedrock treats each file as one chunk. If you choose this option, you may want to pre-process your documents by splitting them into separate files.

      Parameters:
      chunkingStrategy - Knowledge base can split your source data into chunks. A chunk refers to an excerpt from a data source that is returned when the knowledge base that it belongs to is queried. You have the following options for chunking your data. If you opt for NONE, then you may want to pre-process your files by splitting them up such that each file corresponds to a chunk.

      • FIXED_SIZE – Amazon Bedrock splits your source data into chunks of the approximate size that you set in the fixedSizeChunkingConfiguration.

      • HIERARCHICAL – Split documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer.

      • SEMANTIC – Split documents into chunks based on groups of similar content derived with natural language processing.

      • NONE – Amazon Bedrock treats each file as one chunk. If you choose this option, you may want to pre-process your documents by splitting them into separate files.

      Returns:
      Returns a reference to this object so that method calls can be chained together.
      See Also:
    • chunkingStrategy

      ChunkingConfiguration.Builder chunkingStrategy(ChunkingStrategy chunkingStrategy)

      Knowledge base can split your source data into chunks. A chunk refers to an excerpt from a data source that is returned when the knowledge base that it belongs to is queried. You have the following options for chunking your data. If you opt for NONE, then you may want to pre-process your files by splitting them up such that each file corresponds to a chunk.

      • FIXED_SIZE – Amazon Bedrock splits your source data into chunks of the approximate size that you set in the fixedSizeChunkingConfiguration.

      • HIERARCHICAL – Split documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer.

      • SEMANTIC – Split documents into chunks based on groups of similar content derived with natural language processing.

      • NONE – Amazon Bedrock treats each file as one chunk. If you choose this option, you may want to pre-process your documents by splitting them into separate files.

      Parameters:
      chunkingStrategy - Knowledge base can split your source data into chunks. A chunk refers to an excerpt from a data source that is returned when the knowledge base that it belongs to is queried. You have the following options for chunking your data. If you opt for NONE, then you may want to pre-process your files by splitting them up such that each file corresponds to a chunk.

      • FIXED_SIZE – Amazon Bedrock splits your source data into chunks of the approximate size that you set in the fixedSizeChunkingConfiguration.

      • HIERARCHICAL – Split documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer.

      • SEMANTIC – Split documents into chunks based on groups of similar content derived with natural language processing.

      • NONE – Amazon Bedrock treats each file as one chunk. If you choose this option, you may want to pre-process your documents by splitting them into separate files.

      Returns:
      Returns a reference to this object so that method calls can be chained together.
      See Also:
    • fixedSizeChunkingConfiguration

      ChunkingConfiguration.Builder fixedSizeChunkingConfiguration(FixedSizeChunkingConfiguration fixedSizeChunkingConfiguration)

      Configurations for when you choose fixed-size chunking. If you set the chunkingStrategy as NONE, exclude this field.

      Parameters:
      fixedSizeChunkingConfiguration - Configurations for when you choose fixed-size chunking. If you set the chunkingStrategy as NONE, exclude this field.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • fixedSizeChunkingConfiguration

      default ChunkingConfiguration.Builder fixedSizeChunkingConfiguration(Consumer<FixedSizeChunkingConfiguration.Builder> fixedSizeChunkingConfiguration)

      Configurations for when you choose fixed-size chunking. If you set the chunkingStrategy as NONE, exclude this field.

      This is a convenience method that creates an instance of the FixedSizeChunkingConfiguration.Builder avoiding the need to create one manually via FixedSizeChunkingConfiguration.builder().

      When the Consumer completes, SdkBuilder.build() is called immediately and its result is passed to fixedSizeChunkingConfiguration(FixedSizeChunkingConfiguration).

      Parameters:
      fixedSizeChunkingConfiguration - a consumer that will call methods on FixedSizeChunkingConfiguration.Builder
      Returns:
      Returns a reference to this object so that method calls can be chained together.
      See Also:
    • hierarchicalChunkingConfiguration

      ChunkingConfiguration.Builder hierarchicalChunkingConfiguration(HierarchicalChunkingConfiguration hierarchicalChunkingConfiguration)

      Settings for hierarchical document chunking for a data source. Hierarchical chunking splits documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer.

      Parameters:
      hierarchicalChunkingConfiguration - Settings for hierarchical document chunking for a data source. Hierarchical chunking splits documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • hierarchicalChunkingConfiguration

      default ChunkingConfiguration.Builder hierarchicalChunkingConfiguration(Consumer<HierarchicalChunkingConfiguration.Builder> hierarchicalChunkingConfiguration)

      Settings for hierarchical document chunking for a data source. Hierarchical chunking splits documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer.

      This is a convenience method that creates an instance of the HierarchicalChunkingConfiguration.Builder avoiding the need to create one manually via HierarchicalChunkingConfiguration.builder().

      When the Consumer completes, SdkBuilder.build() is called immediately and its result is passed to hierarchicalChunkingConfiguration(HierarchicalChunkingConfiguration).

      Parameters:
      hierarchicalChunkingConfiguration - a consumer that will call methods on HierarchicalChunkingConfiguration.Builder
      Returns:
      Returns a reference to this object so that method calls can be chained together.
      See Also:
    • semanticChunkingConfiguration

      ChunkingConfiguration.Builder semanticChunkingConfiguration(SemanticChunkingConfiguration semanticChunkingConfiguration)

      Settings for semantic document chunking for a data source. Semantic chunking splits a document into into smaller documents based on groups of similar content derived from the text with natural language processing.

      Parameters:
      semanticChunkingConfiguration - Settings for semantic document chunking for a data source. Semantic chunking splits a document into into smaller documents based on groups of similar content derived from the text with natural language processing.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • semanticChunkingConfiguration

      default ChunkingConfiguration.Builder semanticChunkingConfiguration(Consumer<SemanticChunkingConfiguration.Builder> semanticChunkingConfiguration)

      Settings for semantic document chunking for a data source. Semantic chunking splits a document into into smaller documents based on groups of similar content derived from the text with natural language processing.

      This is a convenience method that creates an instance of the SemanticChunkingConfiguration.Builder avoiding the need to create one manually via SemanticChunkingConfiguration.builder().

      When the Consumer completes, SdkBuilder.build() is called immediately and its result is passed to semanticChunkingConfiguration(SemanticChunkingConfiguration).

      Parameters:
      semanticChunkingConfiguration - a consumer that will call methods on SemanticChunkingConfiguration.Builder
      Returns:
      Returns a reference to this object so that method calls can be chained together.
      See Also: