Interface ParquetSerDe.Builder

All Superinterfaces:
Buildable, CopyableBuilder<ParquetSerDe.Builder,ParquetSerDe>, SdkBuilder<ParquetSerDe.Builder,ParquetSerDe>, SdkPojo
Enclosing class:
ParquetSerDe

public static interface ParquetSerDe.Builder extends SdkPojo, CopyableBuilder<ParquetSerDe.Builder,ParquetSerDe>
  • Method Details

    • blockSizeBytes

      ParquetSerDe.Builder blockSizeBytes(Integer blockSizeBytes)

      The Hadoop Distributed File System (HDFS) block size. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 256 MiB and the minimum is 64 MiB. Firehose uses this value for padding calculations.

      Parameters:
      blockSizeBytes - The Hadoop Distributed File System (HDFS) block size. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 256 MiB and the minimum is 64 MiB. Firehose uses this value for padding calculations.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • pageSizeBytes

      ParquetSerDe.Builder pageSizeBytes(Integer pageSizeBytes)

      The Parquet page size. Column chunks are divided into pages. A page is conceptually an indivisible unit (in terms of compression and encoding). The minimum value is 64 KiB and the default is 1 MiB.

      Parameters:
      pageSizeBytes - The Parquet page size. Column chunks are divided into pages. A page is conceptually an indivisible unit (in terms of compression and encoding). The minimum value is 64 KiB and the default is 1 MiB.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
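
Both size setters take a raw byte count as an Integer, so the MiB and KiB figures quoted above have to be converted before they are passed in. The following is a minimal sketch of that arithmetic, not part of the SDK: the class ParquetSizeSketch and its methods are hypothetical names introduced for illustration, and the upper bound it checks comes only from the Integer parameter type, not from any documented service maximum.

```java
// Hypothetical helper (not an SDK class): converts the documented units to
// byte counts and rejects values below the documented minimums.
final class ParquetSizeSketch {
    static final int KIB = 1024;
    static final int MIB = 1024 * KIB;

    // Limits and defaults as stated in the method descriptions.
    static final int MIN_BLOCK_SIZE_BYTES = 64 * MIB;
    static final int DEFAULT_BLOCK_SIZE_BYTES = 256 * MIB;
    static final int MIN_PAGE_SIZE_BYTES = 64 * KIB;
    static final int DEFAULT_PAGE_SIZE_BYTES = MIB;

    // Block size in bytes for a size given in MiB.
    static int blockSizeBytes(int mebibytes) {
        return checked((long) mebibytes * MIB, MIN_BLOCK_SIZE_BYTES, "blockSizeBytes");
    }

    // Page size in bytes for a size given in KiB.
    static int pageSizeBytes(int kibibytes) {
        return checked((long) kibibytes * KIB, MIN_PAGE_SIZE_BYTES, "pageSizeBytes");
    }

    // The multiplication is done in long so an overflow is detected rather
    // than silently wrapped. Integer.MAX_VALUE is an assumption derived from
    // the parameter type only.
    private static int checked(long bytes, int minimum, String name) {
        if (bytes < minimum || bytes > Integer.MAX_VALUE) {
            throw new IllegalArgumentException(
                name + " must be between " + minimum + " and " + Integer.MAX_VALUE
                    + " bytes, got " + bytes);
        }
        return (int) bytes;
    }

    public static void main(String[] args) {
        System.out.println(blockSizeBytes(256)); // the documented default block size
        System.out.println(pageSizeBytes(1024)); // the documented default page size
    }
}
```

The returned values can be passed directly to blockSizeBytes(Integer) and pageSizeBytes(Integer) on the builder. Validating locally is a design choice that surfaces a bad value before the request is sent; the service remains the authority on what it accepts.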
    • compression

      ParquetSerDe.Builder compression(String compression)

      The compression codec to use over data blocks. The possible values are UNCOMPRESSED, SNAPPY, and GZIP, with the default being SNAPPY. Use SNAPPY for higher decompression speed. Use GZIP if the compression ratio is more important than speed.

      Parameters:
      compression - The compression codec to use over data blocks. The possible values are UNCOMPRESSED, SNAPPY, and GZIP, with the default being SNAPPY. Use SNAPPY for higher decompression speed. Use GZIP if the compression ratio is more important than speed.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • compression

      ParquetSerDe.Builder compression(ParquetCompression compression)

      The compression codec to use over data blocks. The possible values are UNCOMPRESSED, SNAPPY, and GZIP, with the default being SNAPPY. Use SNAPPY for higher decompression speed. Use GZIP if the compression ratio is more important than speed.

      Parameters:
      compression - The compression codec to use over data blocks. The possible values are UNCOMPRESSED, SNAPPY, and GZIP, with the default being SNAPPY. Use SNAPPY for higher decompression speed. Use GZIP if the compression ratio is more important than speed.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
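
Because compression accepts either a String or the ParquetCompression enum, a value read from configuration usually needs to be normalized first. The sketch below illustrates one way to do that in plain Java. It is an assumption-laden example, not SDK code: CompressionChoiceSketch and its nested Codec enum are hypothetical stand-ins that only mirror the three values and the SNAPPY default documented above.

```java
import java.util.Locale;

// Hypothetical helper (not an SDK class): maps a free-form configuration
// string to one of the three documented compression values.
final class CompressionChoiceSketch {

    // Local stand-in for the documented values; this is NOT the SDK's
    // ParquetCompression enum.
    enum Codec { UNCOMPRESSED, SNAPPY, GZIP }

    static Codec resolve(String configured) {
        // Fall back to the documented default when nothing is configured.
        if (configured == null || configured.trim().isEmpty()) {
            return Codec.SNAPPY;
        }
        try {
            // Locale.ROOT keeps the upper-casing independent of the JVM locale.
            return Codec.valueOf(configured.trim().toUpperCase(Locale.ROOT));
        } catch (IllegalArgumentException e) {
            throw new IllegalArgumentException(
                "Unsupported compression '" + configured
                    + "'; expected UNCOMPRESSED, SNAPPY, or GZIP", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(resolve("gzip"));
        System.out.println(resolve(null));
    }
}
```

The name() of the resolved value matches the documented strings, so it can be handed to compression(String), or mapped to the corresponding ParquetCompression constant for the enum overload.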
    • enableDictionaryCompression

      ParquetSerDe.Builder enableDictionaryCompression(Boolean enableDictionaryCompression)

      Indicates whether to enable dictionary compression.

      Parameters:
      enableDictionaryCompression - Indicates whether to enable dictionary compression.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • maxPaddingBytes

      ParquetSerDe.Builder maxPaddingBytes(Integer maxPaddingBytes)

      The maximum amount of padding to apply. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 0.

      Parameters:
      maxPaddingBytes - The maximum amount of padding to apply. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 0.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • writerVersion

      ParquetSerDe.Builder writerVersion(String writerVersion)

      Indicates the version of row format to output. The possible values are V1 and V2. The default is V1.

      Parameters:
      writerVersion - Indicates the version of row format to output. The possible values are V1 and V2. The default is V1.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • writerVersion

      ParquetSerDe.Builder writerVersion(ParquetWriterVersion writerVersion)

      Indicates the version of row format to output. The possible values are V1 and V2. The default is V1.

      Parameters:
      writerVersion - Indicates the version of row format to output. The possible values are V1 and V2. The default is V1.
      Returns:
      Returns a reference to this object so that method calls can be chained together.