Interface ParquetSerDe.Builder
- All Superinterfaces:
Buildable
,CopyableBuilder<ParquetSerDe.Builder,
,ParquetSerDe> SdkBuilder<ParquetSerDe.Builder,
,ParquetSerDe> SdkPojo
- Enclosing class:
ParquetSerDe
-
Method Summary
Modifier and TypeMethodDescriptionblockSizeBytes
(Integer blockSizeBytes) The Hadoop Distributed File System (HDFS) block size.compression
(String compression) The compression code to use over data blocks.compression
(ParquetCompression compression) The compression code to use over data blocks.enableDictionaryCompression
(Boolean enableDictionaryCompression) Indicates whether to enable dictionary compression.maxPaddingBytes
(Integer maxPaddingBytes) The maximum amount of padding to apply.pageSizeBytes
(Integer pageSizeBytes) The Parquet page size.writerVersion
(String writerVersion) Indicates the version of row format to output.writerVersion
(ParquetWriterVersion writerVersion) Indicates the version of row format to output.Methods inherited from interface software.amazon.awssdk.utils.builder.CopyableBuilder
copy
Methods inherited from interface software.amazon.awssdk.utils.builder.SdkBuilder
applyMutation, build
Methods inherited from interface software.amazon.awssdk.core.SdkPojo
equalsBySdkFields, sdkFields
-
Method Details
-
blockSizeBytes
The Hadoop Distributed File System (HDFS) block size. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 256 MiB and the minimum is 64 MiB. Firehose uses this value for padding calculations.
- Parameters:
blockSizeBytes
- The Hadoop Distributed File System (HDFS) block size. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 256 MiB and the minimum is 64 MiB. Firehose uses this value for padding calculations.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
pageSizeBytes
The Parquet page size. Column chunks are divided into pages. A page is conceptually an indivisible unit (in terms of compression and encoding). The minimum value is 64 KiB and the default is 1 MiB.
- Parameters:
pageSizeBytes
- The Parquet page size. Column chunks are divided into pages. A page is conceptually an indivisible unit (in terms of compression and encoding). The minimum value is 64 KiB and the default is 1 MiB.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
compression
The compression code to use over data blocks. The possible values are
UNCOMPRESSED
,SNAPPY
, andGZIP
, with the default beingSNAPPY
. UseSNAPPY
for higher decompression speed. UseGZIP
if the compression ratio is more important than speed.- Parameters:
compression
- The compression code to use over data blocks. The possible values areUNCOMPRESSED
,SNAPPY
, andGZIP
, with the default beingSNAPPY
. UseSNAPPY
for higher decompression speed. UseGZIP
if the compression ratio is more important than speed.- Returns:
- Returns a reference to this object so that method calls can be chained together.
- See Also:
-
compression
The compression code to use over data blocks. The possible values are
UNCOMPRESSED
,SNAPPY
, andGZIP
, with the default beingSNAPPY
. UseSNAPPY
for higher decompression speed. UseGZIP
if the compression ratio is more important than speed.- Parameters:
compression
- The compression code to use over data blocks. The possible values areUNCOMPRESSED
,SNAPPY
, andGZIP
, with the default beingSNAPPY
. UseSNAPPY
for higher decompression speed. UseGZIP
if the compression ratio is more important than speed.- Returns:
- Returns a reference to this object so that method calls can be chained together.
- See Also:
-
enableDictionaryCompression
Indicates whether to enable dictionary compression.
- Parameters:
enableDictionaryCompression
- Indicates whether to enable dictionary compression.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
maxPaddingBytes
The maximum amount of padding to apply. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 0.
- Parameters:
maxPaddingBytes
- The maximum amount of padding to apply. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 0.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
writerVersion
Indicates the version of row format to output. The possible values are
V1
andV2
. The default isV1
.- Parameters:
writerVersion
- Indicates the version of row format to output. The possible values areV1
andV2
. The default isV1
.- Returns:
- Returns a reference to this object so that method calls can be chained together.
- See Also:
-
writerVersion
Indicates the version of row format to output. The possible values are
V1
andV2
. The default isV1
.- Parameters:
writerVersion
- Indicates the version of row format to output. The possible values areV1
andV2
. The default isV1
.- Returns:
- Returns a reference to this object so that method calls can be chained together.
- See Also:
-