| Package | Description |
|---|---|
| org.apache.parquet.hadoop |
Provides classes to store use Parquet files in Hadoop
In a map reduce job:
|
| org.apache.parquet.hadoop.mapred |
| Modifier and Type | Method and Description |
|---|---|
List<Footer> |
ParquetInputFormat.getFooters(org.apache.hadoop.conf.Configuration configuration,
Collection<org.apache.hadoop.fs.FileStatus> statuses)
the footers for the files
|
List<Footer> |
ParquetInputFormat.getFooters(org.apache.hadoop.conf.Configuration configuration,
List<org.apache.hadoop.fs.FileStatus> statuses) |
List<Footer> |
ParquetInputFormat.getFooters(org.apache.hadoop.mapreduce.JobContext jobContext) |
static List<Footer> |
ParquetFileReader.readAllFootersInParallel(org.apache.hadoop.conf.Configuration configuration,
org.apache.hadoop.fs.FileStatus fileStatus)
Read the footers of all the files under that path (recursively)
not using summary files.
|
static List<Footer> |
ParquetFileReader.readAllFootersInParallel(org.apache.hadoop.conf.Configuration configuration,
List<org.apache.hadoop.fs.FileStatus> partFiles)
Deprecated.
|
static List<Footer> |
ParquetFileReader.readAllFootersInParallel(org.apache.hadoop.conf.Configuration configuration,
List<org.apache.hadoop.fs.FileStatus> partFiles,
boolean skipRowGroups)
read all the footers of the files provided
(not using summary files)
|
static List<Footer> |
ParquetFileReader.readAllFootersInParallelUsingSummaryFiles(org.apache.hadoop.conf.Configuration configuration,
Collection<org.apache.hadoop.fs.FileStatus> partFiles,
boolean skipRowGroups)
for files provided, check if there's a summary file.
|
static List<Footer> |
ParquetFileReader.readAllFootersInParallelUsingSummaryFiles(org.apache.hadoop.conf.Configuration configuration,
List<org.apache.hadoop.fs.FileStatus> partFiles)
Deprecated.
|
static List<Footer> |
ParquetFileReader.readFooters(org.apache.hadoop.conf.Configuration configuration,
org.apache.hadoop.fs.FileStatus pathStatus)
Deprecated.
|
static List<Footer> |
ParquetFileReader.readFooters(org.apache.hadoop.conf.Configuration configuration,
org.apache.hadoop.fs.FileStatus pathStatus,
boolean skipRowGroups)
Read the footers of all the files under that path (recursively)
using summary files if possible
|
static List<Footer> |
ParquetFileReader.readFooters(org.apache.hadoop.conf.Configuration configuration,
org.apache.hadoop.fs.Path path)
Deprecated.
|
static List<Footer> |
ParquetFileReader.readSummaryFile(org.apache.hadoop.conf.Configuration configuration,
org.apache.hadoop.fs.FileStatus summaryStatus)
Specifically reads a given summary file
|
| Modifier and Type | Method and Description |
|---|---|
List<ParquetInputSplit> |
ParquetInputFormat.getSplits(org.apache.hadoop.conf.Configuration configuration,
List<Footer> footers)
Deprecated.
split planning using file footers will be removed
|
static void |
ParquetFileWriter.writeMetadataFile(org.apache.hadoop.conf.Configuration configuration,
org.apache.hadoop.fs.Path outputPath,
List<Footer> footers)
writes a _metadata and _common_metadata file
|
| Modifier and Type | Method and Description |
|---|---|
List<Footer> |
DeprecatedParquetInputFormat.getFooters(org.apache.hadoop.mapred.JobConf job) |
Copyright © 2018 The Apache Software Foundation. All rights reserved.