Available on crate feature io_parquet only.
APIs to read from Parquet format.
Re-exports
pub use parquet2::fallible_streaming_iterator;
pub use schema::get_schema;
Modules
APIs to handle Parquet <-> Arrow schemas.
APIs exposing parquet2’s statistics as arrow’s statistics.
Structs
A FallibleStreamingIterator that decompresses CompressedDataPage into DataPage.
Metadata for a column chunk.
A descriptor for leaf-level primitive columns. This encapsulates information such as definition and repetition levels and is used to re-assemble nested data.
A CompressedDataPage is a compressed, encoded representation of a Parquet data page. It holds actual data and thus cloning it is expensive.
A DataPage is an uncompressed, encoded representation of a Parquet data page. It holds actual data and thus cloning it is expensive.
Decompressor that allows re-using the page buffer of PageIterator.
Metadata for a Parquet file.
A page iterator iterates over a row group’s pages. In Parquet, pages are guaranteed to be contiguously arranged in memory and therefore must be read in sequence.
Single-threaded iterator of RecordBatch from a Parquet file.
Metadata for a row group.
Timestamp logical type annotation
Enums
Representation of a Parquet type.
Used to describe primitive leaf fields and structs, including top-level schema.
Note that the top-level schema type is represented using GroupType whose repetition is None.
Traits
A fallible, streaming iterator.
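As a rough illustration of what "fallible, streaming" means here: each item is only borrowed from the iterator, so the iterator can reuse one internal buffer between items, and the step that advances it may fail. The trait below is a simplified local stand-in written for this sketch, not the definition re-exported from parquet2; `FallibleStreaming` and `UpperChunks` are hypothetical names.

```rust
/// Simplified stand-in for a fallible, streaming iterator (hypothetical).
trait FallibleStreaming {
    type Item: ?Sized;
    type Error;

    /// Moves to the next item; this step may fail.
    fn advance(&mut self) -> Result<(), Self::Error>;

    /// Borrows the current item, or `None` if there is no current item.
    fn get(&self) -> Option<&Self::Item>;

    /// Advances, then borrows the current item.
    fn next(&mut self) -> Result<Option<&Self::Item>, Self::Error> {
        self.advance()?;
        Ok(self.get())
    }
}

/// Yields each input chunk upper-cased, reusing a single internal buffer.
struct UpperChunks<'a> {
    chunks: std::slice::Iter<'a, &'a [u8]>,
    buffer: String,
    valid: bool,
}

impl<'a> UpperChunks<'a> {
    fn new(chunks: &'a [&'a [u8]]) -> Self {
        Self { chunks: chunks.iter(), buffer: String::new(), valid: false }
    }
}

impl<'a> FallibleStreaming for UpperChunks<'a> {
    type Item = str;
    type Error = std::str::Utf8Error;

    fn advance(&mut self) -> Result<(), Self::Error> {
        // Invalidate the current item first, so a failed step leaves no stale item.
        self.valid = false;
        if let Some(&bytes) = self.chunks.next() {
            let text = std::str::from_utf8(bytes)?; // fails on invalid UTF-8
            self.buffer.clear(); // keep the allocation, drop the old contents
            self.buffer.push_str(&text.to_uppercase());
            self.valid = true;
        }
        Ok(())
    }

    fn get(&self) -> Option<&str> {
        if self.valid { Some(self.buffer.as_str()) } else { None }
    }
}
```

Because `get` returns a borrow of the internal buffer, the caller has to finish with one item before advancing to the next. That constraint is what allows the buffer to be reused, and it is why the standard `Iterator` trait does not fit this pattern.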
Functions
Returns a stream of compressed data pages
Reads a file’s metadata.
Decompresses the page, using buffer for decompression. If page.buffer.len() == 0, there was no decompression and the buffer was moved. Else, decompression took place.
Creates a new iterator of compressed pages.
Creates a new iterator of compressed pages.
Reads a Parquet file’s metadata synchronously.
Reads a Parquet file’s metadata asynchronously.
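The buffer contract described for the page decompression function above can be sketched with toy types. This is one possible reading of that description, not the library's implementation: `CompressedPage`, `decompress_into`, and the run-length "codec" are hypothetical stand-ins, and real Parquet codecs (Snappy, Gzip, and so on) work differently.

```rust
/// Toy stand-in for a compressed page (hypothetical, for illustration only).
struct CompressedPage {
    buffer: Vec<u8>,
    is_compressed: bool,
}

/// Toy "codec": expands (count, byte) pairs into `output`.
fn run_length_decode(input: &[u8], output: &mut Vec<u8>) {
    output.clear();
    for pair in input.chunks_exact(2) {
        output.extend(std::iter::repeat(pair[1]).take(pair[0] as usize));
    }
}

/// Leaves the uncompressed bytes in `buffer`.
/// Returns `true` if decoding took place, `false` if the bytes were moved.
fn decompress_into(page: &mut CompressedPage, buffer: &mut Vec<u8>) -> bool {
    if page.is_compressed {
        // Decode into the caller's buffer; the page keeps its compressed bytes.
        run_length_decode(&page.buffer, buffer);
        true
    } else {
        // Nothing to decode: move the bytes out by swapping with an emptied buffer.
        // Afterwards `page.buffer.len() == 0`, which signals that no decoding ran.
        buffer.clear();
        std::mem::swap(&mut page.buffer, buffer);
        false
    }
}
```

In this sketch the caller owns `buffer` and passes it to every call, so a run of pages is handled without a fresh allocation per page. Checking `page.buffer.len() == 0` afterwards tells the two cases apart for any non-empty page.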
Type Definitions
Type declaration for a page filter