This is supported on crate feature io_parquet only.

APIs to read from Parquet format.

Re-exports

pub use parquet2::fallible_streaming_iterator;
pub use schema::get_schema;

Modules

APIs to handle Parquet <-> Arrow schemas.

APIs exposing parquet2’s statistics as arrow’s statistics.

Structs

Metadata for a column chunk.

A descriptor for leaf-level primitive columns. This encapsulates information such as definition and repetition levels and is used to re-assemble nested data.

A CompressedDataPage is a compressed, encoded representation of a Parquet data page. It holds actual data and thus cloning it is expensive.

A DataPage is an uncompressed, encoded representation of a Parquet data page. It holds actual data and thus cloning it is expensive.

Decompressor that allows re-using the page buffer of PageIterator.

Metadata for a Parquet file.

A page iterator iterates over a row group’s pages. In Parquet, pages are guaranteed to be contiguously arranged in memory and therefore must be read in sequence.

Single-threaded iterator of RecordBatch from a Parquet file.

Metadata for a row group.

Timestamp logical type annotation

Enums

Representation of a Parquet type. Used to describe primitive leaf fields and structs, including top-level schema. Note that the top-level schema type is represented using GroupType whose repetition is None.

Traits

A fallible, streaming iterator.
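
To illustrate what "fallible" and "streaming" mean together, here is a minimal, std-only sketch. The trait `FallibleStreaming`, the type `UpperCaser`, and the helper `collect_all` are hypothetical names invented for this example; they are not the trait re-exported from parquet2, only a simplified model of the same idea: `advance` may fail, and `get` lends a reference to an item stored inside the iterator, so one buffer can be reused for every item.

```rust
/// Simplified, hypothetical model of a fallible streaming iterator.
/// Not the definition re-exported by this module.
trait FallibleStreaming {
    type Item: ?Sized;
    type Error;

    /// Moves to the next item; this step may fail.
    fn advance(&mut self) -> Result<(), Self::Error>;

    /// Borrows the current item, or `None` once the iterator is exhausted.
    fn get(&self) -> Option<&Self::Item>;

    /// Convenience: advance, then borrow the current item.
    fn next(&mut self) -> Result<Option<&Self::Item>, Self::Error> {
        self.advance()?;
        Ok(self.get())
    }
}

/// Toy implementation: validates each input as UTF-8 and upper-cases it
/// into a single buffer that is reused across items.
struct UpperCaser {
    inputs: std::vec::IntoIter<Vec<u8>>,
    buffer: String,
    valid: bool,
}

impl UpperCaser {
    fn new(inputs: Vec<Vec<u8>>) -> Self {
        Self { inputs: inputs.into_iter(), buffer: String::new(), valid: false }
    }
}

impl FallibleStreaming for UpperCaser {
    type Item = str;
    type Error = std::str::Utf8Error;

    fn advance(&mut self) -> Result<(), Self::Error> {
        // Invalidate the previous item before attempting the next one.
        self.valid = false;
        self.buffer.clear();
        if let Some(bytes) = self.inputs.next() {
            let text = std::str::from_utf8(&bytes)?;
            self.buffer.push_str(&text.to_uppercase());
            self.valid = true;
        }
        Ok(())
    }

    fn get(&self) -> Option<&str> {
        if self.valid { Some(self.buffer.as_str()) } else { None }
    }
}

/// Drains the iterator, copying each borrowed item into an owned `String`.
fn collect_all(inputs: Vec<Vec<u8>>) -> Result<Vec<String>, std::str::Utf8Error> {
    let mut iter = UpperCaser::new(inputs);
    let mut out = Vec::new();
    while let Some(item) = iter.next()? {
        out.push(item.to_string());
    }
    Ok(out)
}
```

Because each item borrows from the iterator, it must be used or copied before the next call to `advance`. That constraint is what allows a page buffer to be reused instead of reallocated.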

Functions

Returns a stream of compressed data pages

Reads a file’s metadata.

Decompresses the page, using buffer for decompression. If page.buffer.len() == 0, there was no decompression and the buffer was moved. Else, decompression took place.

Creates a new iterator of compressed pages.

Creates a new iterator of compressed pages.

Converts an iterator of DataPage into a single Array.

Converts an async stream of DataPage into a single Array.

Reads a Parquet file’s metadata synchronously.

Reads a Parquet file’s metadata asynchronously.
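
The buffer rule stated for the decompression function above (an empty `page.buffer` afterwards means the bytes were moved, not decompressed) can be modelled with a small std-only sketch. Everything here is an assumption made for illustration: the `CompressedPage` and `Page` structs, the `is_compressed` flag, and the toy run-length codec stand in for the crate's real types and codecs, and the signature is not the crate's.

```rust
/// Hypothetical stand-in for a compressed page; field names are illustrative.
struct CompressedPage {
    buffer: Vec<u8>,
    is_compressed: bool,
}

/// Hypothetical stand-in for an uncompressed page.
struct Page {
    buffer: Vec<u8>,
}

/// Toy codec (assumption): the input is a sequence of (count, byte) pairs.
fn rle_decode(input: &[u8], out: &mut Vec<u8>) {
    out.clear();
    for pair in input.chunks_exact(2) {
        out.extend(std::iter::repeat(pair[1]).take(pair[0] as usize));
    }
}

/// Produces an uncompressed page, using `scratch` as the decompression target.
/// If the page was not compressed, its buffer is moved out and left empty.
fn decompress(page: &mut CompressedPage, scratch: &mut Vec<u8>) -> Page {
    if page.is_compressed {
        rle_decode(&page.buffer, scratch);
        // Hand the decompressed bytes to the new page; `scratch` is left empty.
        Page { buffer: std::mem::take(scratch) }
    } else {
        // No decompression needed: move the bytes instead of copying them.
        Page { buffer: std::mem::take(&mut page.buffer) }
    }
}
```

In this model, a caller can check `page.buffer.len() == 0` after the call to tell the two cases apart, which mirrors the rule in the description.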

Type Definitions

Type declaration for a page filter