polars_parquet::arrow

Module read

APIs to read from Parquet format.

Re-exports§

pub use schema::infer_schema;
pub use schema::FileMetadata;
pub use crate::parquet::error::ParquetError;
pub use crate::parquet::metadata::ColumnChunkMetadata;
pub use crate::parquet::metadata::ColumnDescriptor;
pub use crate::parquet::metadata::RowGroupMetadata;
pub use crate::parquet::page::CompressedDataPage;
pub use crate::parquet::page::DataPageHeader;
pub use crate::parquet::page::Page;
pub use crate::parquet::read::decompress;
pub use crate::parquet::read::get_column_iterator;
pub use crate::parquet::read::read_metadata as _read_metadata;
pub use crate::parquet::read::BasicDecompressor;
pub use crate::parquet::read::MutStreamingIterator;
pub use crate::parquet::read::PageReader;
pub use crate::parquet::read::ReadColumnIterator;
pub use crate::parquet::read::State;
pub use crate::parquet::schema::types::GroupLogicalType;
pub use crate::parquet::schema::types::ParquetType;
pub use crate::parquet::schema::types::PhysicalType;
pub use crate::parquet::schema::types::PrimitiveConvertedType;
pub use crate::parquet::schema::types::PrimitiveLogicalType;
pub use crate::parquet::schema::types::TimeUnit as ParquetTimeUnit;
pub use crate::parquet::types::int96_to_i64_ns;
pub use crate::parquet::fallible_streaming_iterator;
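
As an illustration of what a helper such as int96_to_i64_ns computes, the following is a minimal sketch of the usual INT96 timestamp conversion. It assumes the common layout in which the first two 32-bit words hold the nanoseconds elapsed within the day and the third word holds the Julian day. The name int96_to_ns_sketch is hypothetical, and the re-exported function may differ in signature or overflow handling.

```rust
/// Converts a Parquet INT96 timestamp into nanoseconds since the Unix epoch.
///
/// Assumed layout: `value[0]` and `value[1]` are the low and high 32 bits of
/// the nanoseconds elapsed within the day, and `value[2]` is the Julian day.
pub fn int96_to_ns_sketch(value: [u32; 3]) -> i64 {
    // Julian day number of 1970-01-01.
    const JULIAN_DAY_OF_EPOCH: i64 = 2_440_588;
    const NANOS_PER_DAY: i64 = 86_400 * 1_000_000_000;

    let nanos_of_day = ((value[1] as i64) << 32) | value[0] as i64;
    let days_since_epoch = value[2] as i64 - JULIAN_DAY_OF_EPOCH;

    // Wrapping arithmetic avoids an overflow panic on far-out-of-range dates.
    days_since_epoch
        .wrapping_mul(NANOS_PER_DAY)
        .wrapping_add(nanos_of_day)
}
```

A value whose third word equals 2_440_588 and whose first two words are zero maps to the epoch itself; each additional Julian day adds 86_400_000_000_000 nanoseconds.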

Modules§

schema
APIs to handle Parquet <-> Arrow schemas.
statistics
APIs exposing crate::parquet’s statistics as arrow’s statistics.

Structs§

NestedState
The state of nested data types.

Enums§

Filter
InitNested
The initial info of nested data types. The bool indicates if the type is nullable.

Traits§

FallibleStreamingIterator
A fallible, streaming iterator.
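
To make the "fallible, streaming" idea concrete, here is a minimal sketch using a locally defined stand-in trait. It is not the re-exported trait itself, whose exact methods should be checked in its own documentation. The names FallibleStreaming, Utf8Chunks, and total_len are hypothetical. The sketch shows the two properties the description implies: advancing can fail, and each item is borrowed from the iterator, so one buffer can be reused across items.

```rust
/// Local stand-in for a fallible streaming iterator (an assumption, not the crate's trait).
pub trait FallibleStreaming {
    type Item: ?Sized;
    type Error;

    /// Moves to the next element; this is the step that can fail.
    fn advance(&mut self) -> Result<(), Self::Error>;

    /// Borrows the current element, or `None` before the first advance and after exhaustion.
    fn get(&self) -> Option<&Self::Item>;

    /// Advances and then borrows the current element.
    fn next(&mut self) -> Result<Option<&Self::Item>, Self::Error> {
        self.advance()?;
        Ok(self.get())
    }
}

/// Hypothetical example: yields each chunk as validated UTF-8, reusing one buffer.
pub struct Utf8Chunks<'a> {
    chunks: std::slice::Iter<'a, &'a [u8]>,
    buffer: String,
    has_item: bool,
}

impl<'a> Utf8Chunks<'a> {
    pub fn new(chunks: &'a [&'a [u8]]) -> Self {
        Self { chunks: chunks.iter(), buffer: String::new(), has_item: false }
    }
}

impl<'a> FallibleStreaming for Utf8Chunks<'a> {
    type Item = str;
    type Error = std::str::Utf8Error;

    fn advance(&mut self) -> Result<(), Self::Error> {
        self.has_item = false;
        self.buffer.clear();
        if let Some(bytes) = self.chunks.next() {
            // Invalid UTF-8 surfaces as an error from `advance`.
            let text = std::str::from_utf8(*bytes)?;
            self.buffer.push_str(text);
            self.has_item = true;
        }
        Ok(())
    }

    fn get(&self) -> Option<&str> {
        if self.has_item { Some(self.buffer.as_str()) } else { None }
    }
}

/// Drains the iterator and returns the total number of bytes seen.
pub fn total_len(chunks: &[&[u8]]) -> Result<usize, std::str::Utf8Error> {
    let mut iter = Utf8Chunks::new(chunks);
    let mut total = 0;
    while let Some(text) = iter.next()? {
        total += text.len();
    }
    Ok(total)
}
```

Because each item borrows from the iterator, it cannot be held across a call to advance. This is the trade-off that lets page readers decompress into a reused buffer instead of allocating per page.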

Functions§

column_iter_to_arrays
An iterator adapter that maps multiple iterators of PagesIter into an iterator of Arrays.
create_list
Creates a new ListArray or FixedSizeListArray.
create_map
Creates a new MapArray.
get_field_pages
Returns all ColumnChunkMetadata associated with field_name. For non-nested Parquet types, this returns a single column.
get_page_iterator
Creates a new iterator of compressed pages.
init_nested
Initialize NestedState from &[InitNested].
n_columns
Returns the number of (Parquet) columns that an ArrowDataType contains.
read_metadata
Reads a Parquet file’s metadata synchronously.
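
For context on what reading metadata involves, the sketch below locates the serialized file metadata in an in-memory, unencrypted Parquet file. It relies only on the layout defined by the Parquet format: the file starts and ends with the magic bytes PAR1, and the closing magic is preceded by a 4-byte little-endian metadata length. The names metadata_range and FooterError are hypothetical. This is not a transcription of read_metadata, which also has to decode the Thrift-encoded metadata found in that range.

```rust
/// Magic bytes that open and close an unencrypted Parquet file.
const MAGIC: &[u8; 4] = b"PAR1";
/// Trailer size: a 4-byte little-endian metadata length followed by the magic.
const TRAILER_LEN: usize = 8;

/// Hypothetical error type for this sketch.
#[derive(Debug, PartialEq)]
pub enum FooterError {
    TooShort,
    BadMagic,
    LengthOutOfBounds,
}

/// Returns the byte range of the serialized file metadata within `file`.
pub fn metadata_range(file: &[u8]) -> Result<std::ops::Range<usize>, FooterError> {
    // The smallest possible file holds the leading magic plus the trailer.
    if file.len() < MAGIC.len() + TRAILER_LEN {
        return Err(FooterError::TooShort);
    }
    if !file.starts_with(MAGIC) || !file.ends_with(MAGIC) {
        return Err(FooterError::BadMagic);
    }

    let (body, trailer) = file.split_at(file.len() - TRAILER_LEN);
    let len = u32::from_le_bytes([trailer[0], trailer[1], trailer[2], trailer[3]]) as usize;

    // The metadata must fit between the leading magic and the trailer.
    if len > body.len() - MAGIC.len() {
        return Err(FooterError::LengthOutOfBounds);
    }

    let end = body.len();
    Ok(end - len..end)
}
```

Reading from the end of the file is what allows a reader to fetch the schema and row-group layout with a small tail read before touching any column data.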