lance_encoding::format

Module pb

Source
Expand description

Protobuf definitions for encodings

Modules§

  • Nested message and enum types in ArrayEncoding.
  • Nested message and enum types in Buffer.
  • Nested message and enum types in ColumnEncoding.
  • Nested message and enum types in Nullable.
  • Nested message and enum types in PageLayout.

Structs§

  • / A layout used for pages where all values are null / / In addition, there can be no repetition levels and only a single definition level / / If the data is all-null but we have non-trivial rep-def then MiniBlockLayout is used
  • Encodings that decode into an Arrow array
  • An array encoding for binary fields
  • Items are bitpacked in a buffer
  • Items are bitpacked in a buffer
  • Marks a column as blob data. It will contain a packed struct with fields position and size (u64)
  • A pointer to a buffer in a Lance file
  • Encodings that describe a column of values
  • Compression algorithm where all values have a constant value
  • An array encoding for dictionary-encoded fields
  • An array encoding for fixed-size list fields
  • Fixed width items placed contiguously in a buffer
  • / A layout used for pages where the data is large / / In this case the cost of transposing the data is relatively small (compared to the cost of writing the data) / and so we just zip the buffers together
  • An array encoding for variable-length list fields
  • / A layout used for pages where the data is small / / In this case we can fit many values into a single disk sector and transposing buffers is / expensive. As a result, we do not transpose the buffers but compress the data into small / chunks (called mini blocks) which are roughly the size of a disk sector.
  • An encoding that adds nullability to another array encoding
  • An array encoding for shredded structs that will never be null
  • Wraps a column with a zone map index that can be used to apply pushdown filters