pub struct List {
pub offsets: Option<Box<ArrayEncoding>>,
pub null_offset_adjustment: u64,
pub num_items: u64,
}
Expand description
An array encoding for variable-length list fields
Fields§
§offsets: Option<Box<ArrayEncoding>>
An array containing the offsets into an items array.
This array will have num_rows items and will never have nulls.
If the list at index i is not null then offsets[i] will
contain base + len(list)
where base
is defined as:
i == 0: 0
i > 0: (offsets[i-1] % null_offset_adjustment)
To help understand we can consider the following example list: [ [A, B], null, [], [C, D, E] ]
The offsets will be [2, ?, 2, 5]
If the incoming list at index i IS null then offsets[i] will
contain base + len(list) + null_offset_adjustment
where base
is defined the same as above.
To complete the above example let’s assume that null_offset_adjustment
is 7. Then the offsets will be [2, 9, 2, 5]
If there are no nulls then the offsets we write here are exactly the same as the offsets in an Arrow list array (except we omit the leading 0 which is redundant)
The reason we do this is so that reading a single list at index i only requires us to load the indices at i and i-1.
If the offset at index i is greater than `null_offset_adjustment`` then the list at index i is null.
Otherwise the length of the list is offsets\[i\] - base
where
base is defined the same as above.
Let’s consider our example offsets: [2, 9, 2, 5]
We can take any range of lists and determine how many list items are referenced by the sublist.
0..3: [, 5] -> items 0..5 (base = 0* and end is 5) 0..2: [, 2] -> items 0..2 (base = 0* and end is 2) 0..1: [_, 9] -> items 0..2 (base = 0* and end is 9 % 7) 1..3: [2, 5] -> items 2..5 (base = 2 and end is 5) 1..2: [2, 2] -> items 2..2 (base = 2 and end is 2) 2..3: [9, 5] -> items 2..5 (base = 9 % 7 and end is 5)
- When the start of our range is the 0th item the base is always 0 and we only need to load a single index from disk to determine the range.
The data type of the offsets array is flexible and does not need to match the data type of the destination array. Please note that the offsets array is very likely to be efficiently encoded by bit packing deltas.
null_offset_adjustment: u64
If a list is null then we add this value to the offset
This value must be greater than the length of the items so that (offset + null_offset_adjustment) is never used by a non-null list.
Note that this value cannot be equal to the length of the items because then a page with a single list would store [ X ] and we couldn’t know if that is a null list or a list with X items.
Therefore, the best choice for this value is 1 + # of items. Choosing this will maximize the bit packing that we can apply to the offsets.
num_items: u64
How many items are referenced by these offsets. This is needed in order to determine which items pages map to this offsets page.
Trait Implementations§
Source§impl Message for List
impl Message for List
Source§fn encoded_len(&self) -> usize
fn encoded_len(&self) -> usize
Source§fn encode<B>(&self, buf: &mut B) -> Result<(), EncodeError>
fn encode<B>(&self, buf: &mut B) -> Result<(), EncodeError>
Source§fn encode_to_vec(&self) -> Vec<u8> ⓘwhere
Self: Sized,
fn encode_to_vec(&self) -> Vec<u8> ⓘwhere
Self: Sized,
Source§fn encode_length_delimited<B>(&self, buf: &mut B) -> Result<(), EncodeError>
fn encode_length_delimited<B>(&self, buf: &mut B) -> Result<(), EncodeError>
Source§fn encode_length_delimited_to_vec(&self) -> Vec<u8> ⓘwhere
Self: Sized,
fn encode_length_delimited_to_vec(&self) -> Vec<u8> ⓘwhere
Self: Sized,
Source§fn decode<B>(buf: B) -> Result<Self, DecodeError>
fn decode<B>(buf: B) -> Result<Self, DecodeError>
Source§fn decode_length_delimited<B>(buf: B) -> Result<Self, DecodeError>
fn decode_length_delimited<B>(buf: B) -> Result<Self, DecodeError>
Source§fn merge<B>(&mut self, buf: B) -> Result<(), DecodeError>
fn merge<B>(&mut self, buf: B) -> Result<(), DecodeError>
self
. Read moreSource§fn merge_length_delimited<B>(&mut self, buf: B) -> Result<(), DecodeError>
fn merge_length_delimited<B>(&mut self, buf: B) -> Result<(), DecodeError>
self
.Source§impl Name for List
impl Name for List
Source§const NAME: &'static str = "List"
const NAME: &'static str = "List"
Message
.
This name is the same as it appears in the source .proto file, e.g. FooBar
.Source§const PACKAGE: &'static str = "lance.encodings"
const PACKAGE: &'static str = "lance.encodings"
.
, e.g. google.protobuf
.Source§fn full_name() -> String
fn full_name() -> String
Message
.
It’s prefixed with the package name and names of any parent messages,
e.g. google.rpc.BadRequest.FieldViolation
.
By default, this is the package name followed by the message name.
Fully-qualified names must be unique within a domain of Type URLs.impl StructuralPartialEq for List
Auto Trait Implementations§
impl Freeze for List
impl RefUnwindSafe for List
impl Send for List
impl Sync for List
impl Unpin for List
impl UnwindSafe for List
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§unsafe fn clone_to_uninit(&self, dst: *mut T)
unsafe fn clone_to_uninit(&self, dst: *mut T)
clone_to_uninit
)Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
self
into a Left
variant of Either<Self, Self>
if into_left
is true
.
Converts self
into a Right
variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
self
into a Left
variant of Either<Self, Self>
if into_left(&self)
returns true
.
Converts self
into a Right
variant of Either<Self, Self>
otherwise. Read more