polars_arrow::array

Struct DictionaryArray

pub struct DictionaryArray<K: DictionaryKey> { /* private fields */ }

An Array that stores its values as integer keys (indices) into a separate array of distinct values. This layout is useful when the cardinality of the values is low compared to the length of the Array.
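
To make the layout concrete, here is a minimal std-only sketch of dictionary encoding. It is a toy model and does not use polars_arrow; the helper names dictionary_encode and dictionary_decode are hypothetical and exist only for illustration.

```rust
use std::collections::HashMap;

/// Dictionary-encodes `items`: returns one key per item plus the distinct
/// values in order of first appearance.
/// Hypothetical helper for illustration; not part of polars_arrow.
pub fn dictionary_encode(items: &[&str]) -> (Vec<u32>, Vec<String>) {
    let mut index: HashMap<&str, u32> = HashMap::new();
    let mut keys = Vec::with_capacity(items.len());
    let mut values: Vec<String> = Vec::new();
    for &item in items {
        // Reuse the existing key, or append a new distinct value.
        let key = *index.entry(item).or_insert_with(|| {
            values.push(item.to_string());
            (values.len() - 1) as u32
        });
        keys.push(key);
    }
    (keys, values)
}

/// Decodes by looking every key up in `values`.
pub fn dictionary_decode(keys: &[u32], values: &[String]) -> Vec<String> {
    keys.iter().map(|&k| values[k as usize].clone()).collect()
}
```

With five items drawn from three distinct strings, the encoded form holds five small integers and three strings, which is where the savings for low-cardinality data come from.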

§Safety

This struct guarantees that each item of DictionaryArray::keys is castable to usize and is smaller than DictionaryArray::values.len(). In other words, you can safely use unchecked calls to retrieve the values.
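
The following sketch shows why such an invariant makes unchecked lookups sound: the keys are validated once at construction, so later reads can skip the bounds check. ToyDictionary is a hypothetical std-only stand-in, not the polars_arrow type, and its method names only mirror the ones documented here.

```rust
/// A toy dictionary with `i32` keys (hypothetical; not the polars_arrow type).
pub struct ToyDictionary {
    keys: Vec<i32>,
    values: Vec<String>,
}

impl ToyDictionary {
    /// Validates every key once, in O(N): each key must be non-negative
    /// (so it converts to `usize`) and smaller than `values.len()`.
    pub fn try_new(keys: Vec<i32>, values: Vec<String>) -> Result<Self, String> {
        for (slot, &key) in keys.iter().enumerate() {
            if key < 0 || key as usize >= values.len() {
                return Err(format!("key {} at slot {} is out of bounds", key, slot));
            }
        }
        Ok(Self { keys, values })
    }

    /// Returns the value at `slot`; panics if `slot >= self.keys.len()`.
    pub fn value(&self, slot: usize) -> &str {
        let key = self.keys[slot] as usize;
        // SAFETY: `try_new` checked that every key is in `0..values.len()`,
        // and neither field is mutated after construction.
        unsafe { self.values.get_unchecked(key).as_str() }
    }
}
```

The design choice is to pay the O(N) validation cost once so that every later lookup avoids a per-access check.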

Implementations§

impl<K: DictionaryKey> DictionaryArray<K>

pub fn try_new( dtype: ArrowDataType, keys: PrimitiveArray<K>, values: Box<dyn Array>, ) -> PolarsResult<Self>

Returns a new DictionaryArray.

§Implementation

This function is O(N), where N is the length of keys.

§Errors

This function errors iff:

  • the dtype’s logical type is not a dictionary type
  • the dtype’s key type is not compatible with keys
  • the dtype’s value type is not equal to values.dtype()
  • any key’s value cannot be represented as a usize or is >= values.len()

pub fn try_from_keys( keys: PrimitiveArray<K>, values: Box<dyn Array>, ) -> PolarsResult<Self>

Returns a new DictionaryArray.

§Implementation

This function is O(N), where N is the length of keys.

§Errors

This function errors iff:

  • any key’s value cannot be represented as a usize or is >= values.len()

pub unsafe fn try_new_unchecked( dtype: ArrowDataType, keys: PrimitiveArray<K>, values: Box<dyn Array>, ) -> PolarsResult<Self>

Returns a new DictionaryArray without checking that the keys are in bounds.

§Errors

This function errors iff:

  • the dtype’s logical type is not a dictionary type
  • the dtype’s key type is not compatible with keys
  • the dtype’s value type is not equal to values.dtype()

§Safety

The caller must ensure that every key’s value can be represented as a usize and is < values.len().

pub fn new_empty(dtype: ArrowDataType) -> Self

Returns a new empty DictionaryArray.

pub fn new_null(dtype: ArrowDataType, length: usize) -> Self

Returns a DictionaryArray in which all elements are null.

pub fn iter( &self, ) -> ZipValidity<Box<dyn Scalar>, DictionaryValuesIter<'_, K>, BitmapIter<'_>>

Returns an iterator of Option<Box<dyn Scalar>>.

§Implementation

This function allocates a new Scalar per item and is usually slow. Consider calling keys_iter and values, downcasting values once, and looking each key up in the downcast array instead.
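
A rough std-only sketch of that advice: downcast the type-erased values once, then resolve each optional key against the concrete container. Here &dyn Any stands in for the values array and Vec<String> for a concrete array type; both are assumptions made for illustration, and resolve_typed is a hypothetical helper rather than a polars_arrow function.

```rust
use std::any::Any;

/// Resolves optional keys against type-erased values by downcasting once.
/// Returns `None` if `values` is not a `Vec<String>`.
/// Hypothetical helper; `&dyn Any` stands in for a type-erased array.
pub fn resolve_typed<'a>(
    keys: &[Option<usize>],
    values: &'a dyn Any,
) -> Option<Vec<Option<&'a str>>> {
    // One downcast for the whole array instead of one boxed scalar per item.
    let typed: &'a Vec<String> = values.downcast_ref::<Vec<String>>()?;
    Some(
        keys.iter()
            // A null key stays `None`; a valid key is looked up directly.
            .map(|key| key.map(|k| typed[k].as_str()))
            .collect(),
    )
}
```

The lookup borrows from the values instead of allocating, so the per-item cost is a single index operation.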

pub fn values_iter(&self) -> DictionaryValuesIter<'_, K>

Returns an iterator of Box<dyn Scalar>.

§Implementation

This function allocates a new Scalar per item and is usually slow. Consider calling keys_iter and values, downcasting values once, and looking each key up in the downcast array instead.

pub fn values_iter_typed<V: DictValue>( &self, ) -> PolarsResult<DictionaryValuesIterTyped<'_, K, V>>

Returns an iterator over the values as V::IterValue.

§Panics

Panics if the keys of this DictionaryArray contain any nulls. If they do, use DictionaryArray::iter_typed instead.

pub fn iter_typed<V: DictValue>( &self, ) -> PolarsResult<DictionaryIterTyped<'_, K, V>>

Returns an iterator over the optional values as Option<V::IterValue>.

pub fn dtype(&self) -> &ArrowDataType

Returns the ArrowDataType of this DictionaryArray.

pub fn is_ordered(&self) -> bool

Returns whether the values of this DictionaryArray are ordered.

pub fn slice(&mut self, offset: usize, length: usize)

Slices this DictionaryArray.

§Panics

Panics iff offset + length > self.len().

pub unsafe fn slice_unchecked(&mut self, offset: usize, length: usize)

Slices this DictionaryArray without checking bounds.

§Safety

The caller must ensure that offset + length <= self.len().

pub fn sliced(self, offset: usize, length: usize) -> Self

Returns this array sliced.

§Implementation

This function is O(1).

§Panics

Panics iff offset + length > self.len().
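
As an illustration of how a slice can be O(1), the sketch below keeps the buffers behind shared pointers and only moves a window over the keys. It assumes, for the sake of the example, that slicing touches only the keys while the values stay shared; SharedDictionary is a hypothetical std-only model and not the polars_arrow implementation.

```rust
use std::sync::Arc;

/// A toy dictionary whose buffers are shared, so slicing only adjusts a
/// window over the keys (hypothetical; not the polars_arrow type).
#[derive(Clone, Debug)]
pub struct SharedDictionary {
    keys: Arc<[u32]>,
    values: Arc<[String]>,
    offset: usize,
    length: usize,
}

impl SharedDictionary {
    pub fn new(keys: Vec<u32>, values: Vec<String>) -> Self {
        let length = keys.len();
        Self { keys: keys.into(), values: values.into(), offset: 0, length }
    }

    pub fn len(&self) -> usize {
        self.length
    }

    /// O(1): copies no keys or values.
    /// Panics iff `offset + length > self.len()`.
    pub fn sliced(mut self, offset: usize, length: usize) -> Self {
        assert!(offset + length <= self.length, "slice out of bounds");
        self.offset += offset;
        self.length = length;
        self
    }

    /// Returns the value at `slot` of the current window.
    pub fn value(&self, slot: usize) -> &str {
        assert!(slot < self.length, "slot out of bounds");
        &self.values[self.keys[self.offset + slot] as usize]
    }
}
```

Cloning such a structure before slicing only bumps reference counts, so the original and the slice keep sharing the same buffers.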

pub unsafe fn sliced_unchecked(self, offset: usize, length: usize) -> Self

Returns this array sliced, without checking bounds.

§Implementation

This function is O(1).

§Safety

The caller must ensure that offset + length <= self.len().

pub fn with_validity(self, validity: Option<Bitmap>) -> Self

Returns this DictionaryArray with a new validity.

§Panics

This function panics iff validity.len() != self.len().

pub fn set_validity(&mut self, validity: Option<Bitmap>)

Sets the validity of the keys of this DictionaryArray.

§Panics

This function panics iff validity.len() != self.len().

pub fn boxed(self) -> Box<dyn Array>

Boxes this array into a Box<dyn Array>.

pub fn arced(self) -> Arc<dyn Array>

Arcs this array into a std::sync::Arc<dyn Array>.

pub fn len(&self) -> usize

Returns the length of this array.

pub fn validity(&self) -> Option<&Bitmap>

The optional validity. Equivalent to self.keys().validity().

pub fn keys(&self) -> &PrimitiveArray<K>

Returns the keys of the DictionaryArray. These keys can be used to fetch values from values.

pub fn keys_values_iter(&self) -> impl TrustedLen<Item = usize> + Clone + '_

Returns an iterator over the values of the keys of this DictionaryArray as usize.

pub fn keys_iter(&self) -> impl TrustedLen<Item = Option<usize>> + Clone + '_

Returns an iterator over the keys of this DictionaryArray as Option<usize>.
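
The difference between the two item types (usize versus Option<usize>) can be sketched with plain slices. This is a std-only model under the assumption that the plain-usize view ignores validity while the optional view masks null slots; raw_keys and optional_keys are hypothetical helpers, and a &[bool] stands in for the validity Bitmap.

```rust
/// Raw key values as `usize`, with no validity applied.
/// Hypothetical helper for illustration.
pub fn raw_keys(keys: &[u32]) -> Vec<usize> {
    keys.iter().map(|&k| k as usize).collect()
}

/// Keys combined with an optional validity mask: a slot whose validity bit
/// is `false` yields `None`. With no mask, every slot is valid.
/// Panics if the mask length differs from the number of keys.
pub fn optional_keys(keys: &[u32], validity: Option<&[bool]>) -> Vec<Option<usize>> {
    if let Some(bits) = validity {
        assert_eq!(bits.len(), keys.len(), "validity must match the number of keys");
    }
    keys.iter()
        .enumerate()
        .map(|(slot, &k)| match validity {
            Some(bits) if !bits[slot] => None,
            _ => Some(k as usize),
        })
        .collect()
}
```

The length check in optional_keys mirrors the panic condition that with_validity and set_validity document for validity.len() != self.len().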

pub fn key_value(&self, index: usize) -> usize

Returns the value of the key at position index as usize.

§Panics

This function panics iff index >= self.len().

pub fn values(&self) -> &Box<dyn Array>

Returns the values of the DictionaryArray.

pub fn value(&self, index: usize) -> Box<dyn Scalar>

Returns the value of the DictionaryArray at position index.

§Implementation

This function allocates a new Scalar and is usually slow. Consider calling keys and values, downcasting values once, and looking each key up in the downcast array instead.

§Panics

This function panics iff index >= self.len().

pub fn take(self) -> (ArrowDataType, PrimitiveArray<K>, Box<dyn Array>)

Trait Implementations§

impl<K: DictionaryKey> Array for DictionaryArray<K>

fn as_any(&self) -> &dyn Any

Converts itself to a reference of Any, which enables downcasting to concrete types.

fn as_any_mut(&mut self) -> &mut dyn Any

Converts itself to a mutable reference of Any, which enables mutable downcasting to concrete types.

fn len(&self) -> usize

The length of the Array. Every array has a length corresponding to the number of elements (slots).

fn dtype(&self) -> &ArrowDataType

The ArrowDataType of the Array. In combination with Array::as_any, this can be used to downcast trait objects (dyn Array) to concrete arrays.

fn split_at_boxed(&self, offset: usize) -> (Box<dyn Array>, Box<dyn Array>)

Split Self at offset into two boxed Arrays where offset <= self.len().

unsafe fn split_at_boxed_unchecked( &self, offset: usize, ) -> (Box<dyn Array>, Box<dyn Array>)

Split Self at offset into two boxed Arrays without checking offset <= self.len().

fn slice(&mut self, offset: usize, length: usize)

Slices this Array.

unsafe fn slice_unchecked(&mut self, offset: usize, length: usize)

Slices the Array.

fn to_boxed(&self) -> Box<dyn Array>

Clone a &dyn Array to an owned Box<dyn Array>.

fn validity(&self) -> Option<&Bitmap>

The validity of the Array: every array has an optional Bitmap that, when available, specifies whether each array slot is valid or null. When the validity is None, all slots are valid.

fn with_validity(&self, validity: Option<Bitmap>) -> Box<dyn Array>

Clones this Array with a new assigned bitmap.

fn is_empty(&self) -> bool

Whether the array is empty.

fn null_count(&self) -> usize

The number of null slots on this Array.

fn has_nulls(&self) -> bool

fn is_null(&self, i: usize) -> bool

Returns whether slot i is null.

unsafe fn is_null_unchecked(&self, i: usize) -> bool

Returns whether slot i is null.

fn is_valid(&self, i: usize) -> bool

Returns whether slot i is valid.

fn sliced(&self, offset: usize, length: usize) -> Box<dyn Array>

Returns a slice of this Array.

unsafe fn sliced_unchecked( &self, offset: usize, length: usize, ) -> Box<dyn Array>

Returns a slice of this Array.

impl<K: Clone + DictionaryKey> Clone for DictionaryArray<K>

fn clone(&self) -> DictionaryArray<K>

Returns a copy of the value.

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.

impl<K: DictionaryKey> Debug for DictionaryArray<K>

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

impl<'a, T: DictionaryKey> From<GrowableDictionary<'a, T>> for DictionaryArray<T>

fn from(val: GrowableDictionary<'a, T>) -> Self

Converts to this type from the input type.

impl<K: DictionaryKey, M: MutableArray> From<MutableDictionaryArray<K, M>> for DictionaryArray<K>

fn from(other: MutableDictionaryArray<K, M>) -> Self

Converts to this type from the input type.

impl<'a, K: DictionaryKey> IntoIterator for &'a DictionaryArray<K>

type Item = Option<Box<dyn Scalar>>

The type of the elements being iterated over.

type IntoIter = ZipValidity<Box<dyn Scalar>, DictionaryValuesIter<'a, K>, BitmapIter<'a>>

Which kind of iterator are we turning this into?

fn into_iter(self) -> Self::IntoIter

Creates an iterator from a value.

impl<K: DictionaryKey> PartialEq<&(dyn Array + 'static)> for DictionaryArray<K>

fn eq(&self, other: &&dyn Array) -> bool

Tests for self and other values to be equal, and is used by ==.

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.

impl<K: DictionaryKey> PartialEq for DictionaryArray<K>

fn eq(&self, other: &Self) -> bool

Tests for self and other values to be equal, and is used by ==.

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.

impl<K: DictionaryKey> Splitable for DictionaryArray<K>

fn check_bound(&self, offset: usize) -> bool

unsafe fn _split_at_unchecked(&self, offset: usize) -> (Self, Self)

Internal implementation of split_at_unchecked. For any usage, prefer using split_at or split_at_unchecked.

fn split_at(&self, offset: usize) -> (Self, Self)

Split Self at offset where offset <= self.len().

unsafe fn split_at_unchecked(&self, offset: usize) -> (Self, Self)

Split Self at offset without checking offset <= self.len().

Auto Trait Implementations§

impl<K> !Freeze for DictionaryArray<K>

impl<K> !RefUnwindSafe for DictionaryArray<K>

impl<K> Send for DictionaryArray<K>

impl<K> Sync for DictionaryArray<K>

impl<K> Unpin for DictionaryArray<K>
where K: Unpin,

impl<K> !UnwindSafe for DictionaryArray<K>

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self.

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value.

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dst: *mut u8)

🔬 This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dst.

impl<T> DynClone for T
where T: Clone,

fn __clone_box(&self, _: Private) -> *mut ()

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> IntoEither for T

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.

impl<T> Pointable for T

const ALIGN: usize = _

The alignment of pointer.

type Init = T

The type for initializers.

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer.

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer.

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer.

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer.

impl<T> SlicedArray for T
where T: Array + Clone,

fn slice_typed(&self, offset: usize, length: usize) -> T

Slices this Array.

unsafe fn slice_typed_unchecked(&self, offset: usize, length: usize) -> T

Slices the Array.

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning.

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning.

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

fn vzip(self) -> V