Struct datafusion_expr::AggregateUDF

pub struct AggregateUDF { /* private fields */ }

Logical representation of a user-defined aggregate function (UDAF).

An aggregate function combines the values from multiple input rows into a single output “aggregate” (summary) row. It differs from a scalar function in that it is stateful across batches. User-defined aggregate functions can be used as normal SQL aggregate functions (GROUP BY clause) as well as window functions (OVER clause).

AggregateUDF provides DataFusion the information needed to plan and call aggregate functions, including name, type information, and a factory function to create an Accumulator instance, to perform the actual aggregation.
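To make "stateful across batches" and "factory function" concrete, here is a minimal std-only sketch. The `SimpleAccumulator` trait, `AvgAccumulator`, `make_avg_accumulator`, and `average_over_batches` are hypothetical names invented for illustration; they only model the idea and are not the DataFusion `Accumulator` API.

```rust
/// Simplified stand-in for an accumulator: state that persists across batches.
/// (Hypothetical trait for illustration; not the DataFusion `Accumulator` API.)
trait SimpleAccumulator {
    /// Fold one batch of input values into the running state.
    fn update_batch(&mut self, values: &[f64]);
    /// Produce the final aggregate value from the state.
    fn evaluate(&self) -> Option<f64>;
}

/// Running state for an average: a sum and a row count.
#[derive(Default)]
struct AvgAccumulator {
    sum: f64,
    count: u64,
}

impl SimpleAccumulator for AvgAccumulator {
    fn update_batch(&mut self, values: &[f64]) {
        self.sum += values.iter().sum::<f64>();
        self.count += values.len() as u64;
    }

    fn evaluate(&self) -> Option<f64> {
        if self.count == 0 {
            None
        } else {
            Some(self.sum / self.count as f64)
        }
    }
}

/// Factory that hands out a fresh accumulator, e.g. one per group or partition.
fn make_avg_accumulator() -> Box<dyn SimpleAccumulator> {
    Box::new(AvgAccumulator::default())
}

/// Feed several batches through one accumulator and return the aggregate.
fn average_over_batches(batches: &[Vec<f64>]) -> Option<f64> {
    let mut acc = make_avg_accumulator();
    for batch in batches {
        acc.update_batch(batch);
    }
    acc.evaluate()
}
```

A scalar function would map each row independently; here the result depends on everything the accumulator has seen so far, which is why a fresh instance comes from a factory rather than being shared.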

For more information, please see the examples:

  1. For simple use cases, use create_udaf (examples in simple_udaf.rs).

  2. For advanced use cases, use AggregateUDFImpl which provides full API access (examples in advanced_udaf.rs).

API Note

This is a separate struct from AggregateUDFImpl to maintain backwards compatibility with the older API.

Implementations

impl AggregateUDF

pub fn new( name: &str, signature: &Signature, return_type: &ReturnTypeFunction, accumulator: &AccumulatorFactoryFunction, ) -> Self

👎Deprecated since 34.0.0: please implement AggregateUDFImpl instead

Create a new AggregateUDF

See AggregateUDFImpl for a more convenient way to create an AggregateUDF using trait objects.

pub fn new_from_impl<F>(fun: F) -> AggregateUDF
where F: AggregateUDFImpl + 'static,

Create a new AggregateUDF from an AggregateUDFImpl trait object.

Note this is the same as using the From impl (AggregateUDF::from)
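The equivalence between a `new_from_impl`-style constructor and a `From` impl can be sketched with std-only types. `UdafImpl`, `Udaf`, and `GeoMean` below are hypothetical stand-ins for `AggregateUDFImpl`, `AggregateUDF`, and a user implementation; the sketch assumes the wrapper simply stores the implementation behind an `Arc<dyn ...>`.

```rust
use std::sync::Arc;

/// Hypothetical trait standing in for `AggregateUDFImpl`.
trait UdafImpl {
    fn name(&self) -> &str;
}

/// Hypothetical wrapper standing in for `AggregateUDF`.
#[derive(Clone)]
struct Udaf {
    inner: Arc<dyn UdafImpl>,
}

impl Udaf {
    /// Wrap any implementation in a shared trait object.
    fn new_from_impl<F: UdafImpl + 'static>(fun: F) -> Self {
        Self { inner: Arc::new(fun) }
    }

    /// Delegate to the wrapped implementation.
    fn name(&self) -> &str {
        self.inner.name()
    }
}

/// `From` delegates to `new_from_impl`, so both construction paths are equivalent.
impl<F: UdafImpl + 'static> From<F> for Udaf {
    fn from(fun: F) -> Self {
        Self::new_from_impl(fun)
    }
}

/// A user-defined implementation (illustrative only).
struct GeoMean;

impl UdafImpl for GeoMean {
    fn name(&self) -> &str {
        "geo_mean"
    }
}
```

With these definitions, `Udaf::new_from_impl(GeoMean)` and `Udaf::from(GeoMean)` (or `GeoMean.into()`) produce wrappers that behave identically.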

pub fn inner(&self) -> &Arc<dyn AggregateUDFImpl>

Return the underlying AggregateUDFImpl trait object for this function

pub fn with_aliases( self, aliases: impl IntoIterator<Item = &'static str>, ) -> Self

Adds additional names that can be used to invoke this function, in addition to name.

If you implement AggregateUDFImpl directly, you should return the aliases from that implementation instead.
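As a rough illustration of the builder-style signature (`self` in, `Self` out) and of what an alias is for, here is a std-only sketch. `NamedFunction` and the `register` helper are hypothetical; how DataFusion actually resolves aliases is not shown here.

```rust
use std::collections::HashMap;

/// Hypothetical, simplified function handle with a primary name and aliases.
#[derive(Clone, Debug, PartialEq)]
struct NamedFunction {
    name: String,
    aliases: Vec<String>,
}

impl NamedFunction {
    fn new(name: &str) -> Self {
        Self { name: name.to_string(), aliases: Vec::new() }
    }

    /// Builder-style: consumes `self` and returns it with extra names attached.
    fn with_aliases(mut self, aliases: impl IntoIterator<Item = &'static str>) -> Self {
        self.aliases.extend(aliases.into_iter().map(String::from));
        self
    }
}

/// Register the function under its name and every alias (hypothetical registry).
fn register(registry: &mut HashMap<String, NamedFunction>, f: NamedFunction) {
    for alias in &f.aliases {
        registry.insert(alias.clone(), f.clone());
    }
    registry.insert(f.name.clone(), f);
}
```

After registering `NamedFunction::new("geo_mean").with_aliases(["geometric_mean", "gm"])`, a lookup by any of the three names returns the same function.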

pub fn call(&self, args: Vec<Expr>) -> Expr

Creates an Expr that calls the aggregate function.

This utility allows using the UDAF without requiring access to the registry, such as with the DataFrame API.
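The point of building the expression from the function handle is that no name lookup is needed. The sketch below models that idea with hypothetical std-only types (`Udaf`, a two-variant `Expr`, and a `render` helper); it is not the DataFusion `Expr` type.

```rust
/// Hypothetical function handle (stand-in for `AggregateUDF`).
#[derive(Debug, Clone, PartialEq)]
struct Udaf {
    name: String,
}

/// Hypothetical, minimal expression tree (stand-in for `Expr`).
#[derive(Debug, Clone, PartialEq)]
enum Expr {
    Column(String),
    /// The call node holds the function itself, not just its name,
    /// so no registry lookup is needed to resolve it later.
    AggregateCall { func: Udaf, args: Vec<Expr> },
}

impl Udaf {
    /// Build a call expression directly from the function handle.
    fn call(&self, args: Vec<Expr>) -> Expr {
        Expr::AggregateCall { func: self.clone(), args }
    }
}

/// Render an expression as text, e.g. for display in a plan.
fn render(expr: &Expr) -> String {
    match expr {
        Expr::Column(name) => name.clone(),
        Expr::AggregateCall { func, args } => {
            let rendered: Vec<String> = args.iter().map(render).collect();
            format!("{}({})", func.name, rendered.join(", "))
        }
    }
}
```

For example, calling `call` on a handle named `geo_mean` with a single `price` column renders as `geo_mean(price)`.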

pub fn name(&self) -> &str

Returns this function’s name

See AggregateUDFImpl::name for more details.

pub fn aliases(&self) -> &[String]

Returns the aliases for this function.

pub fn signature(&self) -> &Signature

Returns this function’s signature (what input types are accepted)

See AggregateUDFImpl::signature for more details.

pub fn return_type(&self, args: &[DataType]) -> Result<DataType>

Return the type of the function given its input types

See AggregateUDFImpl::return_type for more details.

pub fn accumulator( &self, acc_args: AccumulatorArgs<'_>, ) -> Result<Box<dyn Accumulator>>

Return an accumulator for the given aggregate, given its return data type.

pub fn state_fields(&self, args: StateFieldsArgs<'_>) -> Result<Vec<Field>>

Return the fields used to store the intermediate state for this aggregator, given the name of the aggregate, value type and ordering fields. See AggregateUDFImpl::state_fields for more details.

This is used to support multi-phase aggregations
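Why intermediate state needs its own fields can be seen with an average computed in two phases. This is a std-only sketch under the assumption that each partition produces a partial state which is merged later; `AvgState`, `partial_aggregate`, and `final_aggregate` are hypothetical names, not DataFusion APIs.

```rust
/// Intermediate state for an average: the "state fields" are (sum, count).
/// Hypothetical model for illustration; not the DataFusion API.
#[derive(Debug, Clone, Copy, PartialEq, Default)]
struct AvgState {
    sum: f64,
    count: u64,
}

/// Phase 1: each partition reduces its own rows to a partial state.
fn partial_aggregate(rows: &[f64]) -> AvgState {
    AvgState {
        sum: rows.iter().sum(),
        count: rows.len() as u64,
    }
}

/// Phase 2: merge the partial states, then compute the final value.
fn final_aggregate(states: &[AvgState]) -> Option<f64> {
    let merged = states.iter().fold(AvgState::default(), |acc, s| AvgState {
        sum: acc.sum + s.sum,
        count: acc.count + s.count,
    });
    if merged.count == 0 {
        None
    } else {
        Some(merged.sum / merged.count as f64)
    }
}
```

Passing only the per-partition averages between phases would not be enough: partitions of different sizes must be weighted by their counts, so the state carries both the sum and the count.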

pub fn groups_accumulator_supported(&self, args: AccumulatorArgs<'_>) -> bool

pub fn create_groups_accumulator( &self, args: AccumulatorArgs<'_>, ) -> Result<Box<dyn GroupsAccumulator>>

pub fn create_sliding_accumulator( &self, args: AccumulatorArgs<'_>, ) -> Result<Box<dyn Accumulator>>

pub fn coerce_types(&self, arg_types: &[DataType]) -> Result<Vec<DataType>>

pub fn with_beneficial_ordering( self, beneficial_ordering: bool, ) -> Result<Option<AggregateUDF>>

pub fn order_sensitivity(&self) -> AggregateOrderSensitivity

Gets the order sensitivity of the UDF. See AggregateOrderSensitivity for possible options.

pub fn reverse_udf(&self) -> ReversedUDAF

Reverses the AggregateUDF, i.e. returns the AggregateUDF that generates the same result as this AggregateUDF when the input is iterated in reverse order. The returned ReversedUDAF also indicates the case where no such AggregateUDF exists.
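The idea of a reverse counterpart can be illustrated with a toy model. Everything below (`Agg`, `Reversed`, `reverse`, `evaluate`) is hypothetical and std-only; the variant names are assumptions chosen for the sketch and should not be read as the definition of `ReversedUDAF`.

```rust
/// Hypothetical aggregate kinds used only for this illustration.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Agg {
    Sum,
    FirstValue,
    LastValue,
    /// Left-to-right subtraction: order-dependent, with no counterpart here.
    FoldSub,
}

/// Hypothetical result of asking for the reverse of an aggregate.
#[derive(Debug, PartialEq)]
enum Reversed {
    /// The same aggregate gives the same result on reversed input.
    Identical,
    /// A different aggregate gives the same result on reversed input.
    Reversed(Agg),
    /// No such aggregate is known.
    NotSupported,
}

fn reverse(agg: Agg) -> Reversed {
    match agg {
        Agg::Sum => Reversed::Identical,
        Agg::FirstValue => Reversed::Reversed(Agg::LastValue),
        Agg::LastValue => Reversed::Reversed(Agg::FirstValue),
        Agg::FoldSub => Reversed::NotSupported,
    }
}

/// Evaluate an aggregate over values in the given order.
fn evaluate(agg: Agg, values: &[i64]) -> Option<i64> {
    match agg {
        Agg::Sum => Some(values.iter().sum()),
        Agg::FirstValue => values.first().copied(),
        Agg::LastValue => values.last().copied(),
        Agg::FoldSub => values.iter().copied().reduce(|a, b| a - b),
    }
}
```

In this model, `FirstValue` over the original order equals `LastValue` over the reversed order, which is the kind of substitution that lets a planner reuse an existing sort in the opposite direction.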

pub fn simplify(&self) -> Option<AggregateFunctionSimplification>

Returns the function rewrite (simplification) for this aggregate, if any.

See AggregateUDFImpl::simplify for more details.

pub fn is_descending(&self) -> Option<bool>

Returns Some(true) if the function is max, Some(false) if the function is min, and None in all other cases. This is used in certain optimizations.

Trait Implementations

impl Clone for AggregateUDF

fn clone(&self) -> AggregateUDF

Returns a copy of the value.

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.

impl Debug for AggregateUDF

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

impl Display for AggregateUDF

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

impl<F> From<F> for AggregateUDF
where F: AggregateUDFImpl + Send + Sync + 'static,

fn from(fun: F) -> Self

Converts to this type from the input type.

impl Hash for AggregateUDF

fn hash<H: Hasher>(&self, state: &mut H)

Feeds this value into the given Hasher.

fn hash_slice<H>(data: &[Self], state: &mut H)
where H: Hasher, Self: Sized,

Feeds a slice of this type into the given Hasher.

impl PartialEq for AggregateUDF

fn eq(&self, other: &Self) -> bool

Tests for self and other values to be equal, and is used by ==.

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.

impl Eq for AggregateUDF

Auto Trait Implementations

Blanket Implementations

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self.

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value.

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.

impl<T> CloneToUninit for T
where T: Clone,

default unsafe fn clone_to_uninit(&self, dst: *mut T)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dst.

impl<Q, K> Equivalent<K> for Q
where Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

fn equivalent(&self, key: &K) -> bool

Checks if this value is equivalent to the given key.

impl<Q, K> Equivalent<K> for Q
where Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

fn equivalent(&self, key: &K) -> bool

Compare self to key and return true if they are equal.

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning.

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning.

impl<T> ToString for T
where T: Display + ?Sized,

default fn to_string(&self) -> String

Converts the given value to a String.

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.