Struct Toc

pub struct Toc { /* private fields */ }

Expand description

Trie Occurrence Counter is frequency dictionary that uses any impl Iterator<Item = char> type as occurrent.

OOB English letters A–Za–z support only.

use t_oc::Toc;
use std::panic::catch_unwind;

let mut toc = Toc::new();
let occurrent = "true";

_ = toc.add(occurrent.chars(), None);
_ = toc.add(true.to_string().chars(), None);

assert_eq!(2, toc.acq(occurrent.chars()).uproot());
toc.put(occurrent.chars(), 15);
assert_eq!(15, toc.acq(occurrent.chars()).uproot());

let catch = catch_unwind(move|| _ = toc.add("#&%".chars(), None));
assert!(catch.is_err());

When asymptotic computational complexity is not explicitly specified , it is:

TC: Θ(c) where c is count of chars iterated over.
SC: Θ(0).

Implementations§

impl Toc

pub fn new() -> Self

Constructs default version of Toc, i.e. via fn new_with() with english_letters::{ix, re, ALPHABET_LEN}.

pub fn new_with(ix: Ix, re: Option<Re>, ab_len: usize) -> Self

Allows to use custom alphabet different from default alphabet.

use t_oc::Toc;

fn ix(c: char) -> usize {
    match c {
        '&' => 0,
        '|' => 1,
        _ => panic!(),
    }
}

// if `fn Toc::ext` will not be used, pass `None` for `re`
fn re(i: usize) -> char {
    match i {
        0 => '&',
        1 => '|',
        _ => panic!(),
    }
}    

let ab_len = 2;

let mut toc = Toc::new_with(ix, Some(re), ab_len);
let a = "&";
let b = "|";
let aba = "&|&";
_ = toc.add(a.chars(), None);
_ = toc.add(a.chars(), None);
_ = toc.add(b.chars(), None);
_ = toc.add(aba.chars(), None);
assert_eq!(2, toc.acq(a.chars()).uproot());
assert_eq!(1, toc.acq(aba.chars()).uproot());

pub fn put_trace_cap(&mut self, approx_cap: usize) -> usize

Used to set internal backtracing buffer capacity.

Toc uses internal buffer, to avoid excessive allocations and copying, which grows over time due backtracing in rem method which backtraces whole path from entry node to root node.

Use this method to shrink or extend it to fit actual program needs. Neither shrinking nor extending is guaranteed to be exact. See Vec::with_capacity() and Vec::reserve(). For optimal rem performance, set approx_cap to, at least, occurrent.count().

Some high value is sufficient anyway. Since buffer continuous usage, its capacity will likely expand at some point in time to size sufficient to all occurrents.

Return value is actual buffer capacity.

Note: While String is UTF8 encoded, its byte length does not have to equal its char count which is either equal or lesser.

let sights = "🤩";
assert_eq!(4, sights.len());
assert_eq!(1, sights.chars().count());

let yes = "sí";
assert_eq!(3, yes.len());
assert_eq!(2, yes.chars().nth(1).unwrap().len_utf8());

let abc = "abc";
assert_eq!(3, abc.len());

pub fn acq_trace_cap(&self) -> usize

Used to obtain internal backtracing buffer capacity.

Check with fn put_trace_cap for details.

pub fn add( &mut self, occurrent: impl Iterator<Item = char>, val: Option<usize>, ) -> AddRes

Used to add occurence to tree.

Counter is of word size. Add overflow is wrapped using wrapping_add.

Optional val parameter can be used to insert exact value.

Return value is AddRes::Ok(Option<usize>) for non-zero occurrent and holds previous value, if there was some.

SC: Θ(q) where q is number of unique nodes, i.e. chars in respective branches.

pub fn acq(&self, occurrent: impl Iterator<Item = char>) -> VerRes

Used to acquire value for occurrent.

If VerRes::Ok(usize), usize is occurrent occurrences count.

pub fn put( &mut self, occurrent: impl Iterator<Item = char>, val: usize, ) -> VerRes

Used to put new value for occurrent occurrences.

If VerRes::Ok(usize), usize is previous value.

pub fn rem(&mut self, occurrent: impl Iterator<Item = char>) -> VerRes

Used to remove occurrent from tree.

If VerRes::Ok(usize), usize is occurrent occurrences count.

c is count of chars iterated over.
TC: Ω(c) or ϴ(c) (backtracing buffer capacity dependent complexity).
SC: ϴ(c).

Check with put_trace_cap for details on backtracing.

pub fn ext(&self) -> Vec<(String, usize)>

Used to extract occurences from tree.

Extraction is alphabetically ordered. Does not clear tree. Use fn clr for clearing.

TC: Ω(n) where n is count of nodes in tree.
SC: Θ(s) where s is occurrent lengths summation.

pub fn clr(&mut self)

Used to clear tree.

TC: Θ(n) where n is count of nodes in tree.

pub const fn ct(&self) -> usize

Used to acquire count of occurrents in tree.

Auto Trait Implementations§

impl Freeze for Toc

impl RefUnwindSafe for Toc

impl !Send for Toc

impl !Sync for Toc

impl Unpin for Toc

impl UnwindSafe for Toc

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.