Expand description
KMeans implementation for Apache Arrow Arrays.
Support l2
, cosine
and dot
distances, see DistanceType.
Cosine
distance are calculated by normalizing the vectors to unit length,
and run l2
distance on the unit vectors.
Structs§
- KMeans implementation for Apache Arrow Arrays.
- KMean Training Parameters
Enums§
- KMean initialization method.
Functions§
- Compute partition ID of each vector in the KMeans.
- Compute partitions from Arrow FixedSizeListArray.
- KMeans finds N nearest partitions.