APACHE_DATASKETCHES_HLL_ESTIMATE

A scalar function. Extracts a cardinality estimate from a single Apache DataSketches HLL sketch that was previously built using the aggregate function APACHE_DATASKETCHES_HLL_BUILD.

APACHE_DATASKETCHES_HLL_ESTIMATE estimates the cardinality (the number of distinct elements) of a dataset represented by an HLL (HyperLogLog) sketch. This is particularly useful for large datasets where an exact distinct count is computationally expensive.

Syntax

APACHE_DATASKETCHES_HLL_ESTIMATE(<expression>)

Parameters

| Parameter | Description | Supported input types |
|---|---|---|
| <expression> | An Apache DataSketches HLL sketch in a valid format, e.g. the output of the APACHE_DATASKETCHES_HLL_BUILD function. | BYTEA |

Return Type

BIGINT

Error Handling

If the input expression is not a valid Apache DataSketches HLL sketch, the function raises an error. Ensure that the input is correctly formatted, for example by generating it with the APACHE_DATASKETCHES_HLL_BUILD function.

Example

Following the example in APACHE_DATASKETCHES_HLL_BUILD:

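SELECT APACHE_DATASKETCHES_HLL_ESTIMATE(a) AS estimate
FROM sketch_of_data_to_count
ORDER BY 1;

Returns:

estimate (BIGINT)
3333526
5001149

To estimate the combined cardinality across all stored sketches, merge them with APACHE_DATASKETCHES_HLL_MERGE before estimating:

SELECT APACHE_DATASKETCHES_HLL_ESTIMATE(APACHE_DATASKETCHES_HLL_MERGE(a)) AS estimate
FROM sketch_of_data_to_count;

Returns:

estimate (BIGINT)
6673219

The merged estimate is lower than the sum of the two individual estimates (8334675), which is consistent with the sketches sharing some elements: values present in more than one sketch are counted once.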
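
As a further illustration of the build-then-estimate pattern described above, the following sketch builds an HLL sketch per group and estimates its cardinality in the same query, next to an exact count for comparison. The table page_visits and its columns visit_date and user_id are hypothetical and not part of the original example. The query also assumes that APACHE_DATASKETCHES_HLL_BUILD accepts the column to be counted as its only argument; check that function's own reference page before relying on this form.

```sql
-- Hypothetical table (assumption, for illustration only):
--   page_visits(visit_date DATE, user_id BIGINT)
-- Assumption: APACHE_DATASKETCHES_HLL_BUILD takes the counted column
-- as its single argument and returns a BYTEA sketch.
SELECT
    visit_date,
    -- Approximate distinct users per day, read from the sketch
    APACHE_DATASKETCHES_HLL_ESTIMATE(
        APACHE_DATASKETCHES_HLL_BUILD(user_id)
    ) AS approx_users,
    -- Exact distinct users per day, shown only for comparison
    COUNT(DISTINCT user_id) AS exact_users
FROM page_visits
GROUP BY visit_date
ORDER BY visit_date;
```

In practice the sketch would usually be built once and stored, as in sketch_of_data_to_count above, so that later queries call only APACHE_DATASKETCHES_HLL_ESTIMATE instead of rescanning the raw data.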