You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
This is a follow up to #962.
Currently we cannot compute all the statistics at instantiation because some implementations of the ExecutionPlan trait are mutable.
Describe the solution you'd like
The proposition is to make all ExecutionPlan implementations immutable and then transform the statistics method signature to: fn statistics(&self) -> &Statistics;
Describe alternatives you've considered
Instead of trying to compute the statistics at instantiation, a solution could be to have statistics(&mut self) and cache the result from there. But if the plan is mutable this is error prone because the cached statistics needs to be correctly invalidated when updates occur.
* Add test that invokes bloom_filter_agg.
* QueryPlanSerde support for BloomFilterAgg.
* Add bloom_filter_agg based on sample UDAF. planner instantiates it now. Added spark_bit_array_tests.
* Partial work on Accumulator. Need to finish merge_batch and state.
* BloomFilterAgg state, merge_state, and evaluate. Need more tests.
* Matches Spark behavior. Need to clean up the code quite a bit, and do `cargo clippy`.
* Remove old comment.
* Clippy. Increase bloom filter size back to Spark's default.
* API cleanup.
* API cleanup.
* Add BloomFilterAgg benchmark to CometExecBenchmark
* Docs.
* API cleanup, fix merge_bits to update cardinality.
* Refactor merge_bits to update bit_count with the bit merging.
* Remove benchmark results file.
* Docs.
* Add native side benchmarks.
* Adjust benchmark parameters to match Spark defaults.
* Address review feedback.
* Add assertion to merge_batch.
* Address some review feedback.
* Only generate native BloomFilterAgg if child has LongType.
* Add TODO with GitHub issue link.
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
This is a follow up to #962.
Currently we cannot compute all the statistics at instantiation because some implementations of the
ExecutionPlan
trait are mutable.Describe the solution you'd like
The proposition is to make all ExecutionPlan implementations immutable and then transform the statistics method signature to:
fn statistics(&self) -> &Statistics;
Describe alternatives you've considered
Instead of trying to compute the statistics at instantiation, a solution could be to have
statistics(&mut self)
and cache the result from there. But if the plan is mutable this is error prone because the cached statistics needs to be correctly invalidated when updates occur.Additional context
#965 (comment)
The text was updated successfully, but these errors were encountered: