Speed up hash partitioning #6822

Dandandan · 2023-07-02T09:37:33Z

Is your feature request related to a problem or challenge?

Also see the related request in arrow-rs: apache/arrow-rs#4476

In DataFusion, a common operation is to repartition a RecordBatch by hashing one or more columns and splitting its rows into one record batch per partition using the "formula" hash % num_partitions.

The current approach is to build, for each partition, the indices of the matching rows and use them to take from the individual arrays (see BatchPartitioner in datafusion).

This is relatively expensive, however: we visit each array num_partitions times, at scattered positions, which makes the operator cache-inefficient (especially when the number of partitions is high).
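To make the access pattern concrete, here is a minimal sketch of the index-then-take approach described above. It is not the actual BatchPartitioner code: plain slices stand in for Arrow arrays, the hashes are assumed to be computed already, and `take` and `partition_by_take` are hypothetical helper names.

```rust
/// Gather `column[i]` for every `i` in `indices`.
/// Hypothetical stand-in for a `take` kernel over a single array.
fn take(column: &[i64], indices: &[usize]) -> Vec<i64> {
    indices.iter().map(|&i| column[i]).collect()
}

/// Split one column into `num_partitions` outputs using `hash % num_partitions`.
fn partition_by_take(column: &[i64], hashes: &[u64], num_partitions: usize) -> Vec<Vec<i64>> {
    assert_eq!(column.len(), hashes.len());

    // Pass 1: build one list of row indices per partition.
    let mut indices: Vec<Vec<usize>> = vec![Vec::new(); num_partitions];
    for (row, &hash) in hashes.iter().enumerate() {
        indices[(hash % num_partitions as u64) as usize].push(row);
    }

    // Pass 2: one gather per partition. Each gather reads rows spread over
    // the whole column, so the column is revisited `num_partitions` times.
    indices.iter().map(|idx| take(column, idx)).collect()
}
```

With several columns, pass 2 repeats for every column and every partition, which is where the scattered reads add up.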

Describe the solution you'd like

Faster hash-partitioning implementation

Describe alternatives you've considered

No response

Additional context

No response

alamb · 2023-07-03T19:30:53Z

I recommend we look into implementing selection vectors / bitmasks; repartitioning could then become a calculation of such filters / bitmasks.
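As a rough illustration of the bitmask idea, the sketch below computes one boolean mask per partition from already-computed hashes. The names `partition_masks` and `filter` are hypothetical, and `Vec<bool>` stands in for a packed bitmap; a real selection-vector design would presumably pass the mask downstream with the untouched batch instead of materializing the rows.

```rust
/// Build one boolean selection mask per partition:
/// `masks[p][row]` is true when `hashes[row] % num_partitions == p`.
fn partition_masks(hashes: &[u64], num_partitions: usize) -> Vec<Vec<bool>> {
    let mut masks = vec![vec![false; hashes.len()]; num_partitions];
    for (row, &hash) in hashes.iter().enumerate() {
        masks[(hash % num_partitions as u64) as usize][row] = true;
    }
    masks
}

/// Materialize the rows selected by `mask`.
/// Hypothetical stand-in for a `filter` kernel; only needed when a
/// consumer cannot work with the mask directly.
fn filter(column: &[i64], mask: &[bool]) -> Vec<i64> {
    column
        .iter()
        .zip(mask.iter())
        .filter_map(|(&value, &keep)| if keep { Some(value) } else { None })
        .collect()
}
```

Each row ends up selected in exactly one mask, so the masks together describe the same split as the index lists, just in a form that can be handed on without copying.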

zebsme · 2025-03-22T14:17:37Z

Hi @Dandandan @alamb, I tried some experiments by running the TPC-H benchmarks,
and would like to share my findings for others who might be interested:

Bitmask/filter is a bit slower than the current implementation.
Flattening the nested Vec improves performance for some queries. For other queries, however, it actually slows things down, possibly due to increased memory use or less efficient access patterns.
Prefix sum requires random access, which leads to bad performance.
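For readers unfamiliar with the flattened / prefix-sum variant, the sketch below shows one plausible reading of it: count rows per partition, turn the counts into start offsets with a prefix sum, then scatter the row indices into a single flat buffer. This is an assumption about what was tried, not the code from the experiments, and `partition_indices_flat` is a hypothetical name.

```rust
/// Returns `(offsets, flat)`: the rows of partition `p` are
/// `flat[offsets[p]..offsets[p + 1]]`, in their original order.
fn partition_indices_flat(hashes: &[u64], num_partitions: usize) -> (Vec<usize>, Vec<usize>) {
    let part = |h: u64| (h % num_partitions as u64) as usize;

    // Count rows per partition, shifted by one slot so that the prefix
    // sum below yields start offsets.
    let mut offsets = vec![0usize; num_partitions + 1];
    for &h in hashes {
        offsets[part(h) + 1] += 1;
    }

    // Prefix sum: offsets[p] becomes the start of partition p.
    for p in 0..num_partitions {
        offsets[p + 1] += offsets[p];
    }

    // Scatter row indices into the flat buffer. Consecutive rows usually
    // land in different partitions, so these writes jump around in `flat`.
    let mut cursor = offsets.clone();
    let mut flat = vec![0usize; hashes.len()];
    for (row, &h) in hashes.iter().enumerate() {
        let p = part(h);
        flat[cursor[p]] = row;
        cursor[p] += 1;
    }
    (offsets, flat)
}
```

The scatter loop is the random-access step: it replaces many small `Vec` pushes with writes to positions that are far apart when the batch is large.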

alamb · 2025-03-23T20:05:23Z

Thanks for checking this out @zebsme

I don't really have any other ideas

Dandandan · 2025-03-24T10:31:43Z

I wrote up some ideas for supporting selection vectors inside hash join and aggregate (I believe we didn't have issues for those yet?)

This seems likely to give more substantial gains than optimizing only the partitioning code since, even when optimized, we still need to copy the inputs (and run CoalesceBatchesExec).

#15382
#15383

I think it might not be too hard to implement, at least for join (I am less certain about aggregate).

alamb · 2025-03-24T20:38:12Z

I believe aggregate plans also effectively hash the group keys three times:

initial hash to find the group in the initial aggregate phase
hash to compute the output partition
hash (again) to find the group in the final aggregation phase

Passing along the pre-computed hashes, especially for strings, might be significantly faster.
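A minimal sketch of what passing hashes along could look like, assuming a hypothetical `HashedBatch` type that stores the group-key hashes next to the keys. Nothing here is DataFusion API; `DefaultHasher` from the standard library stands in for whatever hash function the plan uses, and whether one hash value can safely serve both group lookup and partitioning is a design question this sketch does not address.

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

/// Hash every key once.
fn hash_keys(keys: &[&str]) -> Vec<u64> {
    keys.iter()
        .map(|k| {
            let mut hasher = DefaultHasher::new();
            k.hash(&mut hasher);
            hasher.finish()
        })
        .collect()
}

/// Hypothetical batch that carries its group-key hashes with it.
struct HashedBatch<'a> {
    keys: Vec<&'a str>,
    hashes: Vec<u64>,
}

impl<'a> HashedBatch<'a> {
    /// Hash the keys once, when the batch is created.
    fn new(keys: Vec<&'a str>) -> Self {
        let hashes = hash_keys(&keys);
        Self { keys, hashes }
    }

    /// Split into per-partition batches using the stored hashes, and
    /// forward those hashes so a later stage does not rehash the keys.
    fn repartition(&self, num_partitions: usize) -> Vec<HashedBatch<'a>> {
        let mut out: Vec<HashedBatch<'a>> = (0..num_partitions)
            .map(|_| HashedBatch { keys: Vec::new(), hashes: Vec::new() })
            .collect();
        for (&key, &hash) in self.keys.iter().zip(self.hashes.iter()) {
            let p = (hash % num_partitions as u64) as usize;
            out[p].keys.push(key);
            out[p].hashes.push(hash);
        }
        out
    }
}
```

In this shape the string keys are hashed only in `HashedBatch::new`; the partitioning step and any consumer of the output batches read the stored `u64` values instead.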

Dandandan added the enhancement New feature or request label Jul 2, 2023

Dandandan changed the title ~~Speed up partitioning operator~~ Speed up hash partitioning operator Jul 2, 2023

Dandandan changed the title ~~Speed up hash partitioning operator~~ Speed up hash partitioning Jul 2, 2023

Dandandan mentioned this issue Jul 2, 2023

[EPIC] A list of performance improvement tickets #5546

Closed

30 tasks

Dandandan added the performance Make DataFusion faster label Jul 19, 2023

Dandandan mentioned this issue Jul 19, 2023

[EPIC] (Even More) Grouping / Group By / Aggregation Performance #7000

Open

17 tasks

alamb mentioned this issue Feb 4, 2025

[EPIC] A(nother) list of performance improvement tickets #14482

Open

10 tasks

Dandandan mentioned this issue Apr 18, 2025

Use interleave to speed up hash repartitioning #15768

Closed

ctsk mentioned this issue May 7, 2025

Optimize hash partitioning for cache friendliness #15981

Open
