-
Notifications
You must be signed in to change notification settings - Fork 1.5k
Minor: Add SMJ to TPCH benchmark usage #10747
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Conversation
I checked the TPCH benchmarks passes with SMJ on and row counts are the same
|
tpch_mem: TPCH inspired benchmark on Scale Factor (SF) 1 (~1GB), query from memory | ||
tpch10: TPCH inspired benchmark on Scale Factor (SF) 10 (~10GB), single parquet file per table | ||
tpch10: TPCH inspired benchmark on Scale Factor (SF) 10 (~10GB), single parquet file per table, hash join | ||
tpch_smj10: TPCH inspired benchmark on Scale Factor (SF) 10 (~10GB), single parquet file per table, sort merge join |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm planning to get rid of tpch_smj* soon and get the hash join type from the user input any bench can run with a choice of join type
Benchmark resultsBenchmarks comparing d6ddd23 (main) and 8353d20 (PR)
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you @comphead
* Fix: Sort Merge Join crashes on TPCH Q21 * Fix LeftAnti SMJ join when the join filter is set * rm dbg * Add SMJ to TPCH benchmark usage
Which issue does this PR close?
Closes #10100 .
Rationale for this change
Basically fix for #10380 fixed the issue, I just fixing also the usage info for SMJ
What changes are included in this PR?
Are these changes tested?
Are there any user-facing changes?