Separate HappyTreeFunctions for internal and leaf nodes #864

aprokop · 2023-05-09T00:09:22Z

The goal of this PR is to separate the functions for leaf and internal nodes based on tags.
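A minimal sketch of the two styles discussed in this PR: tag-based overloads versus two separately named functions. `MiniTree`, `LeafTag`, `InternalTag`, `getData`, `getLeafData` and `getInternalData` are hypothetical stand-ins for illustration and are not taken from ArborX.

```cpp
#include <cassert>
#include <vector>

// Hypothetical stand-ins; none of these names come from ArborX.
struct LeafTag {};
struct InternalTag {};

struct MiniTree {
  std::vector<float> leaf_data;      // one entry per leaf node
  std::vector<float> internal_data;  // one entry per internal node
};

// Tag-based variant: a single function name, with the overload selected
// at compile time by the type of the last argument.
inline float getData(MiniTree const &tree, int i, LeafTag) {
  assert(i >= 0 && i < static_cast<int>(tree.leaf_data.size()));
  return tree.leaf_data[i];
}

inline float getData(MiniTree const &tree, int i, InternalTag) {
  assert(i >= 0 && i < static_cast<int>(tree.internal_data.size()));
  return tree.internal_data[i];
}

// Two-function variant: the node kind is spelled out in the function name.
inline float getLeafData(MiniTree const &tree, int i) {
  return getData(tree, i, LeafTag{});
}

inline float getInternalData(MiniTree const &tree, int i) {
  return getData(tree, i, InternalTag{});
}
```

Both variants resolve at compile time; the difference is only in how the call site reads.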

aprokop · 2023-05-09T00:12:08Z

GitLab results are pretty good, with minimal slowdowns. I'll double check on AMD CPU and MI250X, and will check DBSCAN and MST results on both V100 and MI250X.

Ran DBSCAN on small and large HACC problems on MI250X with no issues. However, MST shows a slight slowdown (~5% total, ~8% in Boruvka).

aprokop · 2023-05-09T03:02:56Z

HIP benchmark results are unsatisfactory, and are very different from CUDA.

src/details/ArborX_DetailsHalfTraversal.hpp

dalg24 · 2023-05-09T23:21:55Z

Comparison against current master or pre-merge of #860?

aprokop · 2023-05-10T02:18:31Z

> Comparison against current master or pre-merge of #860?

For benchmarks:

Against the current master: GitLab CI shows a 17-21% slowdown in Serial knn on Power9, with everything else at parity. CUDA, Serial on AMD CPU, and HIP show no changes. I don't understand why Power9 slowed down, but I am willing to accept it.

Against the pre-#860 master, about 5-6% construction/5-8% radius/0% knn in Serial for AMD CPU, 6-14% construction/0% radius/0% knn on V100, 5-6% construction/7-8% radius/-5-10% knn on MI250X.

dalg24

I think we might be using the wrong encoding for the left child and rope now that we are storing them in two independent arrays.

src/details/ArborX_MinimumSpanningTree.hpp

aprokop · 2023-05-11T05:31:56Z

> I think we might be using the wrong encoding for the left child and rope now that we are storing them in two independent arrays.

I tried a different scheme where internal nodes are always negative (offset by -1) and leaf nodes are nonnegative. The code is indeed simpler. However, there's no performance difference compared to the current master (see GitLab).
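A minimal sketch of such a signed encoding, assuming "offset by -1" means that internal node `i` is stored as `-i - 1`. All function names here are hypothetical and are not ArborX's.

```cpp
#include <cassert>

// Leaf indices are stored unchanged, so they are always nonnegative.
inline int encodeLeafChild(int leaf) {
  assert(leaf >= 0);
  return leaf;
}

// Internal indices are negated and shifted by one, so internal node 0
// maps to -1 and never collides with leaf 0.
inline int encodeInternalChild(int internal) {
  assert(internal >= 0);
  return -internal - 1;
}

// The sign alone tells the two node kinds apart.
inline bool encodesLeaf(int code) { return code >= 0; }

// Recover the original (nonnegative) index of either kind.
inline int decodeChild(int code) { return code >= 0 ? code : -code - 1; }
```

With this scheme the sign check replaces any comparison against the number of leaves.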

Moreover, the real blocker is that I am afraid to touch the minimum spanning tree code. It is a complete mess: the current scheme is hardcoded tightly into multiple arrays, everything is tracked through permuted indices, and the whole thing is so fragile that I believe it would take days to make any progress on it. I really don't want to spend that time for little benefit.

If we do want to tackle it, I think we first need to try switching the order of internal and leaf nodes in the current hierarchy, so that leaf nodes are $[0, n)$ and internal nodes are $[n, 2n-1)$. I think that would cut down on the number of index transformations in MST, getting rid of many expressions of the form component - n + 1.
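To illustrate why the reordering would remove index arithmetic, here is a sketch comparing the two layouts for a tree with `n` leaves. It assumes the layout at the time of this comment places internal nodes in $[0, n-1)$ and leaves in $[n-1, 2n-1)$, which is what the `- n + 1` expressions suggest. The function names are hypothetical.

```cpp
#include <cassert>

// Assumed pre-change layout: internal nodes in [0, n-1), leaves in
// [n-1, 2n-1). Mapping a leaf node to its position among the leaves
// needs the "- n + 1" shift.
inline int leafPositionInternalFirst(int node, int n) {
  assert(node >= n - 1 && node < 2 * n - 1);
  return node - n + 1;
}

// Proposed layout: leaves in [0, n), internal nodes in [n, 2n-1).
inline bool isLeafLeafFirst(int node, int n) {
  assert(node >= 0 && node < 2 * n - 1);
  return node < n;
}

// A leaf node index is already its position among the leaves.
inline int leafPositionLeafFirst(int node, int n) {
  assert(node >= 0 && node < n);
  return node;
}

// Internal nodes now carry the shift instead.
inline int internalPositionLeafFirst(int node, int n) {
  assert(node >= n && node < 2 * n - 1);
  return node - n;
}
```

In the leaf-first layout the shift moves to the internal nodes, so code that mostly handles leaf indices needs no transformation.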

src/details/ArborX_DetailsTreeTraversal.hpp

src/details/ArborX_DetailsTreeVisualization.hpp

src/details/ArborX_MinimumSpanningTree.hpp

aprokop · 2023-05-11T18:42:46Z

Converted to draft to figure out the BVH index situation.

masterleinad

Most of the places using a ternary won't work once getBoundingVolume can return different types. It might be worth it to fix this here so the code doesn't need to be touched again.
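The concern can be sketched as follows: a conditional expression needs both operands to share a common type, so it stops compiling once leaf and internal nodes return unrelated bounding volume types. `Point`, `Box`, `ToyTree`, `Contains` and `testNode` are hypothetical illustrations, and the leaf-first node ordering is an assumption of this sketch rather than ArborX's implementation.

```cpp
#include <cassert>
#include <vector>

struct Point { float x; };
struct Box { float lo, hi; };

// Toy tree in which leaves and internal nodes store different types.
// Nodes [0, leaves.size()) are leaves; the rest are internal nodes.
struct ToyTree {
  std::vector<Point> leaves;
  std::vector<Box> internals;

  bool isLeaf(int node) const {
    return node < static_cast<int>(leaves.size());
  }
  Point const &getLeafBoundingVolume(int node) const {
    assert(isLeaf(node));
    return leaves[node];
  }
  Box const &getInternalBoundingVolume(int node) const {
    assert(!isLeaf(node));
    return internals[node - static_cast<int>(leaves.size())];
  }
};

// A predicate overloaded for both bounding volume types.
struct Contains {
  float q;
  bool operator()(Point const &p) const { return p.x == q; }
  bool operator()(Box const &b) const { return b.lo <= q && q <= b.hi; }
};

template <class Predicate>
bool testNode(ToyTree const &tree, int node, Predicate const &pred) {
  // Ill-formed, because Point and Box have no common type:
  //   pred(tree.isLeaf(node) ? tree.getLeafBoundingVolume(node)
  //                          : tree.getInternalBoundingVolume(node));
  // Branching first lets each call pick its own overload.
  if (tree.isLeaf(node))
    return pred(tree.getLeafBoundingVolume(node));
  return pred(tree.getInternalBoundingVolume(node));
}
```

Each branch only needs the results to share a type (here `bool`), not the bounding volumes themselves.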

src/details/ArborX_DetailsHappyTreeFriends.hpp

dalg24

What is the next step after this PR? Is it immediately enabling different types for the node "bounding volumes"?
I am asking because it looks like half of the code changed here would need to be changed again right away.

src/details/ArborX_MinimumSpanningTree.hpp

src/details/ArborX_DetailsHappyTreeFriends.hpp

aprokop · 2023-05-16T18:54:14Z

> What is the next step after this PR? Is it immediately enabling different types for the node "bounding volumes"?

No, not immediately. I see at least a few steps before that:

1. Converting leaf node to store value, which would be a pair index + bounding volume, and changing getLeafPermutationIndex to getValue
2. Introducing IndexableGetter internally which would use the value pair
3. Introducing Range
4. Changing the interface to allow user-defined indexable getter

Using different bounding volume types for leaf and internal nodes can be part of step 4, or come even later.

masterleinad

Looks OK to me.

aprokop added the performance and refactoring labels May 9, 2023

aprokop requested a review from dalg24 May 9, 2023 00:37

aprokop marked this pull request as draft May 9, 2023 03:02

aprokop force-pushed the bv_functions branch from 24a9e21 to 72da1f2 Compare May 9, 2023 20:47

aprokop marked this pull request as ready for review May 9, 2023 21:24

masterleinad self-requested a review May 10, 2023 21:05

dalg24 reviewed May 10, 2023

masterleinad reviewed May 11, 2023

aprokop marked this pull request as draft May 11, 2023 18:42

aprokop mentioned this pull request May 11, 2023

Switch ranges for leaf and internal indices #865

Merged

Separate HappyTreeFunctions for internal and leaf nodes

90adbe7

aprokop force-pushed the bv_functions branch from a730a00 to 90adbe7 Compare May 13, 2023 03:06

Simplify through doing extra calculations

0c979b4

aprokop marked this pull request as ready for review May 13, 2023 04:10

Reverse some changes in tree visualization

ce3654a

masterleinad reviewed May 15, 2023

dalg24 reviewed May 16, 2023

aprokop added 2 commits May 16, 2023 15:36

Switch to using 2 functions instead of tags

09b6621

Remove unnecessary using statement

f27b14e

aprokop force-pushed the bv_functions branch from 9ecdea1 to f27b14e Compare May 16, 2023 21:32

masterleinad approved these changes May 17, 2023

aprokop merged commit bf0fb97 into arborx:master May 18, 2023

aprokop deleted the bv_functions branch May 18, 2023 00:21

aprokop mentioned this pull request May 18, 2023

API v2 tracker #872

Open
