Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Optimize switch fallthrough #2054

Merged
merged 4 commits into from
Feb 14, 2024
Merged

Conversation

DanielFi
Copy link
Contributor

@DanielFi DanielFi commented Dec 8, 2023

I profiled jadx and noticed that RegionMaker::processSwitch was taking up a surprising amount of the total de-compilation time (1.92% on my laptop with a certain real-world apk).
Apps that use Redex for optimization tend to have many large switch cases, which amplifies this problem.

The computationally hardest part of processSwitch is calculating the post dominators for almost every case in the switch. I improved this by only calculating the post dominators map once for the whole switch and reusing it, instead of once for each case.

While doing so, I noticed that most other calls to BlockUtils::calcPartialPostDominance were computing the post dominance for the entire method, so I added a "cache" to BlockUtils::calcPostDominance in order to avoid recomputing the same thing over and over.

Copy link
Owner

@skylot skylot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@DanielFi thank you for discovering this issue.
As I mention in code comment, it will take some time for me to revisit and check a new approach, so please be patient 🙂

Related issue #826

fallThroughBlock = getOneIntersectionBlock(out, caseBlocks, df);
} else {
if (postDomMap == null) {
Set<BlockNode> allPathsBlocks = BlockUtils.getAllPathsBlocks(block, out);
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks like your change is semantically different from original algorithm. You are using here constant in loop block variable, but it was successor before, and it changes in each iteration, that is why these result were calculated every time.
For now, am I not sure why it was done this way and as soon as test didn't fail maybe we really can use your approach. Although, I want to investigate and check this on other cases, so it may take some time.
And, if possible, please share your sample apks, so I can also profile this code.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the quick response!

It is slightly semantically different, since I'm computing the post-dom map for the whole switch once, instead of calculating a separate post-dom map for each case in the switch.

As for a sample APK, WhatsApp is as an example of an app that has been optimized by Redex and spends a lot of time in processSwitch, making it relevant in this case.

return calcPartialPostDominance(mth, mth.getBasicBlocks(), mth.getPreExitBlocks().get(0));
Map<BlockNode, BitSet> postDominanceMap = mth.getPostDominanceMap();
if (postDominanceMap == null) {
postDominanceMap = calcPartialPostDominance(mth, mth.getBasicBlocks(), mth.getPreExitBlocks().get(0));
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Caching here may cause issues, because basic blocks tree changing during 'blocks' stage, and as soon as this is a utility method, we need some guard to prevent using it at that stage.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm assuming that by "blocks stage" you mean the passes relating to blocks IR (BlocksSplitter and BlockProcessor). Is that correct? Or is the CFG modified again in later passes?

I've looked into how you've implemented dominator computation in BlockProcessor, and it seems that the same thing can be done for post dominators. Simply compute all post dominators BlockProcessor and store them as a postDoms field in BlockNode. What do you think?

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

by "blocks stage" you mean the passes relating to blocks IR

Yes, and blocks are locked at the end of BlockProcessor and not modified later.

Simply compute all post dominators BlockProcessor and store them as a postDoms field in BlockNode. What do you think?

This is how it supposes to be and this way it is easier to use in other places 👍

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The locking looks very cool!
I'll start working on it then 💪

@skylot
Copy link
Owner

skylot commented Dec 8, 2023

Update:
As I mentioned in #826 (comment)

I use post-dominance info for search out block, but as soon as post-dominance algorithm expects only one exit node I use partial calculation on the needed subgraph

But after that I actually change blocks structure to have only one exit node, so now we really can use calc post-dominance for whole method and cache it 🎉
So only guard in utility method is left, but it is a minor issue and can be fixed later 😄

@skylot skylot added this to the TBD milestone Dec 23, 2023
@skylot skylot added enhancement Core Issues in jadx-core module labels Dec 23, 2023
@skylot
Copy link
Owner

skylot commented Jan 17, 2024

I'll start working on it then 💪

@DanielFi is there are any progress?
If you want, I can merge your PR as is and apply mentioned changes myself 🙂

@skylot
Copy link
Owner

skylot commented Feb 14, 2024

Ok, I implemented calculation of post-dominators using the same approach and algorithm as dominators.
Also, I adjusted switch out block calculation to use this post-dom info. This change not equivalent to previous approach, but it looks easier to understand.

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
Core Issues in jadx-core module enhancement
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants