Refactor `binary_search_by` to use conditional moves #117722

okaneco · 2023-11-08T20:06:36Z

Refactor the if/else checking on cmp::Ordering variants to a "branchless" reassignment of left and right.

This change results in fewer branches and instructions.
https://rust.godbolt.org/z/698eYffTx

I saw consistent benchmark improvements locally. Performance of worst case seems about the same, maybe slightly faster for the L3 test.

Current

slice::binary_search_l1             43.00ns/iter +/- 3.00ns
slice::binary_search_l1_with_dups   25.00ns/iter +/- 0.00ns
slice::binary_search_l1_worst_case  10.00ns/iter +/- 0.00ns
slice::binary_search_l2             64.00ns/iter +/- 1.00ns
slice::binary_search_l2_with_dups   42.00ns/iter +/- 0.00ns
slice::binary_search_l2_worst_case  16.00ns/iter +/- 0.00ns
slice::binary_search_l3            132.00ns/iter +/- 2.00ns
slice::binary_search_l3_with_dups  108.00ns/iter +/- 2.00ns
slice::binary_search_l3_worst_case  33.00ns/iter +/- 3.00ns

This PR

slice::binary_search_l1            21.00ns/iter +/- 0.00ns
slice::binary_search_l1_with_dups  14.00ns/iter +/- 0.00ns
slice::binary_search_l1_worst_case  9.00ns/iter +/- 0.00ns
slice::binary_search_l2            34.00ns/iter +/- 0.00ns
slice::binary_search_l2_with_dups  23.00ns/iter +/- 0.00ns
slice::binary_search_l2_worst_case 16.00ns/iter +/- 0.00ns
slice::binary_search_l3            92.00ns/iter +/- 3.00ns
slice::binary_search_l3_with_dups  63.00ns/iter +/- 1.00ns
slice::binary_search_l3_worst_case 29.00ns/iter +/- 0.00ns

Refactor the if/else checking on cmp::Ordering variants to a "branchless" reassignment of left and right. This change results in fewer branches and instructions.

rustbot · 2023-11-08T20:06:44Z

r? @thomcc

(rustbot has picked a reviewer for you, use r? to override)

thomcc · 2023-11-24T07:09:53Z

@bors r+ rollup=never

bors · 2023-11-24T07:09:55Z

📌 Commit d585eec has been approved by thomcc

It is now in the queue for this repository.

bors · 2023-11-24T07:23:08Z

⌛ Testing commit d585eec with merge 8abf920...

bors · 2023-11-24T09:20:17Z

☀️ Test successful - checks-actions
Approved by: thomcc
Pushing 8abf920 to master...

rust-timer · 2023-11-24T11:02:24Z

Finished benchmarking commit (8abf920): comparison URL.

Overall result: ❌✅ regressions and improvements - ACTION NEEDED

Next Steps: If you can justify the regressions found in this perf run, please indicate this with @rustbot label: +perf-regression-triaged along with sufficient written justification. If you cannot justify the regressions please open an issue or create a new PR that fixes the regressions, add a comment linking to the newly created issue or PR, and then add the perf-regression-triaged label to this PR.

@rustbot label: +perf-regression
cc @rust-lang/wg-compiler-performance

Instruction count

This is a highly reliable metric that was used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	0.4%	[0.4%, 0.4%]	1
Regressions ❌ (secondary)	1.3%	[1.3%, 1.4%]	2
Improvements ✅ (primary)	-1.4%	[-1.9%, -0.2%]	5
Improvements ✅ (secondary)	-1.8%	[-2.6%, -1.3%]	8
All ❌✅ (primary)	-1.1%	[-1.9%, 0.4%]	6

Max RSS (memory usage)

Results

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	3.3%	[1.8%, 4.9%]	2
Regressions ❌ (secondary)	1.3%	[1.3%, 1.3%]	1
Improvements ✅ (primary)	-4.9%	[-7.2%, -2.7%]	2
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	-0.8%	[-7.2%, 4.9%]	4

Cycles

Results

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	-	-	0
Regressions ❌ (secondary)	-	-	0
Improvements ✅ (primary)	-1.6%	[-1.9%, -1.0%]	7
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	-1.6%	[-1.9%, -1.0%]	7

Binary size

Results

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	0.1%	[0.1%, 0.1%]	1
Regressions ❌ (secondary)	0.1%	[0.1%, 0.2%]	3
Improvements ✅ (primary)	-0.9%	[-1.9%, -0.1%]	9
Improvements ✅ (secondary)	-0.7%	[-1.3%, -0.2%]	2
All ❌✅ (primary)	-0.8%	[-1.9%, 0.1%]	10

Bootstrap: 676.001s -> 675.075s (-0.14%)
Artifact size: 313.55 MiB -> 313.48 MiB (-0.02%)

pnkfelix · 2023-11-29T17:52:44Z

The single primary regression here seems to be a measurement blip, based on the 30-day history.
Even if it weren't, the improvements would outweigh the regression.
Marked as triaged.

@rustbot label: +perf-regression-triaged

Refactor binary_search_by to use conditional moves

d585eec

Refactor the if/else checking on cmp::Ordering variants to a "branchless" reassignment of left and right. This change results in fewer branches and instructions.

rustbot assigned thomcc Nov 8, 2023

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-libs Relevant to the library team, which will review and decide on the PR/issue. labels Nov 8, 2023

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Nov 24, 2023

bors added the merged-by-bors This PR was explicitly merged by bors. label Nov 24, 2023

bors merged commit 8abf920 into rust-lang:master Nov 24, 2023

rustbot added this to the 1.76.0 milestone Nov 24, 2023

rustbot added the perf-regression Performance regression. label Nov 24, 2023

okaneco mentioned this pull request Nov 24, 2023

performance regression of binary_search #115271

Closed

okaneco deleted the binarysearch branch November 25, 2023 13:41

rustbot added the perf-regression-triaged The performance regression has been triaged. label Nov 29, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor `binary_search_by` to use conditional moves #117722

Refactor `binary_search_by` to use conditional moves #117722

okaneco commented Nov 8, 2023

rustbot commented Nov 8, 2023

thomcc commented Nov 24, 2023

bors commented Nov 24, 2023

bors commented Nov 24, 2023

bors commented Nov 24, 2023

rust-timer commented Nov 24, 2023

pnkfelix commented Nov 29, 2023

Refactor binary_search_by to use conditional moves #117722

Refactor binary_search_by to use conditional moves #117722

Conversation

okaneco commented Nov 8, 2023

rustbot commented Nov 8, 2023

thomcc commented Nov 24, 2023

bors commented Nov 24, 2023

bors commented Nov 24, 2023

bors commented Nov 24, 2023

rust-timer commented Nov 24, 2023

Overall result: ❌✅ regressions and improvements - ACTION NEEDED

Instruction count

Max RSS (memory usage)

Cycles

Binary size

pnkfelix commented Nov 29, 2023

Refactor `binary_search_by` to use conditional moves #117722

Refactor `binary_search_by` to use conditional moves #117722