incr.comp.: Use a set implementation optimized for small item counts for deduplicating read-edges. #45577

michaelwoerister · 2017-10-27T15:36:57Z

Many kinds of `DepNodes` will only ever have between zero and three edges originating from them (see e.g. #45063 (comment)) so let's try to avoid allocating a `HashSet` in those cases.

r? @nikomatsakis
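For readers without the diff at hand, here is a minimal sketch of the idea, not the code from this PR. The names `SmallIndexSet`, `Index` and `INLINE_CAP` are hypothetical, a plain `u32` stands in for `DepNodeIndex`, and std's `HashSet` stands in for whatever hash set the compiler uses. Up to three items live inline and are deduplicated by linear scan; only the fourth distinct item triggers a heap allocation.

```rust
use std::collections::HashSet;

/// Hypothetical stand-in for rustc's `DepNodeIndex`.
type Index = u32;

/// Assumed number of items kept inline before spilling to a `HashSet`.
const INLINE_CAP: usize = 3;

#[derive(Debug)]
enum SmallIndexSet {
    /// Items stored in place; only `items[..len]` is meaningful.
    Inline { len: usize, items: [Index; INLINE_CAP] },
    /// Fallback for larger sets; this variant allocates.
    Heap(HashSet<Index>),
}

impl SmallIndexSet {
    fn new() -> Self {
        SmallIndexSet::Inline { len: 0, items: [0; INLINE_CAP] }
    }

    /// Inserts `value`; returns `true` if it was not already present.
    fn insert(&mut self, value: Index) -> bool {
        let spilled = match self {
            SmallIndexSet::Heap(set) => return set.insert(value),
            SmallIndexSet::Inline { len, items } => {
                // Linear scan over at most INLINE_CAP items.
                if items[..*len].contains(&value) {
                    return false;
                }
                if *len < INLINE_CAP {
                    items[*len] = value;
                    *len += 1;
                    return true;
                }
                // Inline storage is full: copy everything into a hash set.
                let mut set: HashSet<Index> = items.iter().copied().collect();
                set.insert(value);
                set
            }
        };
        *self = SmallIndexSet::Heap(spilled);
        true
    }

    fn len(&self) -> usize {
        match self {
            SmallIndexSet::Inline { len, .. } => *len,
            SmallIndexSet::Heap(set) => set.len(),
        }
    }

    fn is_heap(&self) -> bool {
        matches!(self, SmallIndexSet::Heap(_))
    }
}
```

With this shape, a node with zero to three distinct read-edges never touches the allocator; the trade-off is a larger in-place footprint for every set.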

leonardo-m · 2017-10-27T20:44:55Z

Something like this? https://crates.io/crates/vec_map

kennytm · 2017-10-28T07:51:09Z

@leonardo-m No, this structure is more like https://docs.rs/david-set/0.1.2/david_set/struct.Set.html.

arielb1 · 2017-10-28T13:19:24Z

src/librustc/dep_graph/graph.rs

+// Many kinds of nodes often only have between 0 and 3 edges, so we provide a
+// specialized set implementation that does not allocate for those small counts.
+#[derive(Debug, PartialEq, Eq)]
+enum DepNodeIndexSet {


Could you move this to a generic data structure in rustc_data_structures?

arielb1 · 2017-10-28T13:19:43Z

Could you try and get some numbers on the performance improvement or the hit rate?

Even if you don't, r=me with the set moved to rustc_data_structures.

kennytm · 2017-10-28T13:51:39Z

@bors try

Preparing for perf.

bors · 2017-10-28T13:51:49Z

⌛ Trying commit be27d8b with merge 877833f...

bors · 2017-10-28T15:12:07Z

☀️ Test successful - status-travis

kennytm · 2017-10-28T15:48:23Z

@rust-lang/infra perf check requested from #45577 (comment).

arthurprs · 2017-10-28T21:11:00Z

This is fine, but an `enum InlineSet<T> { Inline(ArrayVec<T>), Extended(FxHashSet<T>) }` would be nicer and reusable.

Note: `ArrayVec` is already in the codebase.
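A rough, self-contained sketch of what such a reusable type could look like. It is an illustration under assumptions rather than the rustc implementation: a fixed array of `Option<T>` stands in for `ArrayVec<T>`, std's `HashSet` stands in for `FxHashSet`, the const parameter `N` and the `T: Copy` bound are choices made here to keep the example short.

```rust
use std::collections::HashSet;
use std::hash::Hash;

/// Hypothetical generic small set. `N` is the assumed inline capacity.
enum InlineSet<T, const N: usize> {
    /// Inline slots; `None` marks a free slot.
    Inline([Option<T>; N]),
    /// Heap-allocated fallback once more than `N` items are present.
    Extended(HashSet<T>),
}

impl<T: Copy + Eq + Hash, const N: usize> InlineSet<T, N> {
    fn new() -> Self {
        InlineSet::Inline([None; N])
    }

    /// Inserts `value`; returns `true` if it was not already present.
    fn insert(&mut self, value: T) -> bool {
        let extended = match self {
            InlineSet::Extended(set) => return set.insert(value),
            InlineSet::Inline(slots) => {
                if slots.contains(&Some(value)) {
                    return false;
                }
                // Use the first free inline slot, if there is one.
                if let Some(free) = slots.iter_mut().find(|s| s.is_none()) {
                    *free = Some(value);
                    return true;
                }
                // All slots are taken: move the items into a hash set.
                let mut set: HashSet<T> = slots.iter().flatten().copied().collect();
                set.insert(value);
                set
            }
        };
        *self = InlineSet::Extended(extended);
        true
    }

    fn contains(&self, value: &T) -> bool {
        match self {
            InlineSet::Inline(slots) => slots.contains(&Some(*value)),
            InlineSet::Extended(set) => set.contains(value),
        }
    }
}
```

Something like `InlineSet<u32, 3>` would then behave like the specialized set in this PR while remaining usable for other small `Copy` keys.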

Mark-Simulacrum · 2017-10-29T23:30:51Z

http://perf.rust-lang.org/compare.html?start=7da9a5e178e28b2e387e6296aa1b0289acdf5781&end=877833ffc410fea76cf3eb1bee275ccb92a43e7d

kennytm · 2017-10-30T09:00:53Z

The improvement/regression is negligible. There is even a +17.4% max-rss (memory use) in the regex-opt-0.1.80@020-incr-from... test case. Overall landing this PR does not seem beneficial.

| Measure | ∑before | ∑after | Δ |
|---|---:|---:|---:|
| cpu-clock | 414267.47 | 411066.66 | -0.77% |
| cycles:u | 1582921482931 | 1570998057890 | -0.75% |
| faults | 1842937 | 1836494 | -0.35% |
| instructions:u | 1832233383606 | 1832161663635 | -0.00% |
| max-rss | 17936396 | 18100956 | +0.92% |
| task-clock | 413244.27 | 409110.92 | -1.00% |
| wall-time | 220.22 | 220.70 | +0.22% |
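The Δ column appears to be the plain relative change of the summed values, `(after - before) / before`. A small sketch that recomputes three of the rows under that assumption; the helper names `percent_delta` and `report` are made up for this example.

```rust
/// Relative change from `before` to `after`, in percent.
fn percent_delta(before: f64, after: f64) -> f64 {
    (after - before) / before * 100.0
}

/// (measure, sum before, sum after), copied from three rows of the table.
const ROWS: [(&str, f64, f64); 3] = [
    ("cpu-clock", 414267.47, 411066.66),
    ("max-rss", 17936396.0, 18100956.0),
    ("wall-time", 220.22, 220.70),
];

/// Formats each row as "<measure>: <signed delta>%" with two decimals.
fn report() -> Vec<String> {
    ROWS.iter()
        .map(|(name, before, after)| {
            format!("{}: {:+.2}%", name, percent_delta(*before, *after))
        })
        .collect()
}
```

Recomputed this way, the three rows come out as -0.77%, +0.92% and +0.22%, matching the table.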

michaelwoerister · 2017-10-30T09:34:34Z

Thanks for kicking off the performance measurement, @kennytm!

> Could you try and get some numbers on the performance improvement or the hit rate?

@julian-seward1 and I ran into this function as one of the hotter ones while profiling an incremental build of the regex crate. The compiler spends roughly between 0.9% and 1.6% of cycles in the `read_index()` function, depending on how much of the dependency graph can be marked as green. So that's the upper limit of the performance improvement here.

The main intended optimization here is to get rid of the heap allocation for small hash sets. It's a bit surprising that the effect on the overall instruction count is so small. I would have hoped for -0.5% instead of -0.1%.
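To make the upper bound concrete: if a function accounts for a fraction `f` of all cycles and becomes `s` times faster, the overall saving is `f * (1 - 1/s)`, which approaches `f` only as the function's cost goes to zero. A tiny sketch; the function name `cycles_saved` and the 2x figure below are illustrative assumptions, not measurements.

```rust
/// Fraction of total cycles saved when a function that accounts for
/// `fraction` of all cycles becomes `speedup` times faster.
fn cycles_saved(fraction: f64, speedup: f64) -> f64 {
    fraction * (1.0 - 1.0 / speedup)
}
```

For example, `cycles_saved(0.016, 2.0)` is 0.008, so even a hypothetical 2x speedup of `read_index()` in the 1.6% case would save under one percent of cycles overall.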

> There is even a +17.4% max-rss (memory use) in the regex-opt-0.1.80@020-incr-from... test case.

This must be some other effect. I don't see how this could introduce any noticeable increase in memory consumption.

Thanks for your comments, everyone! Maybe I'll tinker some more with this in my spare time. Closing for now.

kennytm · 2017-10-30T09:44:21Z

> The main intended optimization here is to get rid of the heap allocation for small hash sets. It's a bit surprising that the effect on the overall instruction count is so small. I would have hoped for -0.5% instead of -0.1%.

@michaelwoerister Probably it's because jemalloc is very efficient? 😄

incr.comp.: Use a set implementation optimized for small item counts for deduplicating read-edges. (be27d8b)

rust-highfive assigned nikomatsakis Oct 27, 2017

kennytm added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Oct 27, 2017

arielb1 reviewed Oct 28, 2017

kennytm added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Oct 28, 2017

michaelwoerister closed this Oct 30, 2017

michaelwoerister mentioned this pull request May 9, 2018: Use SmallVec for DepNodeIndex within dep_graph. #50565 (Merged)

michaelwoerister mentioned this pull request Dec 13, 2018: [experiment] Allocate DepGraph::read_index de-duplication hashset more lazily. #56724 (Closed)
