
[PoC] Use timestamp_delta to weight samples #166

Open · wants to merge 1 commit into master

Conversation

casperisfine
Contributor

NB: this is just proof-of-concept quality, meant to start a discussion. A releasable version of this patch would require considerably more work.

Problem

We recently noticed that we were overlooking a major hotspot because stackprof's regular reporting is biased.

Here's the stackprof summary of one of our application boots, using stackprof 0.2.17:

==================================
  Mode: cpu(1000)
  Samples: 95925 (13.33% miss rate)
  GC: 30564 (31.86%)
==================================
     TOTAL    (pct)     SAMPLES    (pct)     FRAME
     27790  (29.0%)       27790  (29.0%)     (marking)
      6278   (6.5%)        6277   (6.5%)     RubyVM::InstructionSequence.load_from_binary
...
       311   (0.3%)         311   (0.3%)     Module#using

And now here's the same profiling data, but rendered by Speedscope:

[Screenshot taken 2021-09-03 at 12:12:46: Speedscope rendering of the same profile]

Notice how Module#using accounts for almost nothing according to stackprof, but for 11% according to Speedscope.

This is because stackprof's own reporting is based on the number of samples, whereas Speedscope uses the raw_timestamp_deltas data that is intended for flamegraphs.

And since Module#using goes over the whole heap to flush method caches, I assume the stackprof signal can't fire for a long time while it runs. Based on the stackprof data, Module#using was sampled 311 times, but according to the timestamp deltas each sample accounted for roughly 41ms instead of 1ms.

This is particularly visible for Module#using, but analyzing raw_timestamp_deltas shows that a large proportion of the samples are longer than intended.
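To illustrate, here is a small sketch (not part of this PR, and not stackprof's API beyond the documented `:interval` and `:raw_timestamp_deltas` keys of a profile captured with `raw: true`) that summarizes how far the recorded deltas drift from the requested sampling interval:

```ruby
# Hedged sketch: given stackprof results captured with `raw: true`,
# report the mean timestamp delta and how many samples exceeded
# twice the requested interval (all values are in microseconds).
def delta_stats(profile)
  deltas = profile[:raw_timestamp_deltas]
  interval = profile[:interval]
  over = deltas.count { |d| d > 2 * interval }
  {
    mean: deltas.sum.to_f / deltas.size,
    over_budget: over,
    over_budget_pct: 100.0 * over / deltas.size,
  }
end
```

Running this over our boot profile is what surfaced the skew described above: a minority of samples carry a disproportionate share of the wall-clock time.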

Possible solution

Again, this is just a proof of concept, but if we always computed the timestamp_delta, we could "weight" the samples to somewhat correct this bias.

It's a bit hard to implement because of other profiling modes, such as object, which aren't time based. So maybe instead of weighting the samples, we should simply record the timestamp_delta as part of the profile, and use that instead of the sample count for reporting.
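The idea can be sketched in a few lines. This is a hypothetical illustration, not the patch itself: `samples` is a made-up flat representation of `[top frame name, delta in microseconds]` pairs, standing in for what stackprof would derive from `:raw` and `:raw_timestamp_deltas`:

```ruby
# Hedged sketch: rank frames by accumulated timestamp_delta rather
# than by sample count. Returns [frame, sample count, % of total time]
# rows, sorted by weighted time descending.
def weighted_report(samples)
  by_time = Hash.new(0)
  by_count = Hash.new(0)
  samples.each do |frame, delta_us|
    by_time[frame] += delta_us
    by_count[frame] += 1
  end
  total = by_time.values.sum.to_f
  by_time.sort_by { |_, t| -t }.map do |frame, t|
    [frame, by_count[frame], (100.0 * t / total).round(1)]
  end
end
```

With this kind of weighting, a frame like Module#using with few but very long samples would rise to the top of the report instead of being buried by its sample count.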

@tenderlove @XrXr what do you think?
