fix valueLog doRunGC to use consistent units #724

Closed
mschoch wants to merge 1 commit

Conversation

mschoch (Contributor) commented Feb 26, 2019

Some comparisons were made between bytes and megabytes. This change tracks all values in bytes and makes comparisons using consistent units.



CLAassistant commented Feb 26, 2019

CLA assistant check
All committers have signed the CLA.

mschoch (Contributor, Author) commented Feb 26, 2019

This could have been addressed a couple of ways; I chose to just do everything in bytes instead of MB. This removes a few divisions, and the only downside is that the trace messages will output less friendly numbers.
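
To make the mismatch concrete, here is a minimal runnable sketch (hypothetical names, sizes, and threshold, not the actual value.go code) contrasting the mixed-unit comparison with the all-bytes comparison chosen here:

```go
package main

import "fmt"

// Hypothetical sampling budget for illustration only: 10 MB, expressed in bytes.
const sizeWindowBytes = 10 << 20

func main() {
	entrySizes := []int64{4 << 20, 4 << 20, 4 << 20} // three 4 MB entries

	// Mixed-unit pattern: the running total is kept in MB but compared
	// against a byte-valued threshold, so the size budget never trips.
	var totalMB float64
	for _, sz := range entrySizes {
		totalMB += float64(sz) / (1 << 20)
	}
	fmt.Println("mixed-unit check fires:", totalMB > sizeWindowBytes) // false

	// Consistent pattern (the approach in this PR): track the total in
	// bytes and compare bytes with bytes.
	var totalBytes int64
	for _, sz := range entrySizes {
		totalBytes += sz
	}
	fmt.Println("all-bytes check fires:", totalBytes > sizeWindowBytes) // true
}
```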

mschoch (Contributor, Author) commented Feb 26, 2019

Not sure whether the AppVeyor failure is related, but fixing this bug may expose other issues in value log GC.

Specifically, because this line was comparing bytes and megabytes, we would never have stopped sampling because of the number of bytes we read:

https://github.com/dgraph-io/badger/blob/master/value.go#L1188

That means we either ran out of time (10s), read 10000 entries, or got to the end of the log. This then relates to the checks we perform here:

https://github.com/dgraph-io/badger/blob/master/value.go#L1253

Notice how the row count check here is stricter than the size check. This ends up mattering, because now there will be more cases where we stop sampling due to size, and thus more cases where the row count requirement is no longer satisfied. Obviously all of this depends on the data and tuning in practice, but it's worth pointing out.

On that same topic, I have concerns about users being able to tune this effectively. It makes total sense to want several ways to constrain how much data is sampled (entries, size, time), but at this point we're trying to decide whether the sample was good enough. It seems like satisfying either size or entries ought to be enough, but as coded today we require both. I anticipate this being hard for some users to tune, as satisfying both requires you to have a good idea of the value sizes, which may vary considerably in practice. Any thoughts on this check?
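
For readers following along, a rough sketch of the flow being described (hypothetical names and budgets, not the actual value.go constants: the 10 s and 10000-entry limits come from the comment above, the byte budget and the "good enough" fractions are assumed): sampling stops when any budget is exhausted, while the later check requires both thresholds.

```go
package gcsample

import "time"

// sample tracks what one GC sampling pass has read so far
// (illustrative only; badger's real struct and fields differ).
type sample struct {
	countRead int   // entries examined
	bytesRead int64 // bytes examined
}

// Hypothetical budgets for one sampling pass.
const (
	maxEntries = 10000
	maxBytes   = int64(10 << 20) // assumed 10 MB budget, in bytes
	maxTime    = 10 * time.Second
)

// stopSampling reports whether any budget is exhausted. Before this fix,
// the size comparison mixed bytes with MB, so in practice only the entry
// count, the time limit, or end-of-log ever ended a pass.
func stopSampling(s sample, elapsed time.Duration) bool {
	return s.countRead >= maxEntries ||
		s.bytesRead >= maxBytes ||
		elapsed >= maxTime
}

// goodEnough mirrors the concern raised above: as written, the decision
// requires both thresholds to be met (with the entry check the stricter
// of the two), rather than accepting a sample that satisfies either one.
// The fractions here are illustrative, not the real values.
func goodEnough(s sample) bool {
	return s.countRead >= maxEntries*75/100 && s.bytesRead >= maxBytes/2
}
```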

mschoch (Contributor, Author) commented Feb 26, 2019

Actually, perhaps we should disregard my last comment. At least when doing offline compaction, just applying this fix makes it work very well, without any other changes or tuning.

mschoch mentioned this pull request Feb 26, 2019

manishrjain (Contributor) left a comment


To add some context: we used to do this purely based on size, but if key-values are really small, that ended up causing a lot of LSM tree lookups just to sample the log file. That's why we added the criterion for the number of keys, so we can quit early once we have enough key samples.

I'd prefer keeping the units in MB and the fields in reason as floats.
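
As a hedged illustration of that preference (hypothetical names, not badger's actual reason struct), the running totals could stay in MB as floats, with each entry's byte size converted once at accumulation time so every later comparison is MB against MB:

```go
package gcsample

// reasonMB sketches the reviewer's suggestion: keep running totals in MB
// as floats (illustrative field names only).
type reasonMB struct {
	totalMB   float64
	discardMB float64
	count     int
}

// add converts an entry's size from bytes to MB once, so all later
// comparisons against MB-denominated thresholds use consistent units.
func (r *reasonMB) add(entryBytes int64, discarded bool) {
	mb := float64(entryBytes) / (1 << 20)
	r.totalMB += mb
	if discarded {
		r.discardMB += mb
	}
	r.count++
}
```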

