feat: implement a dirty flag #74

Stebalien · 2020-11-19T20:09:18Z

This patch introduces a dirty flag and uses it to avoid unnecessary writes.

It avoids writes when creating new nodes. Instead, it just caches the new nodes and marks them dirty.
It only flushes modified nodes instead of all cached nodes.
Flush no longer clears the cache, just the dirty flags.

This patch introduces a dirty flag and uses it to avoid unnecessary writes. 1. It avoids writes when creating new nodes. Instead, it just caches the new nodes and marks them dirty. 2. It only flushes modified nodes instead of all cached nodes. 3. Flush no longer clears the cache, just the dirty flags.

Stebalien · 2020-11-19T20:11:33Z

So, I was planning on rewriting this the same way I did the AMT, but that wasn't really necessary in this case. Simply fixing the unnecessary writes issue turned out to be much simpler.

This appears to reduce writes by at least 40% in the "fill" benchmarks.

Note: like with AMT v2, the root node will always be written on flush. Otherwise, we only write modified nodes.

ZenGround0

LGTM
One question, what is the motivation for no longer clearing caches from memory during flush?
--edit--
I now see the request for this in #72. It looks like it's a memory / time trade off and we care more about time than memory? In the filecoin state machine we don't do much with hamts after flushing so the choice doesn't seem strongly motivated for this use. Is there a motivation I'm missing?

ZenGround0 · 2020-11-25T21:11:07Z

~~Oh also this is going to change gas accounting so we either 1) wait for me to release v3 in the near future before merging, or 2) go ahead and update this to v3 before merge yourself.~~

#76 is now merged so this can be rebased and merged.

austinabell · 2020-11-26T20:16:48Z

LGTM
One question, what is the motivation for no longer clearing caches from memory during flush?
--edit--
I now see the request for this in #72. It looks like it's a memory / time trade off and we care more about time than memory? In the filecoin state machine we don't do much with hamts after flushing so the choice doesn't seem strongly motivated for this use. Is there a motivation I'm missing?

Well, the memory would be dropped right after the structure is dropped (or whenever it's GCed) so I don't really see the difference in this case. It's not like the flushed hamt is kept in memory for any period of time really, and keeping a pointer to the node until it is doesn't seem like it incurs any cost, at the benefit of more generally usable and reusable implementation. Also, the state tree in between messages could reuse cached nodes, instead of reloading them a bunch of times, this seems to be to be the biggest benefit of this.

Also it would be a bit weird if read caches are kept, where dirty caches are dropped. That seems like unexpected behaviour to me

Stebalien requested review from rvagg and ZenGround0 November 19, 2020 20:09

This was referenced Nov 23, 2020

Redundant reads and writes #68

Closed

Keep node in cache after writing to store #72

Closed

All cached nodes are put in blockstore and removed after flush #73

Closed

ZenGround0 approved these changes Nov 25, 2020

View reviewed changes

ZenGround0 mentioned this pull request Nov 25, 2020

Extend documentation around Flush and cached nodes #61

Merged

ZenGround0 mentioned this pull request Nov 25, 2020

Update module version in preparation for breaking changes #76

Merged

ZenGround0 merged commit 83589ca into master Nov 27, 2020

ZenGround0 mentioned this pull request Nov 27, 2020

Filecoin HAMT v3 Improvements filecoin-project/FIPs#38

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: implement a dirty flag #74

feat: implement a dirty flag #74

Stebalien commented Nov 19, 2020

Stebalien commented Nov 19, 2020

ZenGround0 left a comment •

edited

Loading

ZenGround0 commented Nov 25, 2020 •

edited

Loading

austinabell commented Nov 26, 2020 •

edited

Loading

feat: implement a dirty flag #74

feat: implement a dirty flag #74

Conversation

Stebalien commented Nov 19, 2020

Stebalien commented Nov 19, 2020

ZenGround0 left a comment • edited Loading

Choose a reason for hiding this comment

ZenGround0 commented Nov 25, 2020 • edited Loading

austinabell commented Nov 26, 2020 • edited Loading

ZenGround0 left a comment •

edited

Loading

ZenGround0 commented Nov 25, 2020 •

edited

Loading

austinabell commented Nov 26, 2020 •

edited

Loading